ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.11213
  4. Cited By
OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal
  Models

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

15 July 2024
Zijian Zhou
Zheng Zhu
Holger Caesar
Miaojing Shi
    VLM
ArXiv (abs)PDFHTMLHuggingFace (3 upvotes)Github (46★)

Papers citing "OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models"

8 / 8 papers shown
ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis
ScenarioCLIP: Pretrained Transferable Visual Language Models and Action-Genome Dataset for Natural Scene Analysis
Advik Sinha
Saurabh Atreya
Aashutosh A V
Sk Aziz Ali
Abhijit Das
CLIP
209
0
0
25 Nov 2025
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Lin Li
Chuhan Zhang
Dong Zhang
Chong Sun
Chen Li
L. Chen
175
0
0
08 Nov 2025
Explaining multimodal LLMs via intra-modal token interactions
Explaining multimodal LLMs via intra-modal token interactions
Jiawei Liang
Ruoyu Chen
Xianghao Jiao
Siyuan Liang
Shiming Liu
Qunli Zhang
Zheng Hu
Xiaochun Cao
LRM
220
1
0
26 Sep 2025
Designing Memory-Augmented AR Agents for Spatiotemporal Reasoning in Personalized Task Assistance
Designing Memory-Augmented AR Agents for Spatiotemporal Reasoning in Personalized Task Assistance
Dongwook Choi
Taeyoon Kwon
Dongil Yang
Hyojun Kim
Jinyoung Yeo
193
0
0
12 Aug 2025
Hallucinate, Ground, Repeat: A Framework for Generalized Visual Relationship Detection
Hallucinate, Ground, Repeat: A Framework for Generalized Visual Relationship Detection
Shanmukha Vellamcheti
Sanjoy Kundu
Sathyanarayanan N. Aakur
303
0
0
06 Jun 2025
Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces
Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor SpacesComputer Vision and Pattern Recognition (CVPR), 2025
Chenyangguang Zhang
Alexandros Delitzas
Fangjinhua Wang
Ruida Zhang
Xiangyang Ji
Marc Pollefeys
Francis Engelmann
3DV3DPC
466
38
0
24 Mar 2025
REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding
REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding
Yan Tai
Luhao Zhu
Zhiqiang Chen
Ynan Ding
Yiying Dong
Xiaohong Liu
Guodong Guo
MLLMObjD
286
0
0
10 Mar 2025
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
TextPSG: Panoptic Scene Graph Generation from Textual DescriptionsIEEE International Conference on Computer Vision (ICCV), 2023
Chengyang Zhao
Songlin Yang
Zhenfang Chen
Mingyu Ding
Chuang Gan
473
24
0
10 Oct 2023
1
Page 1 of 1