ResearchTrend.AI

Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models

21 May 2023
Lin Li
Jun Xiao
Guikun Chen
Jian Shao
Yueting Zhuang
Long Chen
    VLM

Papers citing "Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models"

28 / 28 papers shown
Relation-R1: Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relational Comprehension
Lin Li
Wei Chen
Jiahui Li
L. Chen
LRM
33
1
0
20 Apr 2025
PRISM-0: A Predicate-Rich Scene Graph Generation Framework for Zero-Shot Open-Vocabulary Tasks
Abdelrahman Elskhawy
Mengze Li
Nassir Navab
Benjamin Busam
VLM
46
0
0
01 Apr 2025
Dynamic Relation Inference via Verb Embeddings
Omri Suissa
Muhiim Ali
Ariana Azarbal
Hui Shen
Shekhar Pradhan
41
0
0
17 Mar 2025
SGC-Net: Stratified Granular Comparison Network for Open-Vocabulary HOI Detection
Xin Lin
Chong Shi
Zuopeng Yang
Haojin Tang
Zhili Zhou
ObjD
31
0
0
01 Mar 2025
Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation
Rohith Peddi
Saurabh
Ayush Abhay Shrivastava
Parag Singla
Vibhav Gogate
65
0
0
20 Nov 2024
Scene Graph Generation with Role-Playing Large Language Models
Guikun Chen
Jin Li
Wenguan Wang
VLM
40
5
0
20 Oct 2024
Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning
William A. Stigall
43
0
0
14 Oct 2024
Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Zaiquan Yang
Yuhao Liu
Jiaying Lin
Gerhard Hancke
Rynson W. H. Lau
26
1
0
02 Oct 2024
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
Yongqi Wang
Xinxiao Wu
Shuo Yang
Jiebo Luo
42
0
0
19 Sep 2024
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Minghan Chen
Guikun Chen
Wenguan Wang
Yi Yang
56
3
0
16 Sep 2024
What Makes a Maze Look Like a Maze?
Joy Hsu
Jiayuan Mao
J. Tenenbaum
Noah D. Goodman
Jiajun Wu
OCL
52
6
0
12 Sep 2024
Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection
Ting Lei
Shaofeng Yin
Yang Liu
VLM
25
9
0
09 Apr 2024
Towards Scene Graph Anticipation
Rohith Peddi
Saksham Singh
Saurabh
Parag Singla
Vibhav Gogate
27
3
0
07 Mar 2024
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)
Zongxin Yang
Guikun Chen
Xiaodi Li
Wenguan Wang
Yi Yang
LM&Ro
LLMAG
39
35
0
16 Jan 2024
Compositional Zero-shot Learning via Progressive Language-based Observations
Lin Li
Guikun Chen
Jun Xiao
Long Chen
19
7
0
23 Nov 2023
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge
Bowen Jiang
Zhijun Zhuang
Shreyas S. Shivakumar
Camillo J. Taylor
19
6
0
21 Nov 2023
Adaptive Visual Scene Understanding: Incremental Scene Graph Generation
Naitik Khandelwal
Xiao Liu
Mengmi Zhang
CLL
21
0
0
02 Oct 2023
Compositional Feature Augmentation for Unbiased Scene Graph Generation
Lin Li
Guikun Chen
Jun Xiao
Yi Yang
Chunping Wang
Long Chen
19
25
0
13 Aug 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Chaoyang Zhu
Long Chen
ObjD
VLM
19
32
0
18 Jul 2023
NICEST: Noisy Label Correction and Training for Robust Scene Graph Generation
Lin Li
Jun Xiao
Hanrong Shi
Hanwang Zhang
Yi Yang
W. Liu
Long Chen
13
22
0
27 Jul 2022
Fine-grained Image Captioning with CLIP Reward
Jaemin Cho
Seunghyun Yoon
Ajinkya Kale
Franck Dernoncourt
Trung Bui
Mohit Bansal
CLIP
121
76
0
26 May 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
2,712
0
24 May 2022
Semantics-Guided Contrastive Network for Zero-Shot Object detection
Caixia Yan
Xiao Chang
Minnan Luo
Huan Liu
Xiaoqin Zhang
Qinghua Zheng
ObjD
VLM
59
74
0
04 Sep 2021
Zero-Shot Scene Graph Relation Prediction through Commonsense Knowledge Integration
Xuan Kan
Hejie Cui
Carl Yang
55
39
0
11 Jul 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu
Tsung-Yi Lin
Weicheng Kuo
Yin Cui
VLM
ObjD
223
897
0
28 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
3,683
0
11 Feb 2021
What Makes Good In-Context Examples for GPT-3?
Jiachang Liu
Dinghan Shen
Yizhe Zhang
Bill Dolan
Lawrence Carin
Weizhu Chen
AAML
RALM
275
1,296
0
17 Jan 2021
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
29,632
0
16 Jan 2013