Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1909.08164
Cited By
Dynamic Graph Attention for Referring Expression Comprehension
IEEE International Conference on Computer Vision (ICCV), 2019
18 September 2019
Sibei Yang
Guanbin Li
Yizhou Yu
OCL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Dynamic Graph Attention for Referring Expression Comprehension"
50 / 120 papers shown
Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats
Jiaye Qian
Ge Zheng
Yuchen Zhu
Sibei Yang
MLLM
289
1
0
21 Nov 2025
Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context
Ge Zheng
Jiaye Qian
Jiajin Tang
Sibei Yang
94
2
0
23 Oct 2025
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
Jiangnan Xie
Xiaolong Zheng
Liang Zheng
ObjD
169
0
0
08 Sep 2025
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
Ming Dai
Wenxuan Cheng
Jiedong Zhuang
Jiang-Jiang Liu
Hongshen Zhao
Zhenhua Feng
Wankou Yang
ObjD
229
3
0
05 Sep 2025
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding
Zhan Shi
Song Wang
Junbo Chen
Jianke Zhu
262
0
0
02 Aug 2025
Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques
Weide Liu
Wei Zhou
Jun Liu
Ping Hu
Jun Cheng
Jungong Han
Weisi Lin
3DV
211
3
0
30 Jul 2025
Advancing Visual Large Language Model for Multi-granular Versatile Perception
Wentao Xiang
Haoxian Tan
Cong Wei
Yujie Zhong
Dengjie Li
Yujiu Yang
VLM
217
2
0
22 Jul 2025
ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension
Yizhi Hu
Zezhao Tian
Xingqun Qi
Chen Su
Bingkun Yang
Junhui Yin
Muyi Sun
Man Zhang
Zhenan Sun
ObjD
146
0
0
22 Jul 2025
Referring Expression Instance Retrieval and A Strong End-to-End Baseline
Xiangzhao Hao
Kuan Zhu
Hongyu Guo
Haiyun Guo
Ning Jiang
Quan Lu
Ming Tang
Jinqiao Wang
290
1
0
23 Jun 2025
ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections
Ziling Huang
Yidan Zhang
Shiníchi Satoh
ObjD
191
1
0
18 Jun 2025
DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding
International Conference on Learning Representations (ICLR), 2025
Henry Zheng
Hao Shi
Qihang Peng
Yong Xien Chng
Rui Huang
Yepeng Weng
Peng Wang
Gao Huang
302
7
0
08 May 2025
Visual Intention Grounding for Egocentric Assistants
Pengzhan Sun
Junbin Xiao
Tze Ho Elden Tse
Yicong Li
Arjun Akula
Angela Yao
EgoV
279
1
0
18 Apr 2025
Multi-Object Grounding via Hierarchical Contrastive Siamese Transformers
Chengyi Du
Keyan Jin
234
0
0
14 Apr 2025
Referring to Any Person
Qing Jiang
Lin Wu
Zhaoyang Zeng
Tianhe Ren
Yuda Xiong
Yihao Chen
Qin Liu
Lei Zhang
927
12
0
11 Mar 2025
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
X. J. Yang
Jing Liu
Peng Wang
Guoqing Wang
Yue Yang
Mengqi Li
ObjD
488
4
0
27 Feb 2025
ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2025
Qihang Peng
Henry Zheng
Gao Huang
3DPC
376
3
0
26 Feb 2025
A Comprehensive Survey on Composed Image Retrieval
Xuemeng Song
Haoqiang Lin
Haokun Wen
Bohan Hou
Mingzhu Xu
Liqiang Nie
469
7
0
19 Feb 2025
Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
AAAI Conference on Artificial Intelligence (AAAI), 2025
Yaxian Wang
Henghui Ding
Shuting He
Xudong Jiang
Bifan Wei
Jun Liu
ObjD
261
8
0
03 Jan 2025
Towards Visual Grounding: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
955
31
0
28 Dec 2024
AD-DINO: Attention-Dynamic DINO for Distance-Aware Embodied Reference Understanding
Hao Guo
Wei Fan
Baichun Wei
Jianfei Zhu
Jin Tian
Chunzhi Yi
Feng Jiang
257
0
0
13 Nov 2024
Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive Position Correction for Visual Grounding
IEEE transactions on multimedia (IEEE TMM), 2024
Minghong Xie
Ming Wang
Huafeng Li
Yafei Zhang
Dapeng Tao
Z. Yu
ObjD
178
6
0
31 Oct 2024
Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial Attention
Neural Information Processing Systems (NeurIPS), 2024
Haomeng Zhang
Chiao-An Yang
Raymond A. Yeh
264
5
0
29 Oct 2024
Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension
Neural Information Processing Systems (NeurIPS), 2024
Zaiquan Yang
Yuhao Liu
Jiaying Lin
Gerhard Hancke
Rynson W. H. Lau
314
8
0
02 Oct 2024
SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
Neural Information Processing Systems (NeurIPS), 2024
Ming Dai
Lingfeng Yang
Yihao Xu
Zhenhua Feng
Wankou Yang
ObjD
446
39
0
26 Sep 2024
Make Graph-based Referring Expression Comprehension Great Again through Expression-guided Dynamic Gating and Regression
IEEE transactions on multimedia (IEEE TMM), 2024
Jingcheng Ke
Dele Wang
Jun-Cheng Chen
I-Hong Jhuo
Chia-Wen Lin
Yen-Yu Lin
258
1
0
05 Sep 2024
NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar
Runwei Guan
Tao Huang
Liye Jia
Haocheng Zhao
Shanliang Yao
Xiaohui Zhu
Ka Lok Man
Eng Gee Lim
Jeremy S. Smith
Yutao Yue
387
8
0
30 Aug 2024
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding
ACM Multimedia (MM), 2024
Minghang Zheng
Jiahua Zhang
Qingchao Chen
Yuxin Peng
Yang Liu
ObjD
290
5
0
29 Aug 2024
R2G: Reasoning to Ground in 3D Scenes
Pattern Recognition (Pattern Recogn.), 2024
Yixuan Li
Zan Wang
Wei Liang
297
2
0
24 Aug 2024
Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs
Visual Communications and Image Processing (VCIP), 2024
Jinming Liu
Yuntao Wei
Junyan Lin
Shengyang Zhao
Heming Sun
Zhibo Chen
Wenjun Zeng
Xin Jin
340
4
0
16 Aug 2024
LLMI3D: MLLM-based 3D Perception from a Single 2D Image
Fan Yang
Sicheng Zhao
Yanhao Zhang
Haoxiang Chen
Hui Chen
Wenbo Tang
Guiguang Ding
237
1
0
14 Aug 2024
ACTRESS: Active Retraining for Semi-supervised Visual Grounding
Weitai Kang
Mengxue Qu
Yunchao Wei
Yan Yan
326
8
0
03 Jul 2024
Visual Grounding with Attention-Driven Constraint Balancing
Weitai Kang
Luowei Zhou
Junyi Wu
Changchang Sun
Yan Yan
278
10
0
03 Jul 2024
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
Weitai Kang
Gaowen Liu
Mubarak Shah
Yan Yan
ObjD
408
19
0
03 Jul 2024
HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding
Linhui Xiao
Xiaoshan Yang
Fang Peng
Yaowei Wang
Changsheng Xu
ObjD
319
33
0
20 Apr 2024
Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
Qiyuan Dai
Sibei Yang
213
25
0
18 Apr 2024
WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar
Runwei Guan
Liye Jia
Fengyufan Yang
Shanliang Yao
Erick Purwanto
...
Eng Gee Lim
Jeremy S. Smith
Ka Lok Man
Xuming Hu
Yutao Yue
367
18
0
19 Mar 2024
Bridging Modality Gap for Visual Grounding with Effecitve Cross-modal Distillation
Chinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023
Jiaxi Wang
Wenhui Hu
Xueyang Liu
Beihu Wu
Yuting Qiu
Yingying Cai
275
1
0
29 Dec 2023
Cycle-Consistency Learning for Captioning and Grounding
Ning Wang
Jiajun Deng
Mingbo Jia
ObjD
231
13
0
23 Dec 2023
Context Disentangling and Prototype Inheriting for Robust Visual Grounding
Wei Tang
Liang Li
Xuejing Liu
Lu Jin
Jinhui Tang
Zechao Li
260
41
0
19 Dec 2023
Mono3DVG: 3D Visual Grounding in Monocular Images
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yangfan Zhan
Yuan. Yuan
Zhitong Xiong
MDE
266
33
0
13 Dec 2023
Continual Referring Expression Comprehension via Dual Modular Memorization
IEEE Transactions on Image Processing (IEEE TIP), 2022
Hengtao Shen
Cheng Chen
Peng Wang
Lianli Gao
Ming Wang
Jingkuan Song
ObjD
172
5
0
25 Nov 2023
Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language Models
Xiaoyu Yang
Lijian Xu
Hao Sun
Jiaming Song
Shaoting Zhang
ObjD
429
11
0
21 Nov 2023
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments
Neural Information Processing Systems (NeurIPS), 2023
Mengxue Qu
Yu-Huan Wu
Wu Liu
Xiaodan Liang
Jingkuan Song
Yao-Min Zhao
Yunchao Wei
235
19
0
26 Oct 2023
Video Referring Expression Comprehension via Transformer with Content-conditioned Query
Jiang Ji
Meng Cao
Tengtao Song
Long Chen
Yi Wang
Yuexian Zou
263
6
0
25 Oct 2023
Towards Complex-query Referring Image Segmentation: A Novel Benchmark
Wei Ji
Li Li
Marco Pleines
Xiangyan Liu
Xu Yang
Juncheng Billy Li
Roger Zimmermann
182
12
0
29 Sep 2023
Temporal Collection and Distribution for Referring Video Object Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Jiajin Tang
Ge Zheng
Sibei Yang
VOS
181
41
0
07 Sep 2023
CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Jiajin Tang
Ge Zheng
Jingyi Yu
Sibei Yang
ObjD
215
39
0
03 Sep 2023
Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding
European Conference on Computer Vision (ECCV), 2023
Cheng Shi
Sibei Yang
LRM
162
12
0
03 Sep 2023
Contrastive Grouping with Transformer for Referring Image Segmentation
Computer Vision and Pattern Recognition (CVPR), 2023
Jiajin Tang
Ge Zheng
Cheng Shi
Sibei Yang
ViT
314
57
0
02 Sep 2023
Grounded Image Text Matching with Mismatched Relation Reasoning
IEEE International Conference on Computer Vision (ICCV), 2023
Yu Wu
Yan-Tao Wei
Haozhe Jasper Wang
Yongfei Liu
Sibei Yang
Xuming He
243
12
0
02 Aug 2023
1
2
3
Next