Dynamic Graph Attention for Referring Expression Comprehension

IEEE International Conference on Computer Vision (ICCV), 2019

18 September 2019

Papers citing "Dynamic Graph Attention for Referring Expression Comprehension"

50 / 120 papers shown

Intervene-All-Paths: Unified Mitigation of LVLM Hallucinations across Alignment Formats

289

21 Nov 2025

Why LVLMs Are More Prone to Hallucinations in Longer Responses: The Role of Context

23 Oct 2025

Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding

169

08 Sep 2025

PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination

229

05 Sep 2025

A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding

262

02 Aug 2025

Modality-Aware Feature Matching: A Comprehensive Review of Single- and Cross-Modality Techniques

211

30 Jul 2025

Advancing Visual Large Language Model for Multi-granular Versatile Perception

217

22 Jul 2025

ReMeREC: Relation-aware and Multi-entity Referring Expression Comprehension

146

22 Jul 2025

Referring Expression Instance Retrieval and A Strong End-to-End Baseline

290

23 Jun 2025

ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections

191

18 Jun 2025

DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual GroundingInternational Conference on Learning Representations (ICLR), 2025

302

08 May 2025

Visual Intention Grounding for Egocentric Assistants

279

18 Apr 2025

Multi-Object Grounding via Hierarchical Contrastive Siamese Transformers

Chengyi Du

Keyan Jin

234

14 Apr 2025

Referring to Any Person

927

11 Mar 2025

New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM CollaborationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025

488

27 Feb 2025

ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual GroundingComputer Vision and Pattern Recognition (CVPR), 2025

376

26 Feb 2025

A Comprehensive Survey on Composed Image Retrieval

469

19 Feb 2025

Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression ComprehensionAAAI Conference on Artificial Intelligence (AAAI), 2025

261

03 Jan 2025

Towards Visual Grounding: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

955

28 Dec 2024

AD-DINO: Attention-Dynamic DINO for Distance-Aware Embodied Reference Understanding

257

13 Nov 2024

Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive Position Correction for Visual GroundingIEEE transactions on multimedia (IEEE TMM), 2024

Huafeng Li

178

31 Oct 2024

Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial AttentionNeural Information Processing Systems (NeurIPS), 2024

Haomeng Zhang

Chiao-An Yang

Raymond A. Yeh

264

29 Oct 2024

Boosting Weakly-Supervised Referring Image Segmentation via Progressive ComprehensionNeural Information Processing Systems (NeurIPS), 2024

314

02 Oct 2024

SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal FusionNeural Information Processing Systems (NeurIPS), 2024

Wankou Yang

446

26 Sep 2024

Make Graph-based Referring Expression Comprehension Great Again through Expression-guided Dynamic Gating and RegressionIEEE transactions on multimedia (IEEE TMM), 2024

Yen-Yu Lin

258

05 Sep 2024

NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar

Eng Gee Lim

387

30 Aug 2024

ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual GroundingACM Multimedia (MM), 2024

Minghang Zheng

Jiahua Zhang

Qingchao Chen

Yuxin Peng

Yang Liu

ObjD

290

29 Aug 2024

R2G: Reasoning to Ground in 3D ScenesPattern Recognition (Pattern Recogn.), 2024

Yixuan Li

Zan Wang

Wei Liang

297

24 Aug 2024

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMsVisual Communications and Image Processing (VCIP), 2024

Shengyang Zhao

Zhibo Chen

Xin Jin

340

16 Aug 2024

LLMI3D: MLLM-based 3D Perception from a Single 2D Image

Fan Yang

Sicheng Zhao

Yanhao Zhang

Haoxiang Chen

Hui Chen

Wenbo Tang

Guiguang Ding

237

14 Aug 2024

ACTRESS: Active Retraining for Semi-supervised Visual Grounding

Weitai Kang

Mengxue Qu

Yunchao Wei

Yan Yan

326

03 Jul 2024

Visual Grounding with Attention-Driven Constraint Balancing

Weitai Kang

278

03 Jul 2024

SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding

Weitai Kang

Gaowen Liu

Mubarak Shah

Yan Yan

ObjD

408

03 Jul 2024

HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding

Linhui Xiao

319

20 Apr 2024

Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation

Qiyuan Dai

Sibei Yang

213

18 Apr 2024

WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar

Shanliang Yao

...

367

19 Mar 2024

Bridging Modality Gap for Visual Grounding with Effecitve Cross-modal DistillationChinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023

275

29 Dec 2023

Cycle-Consistency Learning for Captioning and Grounding

231

23 Dec 2023

Context Disentangling and Prototype Inheriting for Robust Visual Grounding

Wei Tang

260

19 Dec 2023

Mono3DVG: 3D Visual Grounding in Monocular ImagesAAAI Conference on Artificial Intelligence (AAAI), 2023

Yangfan Zhan

Yuan. Yuan

Zhitong Xiong

MDE

266

13 Dec 2023

Continual Referring Expression Comprehension via Dual Modular MemorizationIEEE Transactions on Image Processing (IEEE TIP), 2022

Lianli Gao

Jingkuan Song

172

25 Nov 2023

Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language Models

429

21 Nov 2023

RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open EnvironmentsNeural Information Processing Systems (NeurIPS), 2023

Jingkuan Song

235

26 Oct 2023

Video Referring Expression Comprehension via Transformer with Content-conditioned Query

263

25 Oct 2023

Towards Complex-query Referring Image Segmentation: A Novel Benchmark

Wei Ji

Li Li

Roger Zimmermann

182

29 Sep 2023

Temporal Collection and Distribution for Referring Video Object SegmentationIEEE International Conference on Computer Vision (ICCV), 2023

181

07 Sep 2023

CoTDet: Affordance Knowledge Prompting for Task Driven Object DetectionIEEE International Conference on Computer Vision (ICCV), 2023

Jingyi Yu

215

03 Sep 2023

Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference UnderstandingEuropean Conference on Computer Vision (ECCV), 2023

Cheng Shi

Sibei Yang

LRM

162

03 Sep 2023

Contrastive Grouping with Transformer for Referring Image SegmentationComputer Vision and Pattern Recognition (CVPR), 2023

314

02 Sep 2023

Grounded Image Text Matching with Mismatched Relation ReasoningIEEE International Conference on Computer Vision (ICCV), 2023

243

02 Aug 2023