v1v2 (latest)

Open-Vocabulary DETR with Conditional Matching

European Conference on Computer Vision (ECCV), 2022

22 March 2022

Papers citing "Open-Vocabulary DETR with Conditional Matching"

50 / 184 papers shown

VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models

Silin Cheng

Kai Han

MLLM VPVLM VLM

256

27 Nov 2025

MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities

318

25 Nov 2025

Can a Second-View Image Be a Language? Geometric and Semantic Cross-Modal Reasoning for X-ray Prohibited Item Detection

116

23 Nov 2025

Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation

152

08 Nov 2025

ZING-3D: Zero-shot Incremental 3D Scene Graphs via Vision-Language Models

Pranav Saxena

Jimmy Chiun

VLM

112

24 Oct 2025

On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration

476

20 Oct 2025

Towards 3D Objectness Learning in an Open World

137

20 Oct 2025

CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection

363

16 Oct 2025

Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding

219

10 Oct 2025

C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection

...

257

27 Sep 2025

Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection

20 Sep 2025

When Language Model Guides Vision: Grounding DINO for Cattle Muzzle Detection

Rabin Dulal

Lihong Zheng

M. A. Kabir

112

08 Sep 2025

AttriPrompt: Dynamic Prompt Composition Learning for CLIP

164

07 Sep 2025

Object Detection with Multimodal Large Vision-Language Models: An In-depth ReviewInformation Fusion (Inf. Fusion), 2025

Ranjan Sapkota

Manoj Karkee

ObjD VLM

290

25 Aug 2025

Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes

Xinhao Xiang

Kuan-Chuan Peng

Suhas Lohit

Michael Jeffrey Jones

Jiawei Zhang

3DPC

154

22 Aug 2025

Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception

130

15 Aug 2025

DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition

138

07 Aug 2025

Dual-Stream Attention with Multi-Modal Queries for Object Detection in Transportation Applications

Noreen Anwar

Guillaume-Alexandre Bilodeau

W. Bouachir

06 Aug 2025

NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding

132

06 Aug 2025

3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection

242

31 Jul 2025

Details Matter for Indoor Open-vocabulary 3D Instance Segmentation

...

183

30 Jul 2025

Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries

242

22 Jul 2025

Open World Object Detection: A Survey

360

01 Jul 2025

Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models

...

363

10 Jun 2025

DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models

496

29 May 2025

Open-Det: An Efficient Learning Framework for Open-Ended Detection

194

27 May 2025

DeCLIP: Decoupled Learning for Open-Vocabulary Dense PerceptionComputer Vision and Pattern Recognition (CVPR), 2025

307

07 May 2025

CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion

680

02 May 2025

VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning

512

28 Apr 2025

Decoupled Global-Local Alignment for Improving Compositional Understanding

701

23 Apr 2025

Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation

...

318

13 Apr 2025

Refining CLIP's Spatial Awareness: A Visual-Centric PerspectiveInternational Conference on Learning Representations (ICLR), 2025

307

03 Apr 2025

GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection

353

26 Mar 2025

Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object DetectionInternational Conference on Learning Representations (ICLR), 2025

1.0K

14 Mar 2025

OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with TransformerInternational Conference on Learning Representations (ICLR), 2025

432

13 Mar 2025

A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection

246

13 Mar 2025

DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection

558

12 Mar 2025

Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking

259

11 Mar 2025

YOLOE: Real-Time Seeing Anything

542

10 Mar 2025

OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection

188

09 Mar 2025

RTGen: Real-Time Generative Detection Transformer

412

28 Feb 2025

InPK: Infusing Prior Knowledge into Prompt for Vision-Language Models

Shuchang Zhou

Jiwei Wei

Shiyuan He

Yuyang Zhou

Chaoning Zhang

Jie Zou

Ning Xie

Yang Yang

VLM VPVLM

411

27 Feb 2025

Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection

Heqian Qiu

Hongliang Li

ObjD VLM

1.0K

28 Jan 2025

Enhancing Novel Object Detection via Cooperative Foundational ModelsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

857

17 Jan 2025

Leveraging Content and Context Cues for Low-Light Image EnhancementIEEE transactions on multimedia (IEEE TMM), 2024

392

10 Dec 2024

Leverage Task Context for Object Affordance Ranking

299

25 Nov 2024

Exploiting VLM Localizability and Semantics for Open Vocabulary Action DetectionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

284

17 Nov 2024

Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation

303

14 Nov 2024

Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation

233

04 Nov 2024

ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D ImagesNeural Information Processing Systems (NeurIPS), 2024

316

31 Oct 2024