Open-Vocabulary DETR with Conditional Matching
European Conference on Computer Vision (ECCV), 2022
22 March 2022
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
ObjD, VLM

Papers citing "Open-Vocabulary DETR with Conditional Matching"

50 / 184 papers shown
VaMP: Variational Multi-Modal Prompt Learning for Vision-Language Models
Silin Cheng
Kai Han
MLLM, VPVLM, VLM
256
1
0
27 Nov 2025
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities
Tooba Tehreem Sheikh
Jean Lahoud
Rao Muhammad Anwer
Fahad Shahbaz Khan
Salman Khan
Hisham Cholakkal
ObjD, MedIm, VLM
318
0
0
25 Nov 2025
Can a Second-View Image Be a Language? Geometric and Semantic Cross-Modal Reasoning for X-ray Prohibited Item Detection
Chuang Peng
Renshuai Tao
Zhongwei Ren
Xianglong Liu
Yunchao Wei
116
0
0
23 Nov 2025
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Lin Li
Chuhan Zhang
Dong Zhang
Chong Sun
Chen Li
L. Chen
152
0
0
08 Nov 2025
ZING-3D: Zero-shot Incremental 3D Scene Graphs via Vision-Language Models
Pranav Saxena
Jimmy Chiun
VLM
112
0
0
24 Oct 2025
On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration
Yehonathan Refael
Amit Aides
Aviad Barzilai
George Leifman
Genady Beryozkin
Vered Silverman
Bolous Jaber
Tomer Shekel
ObjD
476
0
0
20 Oct 2025
Towards 3D Objectness Learning in an Open World
Taichi Liu
Zhenyu Wang
Ruofeng Liu
Guang Wang
Desheng Zhang
3DPC, VLM
137
0
0
20 Oct 2025
CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection
Hojun Choi
Youngsun Lim
Jaeyo Shin
Hyunjung Shim
ObjD, LRM, VLM
363
1
0
16 Oct 2025
Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding
Weikai Huang
Jieyu Zhang
Taoyang Jia
Chenhao Zheng
Ziqi Gao
J. S. Park
Winson Han
Ranjay Krishna
219
0
0
10 Oct 2025
C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection
Siheng Wang
Zhengdao Li
Yanshu Li
Canran Xiao
Haibo Zhan
...
Zhikang Dong
Jifeng Shen
Junhao Dong
Qiang Sun
Piotr Koniusz
ObjD, VLM
257
6
0
27 Sep 2025
Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection
Wenhuan Lu
Xinyue Song
Wenjun Ke
Zhizhi Yu
Wenhao Yang
Jianguo Wei
ObjD
91
0
0
20 Sep 2025
When Language Model Guides Vision: Grounding DINO for Cattle Muzzle Detection
Rabin Dulal
Lihong Zheng
M. A. Kabir
112
0
0
08 Sep 2025
AttriPrompt: Dynamic Prompt Composition Learning for CLIP
Qiqi Zhan
Shiwei Li
Qingjie Liu
Yunhong Wang
VLM
164
1
0
07 Sep 2025
Object Detection with Multimodal Large Vision-Language Models: An In-depth Review
Information Fusion (Inf. Fusion), 2025
Ranjan Sapkota
Manoj Karkee
ObjD, VLM
290
15
0
25 Aug 2025
Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes
Xinhao Xiang
Kuan-Chuan Peng
Suhas Lohit
Michael Jeffrey Jones
Jiawei Zhang
3DPC
154
1
0
22 Aug 2025
Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception
Junjie Wang
Keyu Chen
Yulin Li
Bin Chen
Hengshuang Zhao
Xiaojuan Qi
Zhuotao Tian
CLIP, VLM
130
1
0
15 Aug 2025
DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition
Haijing Liu
Tao Pu
Hefeng Wu
Keze Wang
Guanbin Li
ObjD, VLM
138
1
0
07 Aug 2025
Dual-Stream Attention with Multi-Modal Queries for Object Detection in Transportation Applications
Noreen Anwar
Guillaume-Alexandre Bilodeau
W. Bouachir
94
0
0
06 Aug 2025
NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding
Zelin Peng
Yichen Zhao
Yu Huang
Piao Yang
Feilong Tang
Zhengqin Xu
Yunbo Wang
Wei Shen
VLM
132
0
0
06 Aug 2025
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
Yung-Hsu Yang
Luigi Piccinelli
Mattia Segu
Siyuan Li
Rui Huang
Yuqian Fu
Marc Pollefeys
Hermann Blum
Z. Bauer
3DPC
242
3
0
31 Jul 2025
Details Matter for Indoor Open-vocabulary 3D Instance Segmentation
Sanghun Jung
Jingjing Zheng
Ke Zhang
Nan Qiao
Albert Y. C. Chen
...
Xiao Zeng
Hsiang-Wei Huang
Byron Boots
Min Sun
Cheng-Hao Kuo
183
3
0
30 Jul 2025
Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries
Pengfei Cai
Yan Song
Qing Gu
Nan Jiang
Haoyu Song
Ian Mcloughlin
VLM
242
1
0
22 Jul 2025
Open World Object Detection: A Survey
Yiming Li
Yi Wang
Wenqian Wang
Dan Lin
Bingbing Li
Kim-Hui Yap
ObjD
360
20
0
01 Jul 2025
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models
Xuanchi Ren
Y. Lu
Tianshi Cao
Ruiyuan Gao
S. Huang
...
Jun Gao
Laura Leal-Taixe
Mike Chen
Sanja Fidler
Huan Ling
VGen
363
0
0
10 Jun 2025
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models
Chenbin Pan
Wenbin He
Zhengzhong Tu
Liu Ren
LRM, VLM
496
2
0
29 May 2025
Open-Det: An Efficient Learning Framework for Open-Ended Detection
Guiping Cao
Tao Wang
Wenjian Huang
X. Lan
Jianguo Zhang
Shihong Deng
ObjD, VLM
194
1
0
27 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Computer Vision and Pattern Recognition (CVPR), 2025
Junjie Wang
Bin Chen
Yulin Li
Bin Kang
Yulin Chen
Zhuotao Tian
VLM
307
5
0
07 May 2025
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
Boyuan Meng
Xinming Zhang
Peilin Li
Zhe Wu
Yiming Li
Wenkai Zhao
B. Yu
Hui-Liang Shen
ViT
680
0
0
02 May 2025
VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning
Run Luo
Renke Shan
Longze Chen
Ziqiang Liu
Lu Wang
Min Yang
Xiaobo Xia
MLLM, VLM
512
3
0
28 Apr 2025
Decoupled Global-Local Alignment for Improving Compositional Understanding
Xiaoxing Hu
Kaicheng Yang
Chao Guo
Haoran Xu
Ziyong Feng
Longji Xu
VLM
701
7
0
23 Apr 2025
Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation
Yongchao Feng
Yajie Liu
Shuai Yang
Wenrui Cai
Jing Zhang
...
Jiahui Lv
Ziqiang Liu
Tengyuan Shi
Qingjie Liu
Longji Xu
MLLM, VLM
318
9
0
13 Apr 2025
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
International Conference on Learning Representations (ICLR), 2025
Congpei Qiu
Yanhao Wu
Wei Ke
Xiuxiu Bai
Tong Zhang
VLM
307
6
0
03 Apr 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
353
2
0
26 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
International Conference on Learning Representations (ICLR), 2025
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjD, VLM
1.0K
4
0
14 Mar 2025
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer
International Conference on Learning Representations (ICLR), 2025
Jinyang Li
En Yu
Sijia Chen
Wenbing Tao
432
6
0
13 Mar 2025
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
ObjD, VLM
246
3
0
13 Mar 2025
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection
Chiara Cappellino
Gianluca Mancusi
Matteo Mosconi
Angelo Porrello
Simone Calderara
Rita Cucchiara
ObjD, VLM
558
1
0
12 Mar 2025
Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking
Yunhao Li
Yifan Jiao
Dan Meng
Heng Fan
L. Zhang
259
0
0
11 Mar 2025
YOLOE: Real-Time Seeing Anything
Ao Wang
Lihao Liu
Hui Chen
Zijia Lin
Jiawei Han
Guiguang Ding
VLM, ObjD
542
33
0
10 Mar 2025
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Adrian Chow
Evelien Riddell
Yimu Wang
Sean Sedwards
Krzysztof Czarnecki
3DPC
188
1
0
09 Mar 2025
RTGen: Real-Time Generative Detection Transformer
Chi Ruan
Jiying Zhao
Wenhu Chen
ObjD, VLM
412
0
0
28 Feb 2025
InPK: Infusing Prior Knowledge into Prompt for Vision-Language Models
Shuchang Zhou
Jiwei Wei
Shiyuan He
Yuyang Zhou
Chaoning Zhang
Jie Zou
Ning Xie
Yang Yang
VLM, VPVLM
411
0
0
27 Feb 2025
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
Xiangyu Gao
Yu Dai
Benliu Qiu
Hongliang Li
Heqian Qiu
Hongliang Li
ObjD, VLM
1.0K
0
0
28 Jan 2025
Enhancing Novel Object Detection via Cooperative Foundational Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Rohit K Bharadwaj
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
ObjD, VLM
857
2
0
17 Jan 2025
Leveraging Content and Context Cues for Low-Light Image Enhancement
IEEE transactions on multimedia (IEEE TMM), 2024
Igor Morawski
Kai He
Shusil Dangi
Winston H. Hsu
392
4
0
10 Dec 2024
Leverage Task Context for Object Affordance Ranking
Haojie Huang
Hongchen Luo
Wei-dong Zhai
Yang Cao
Zheng-jun Zha
299
0
0
25 Nov 2024
Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Wentao Bao
Keqin Li
Yuxiao Chen
Deep Patel
Martin Renqiang Min
Yu Kong
VLM, ObjD
284
7
0
17 Nov 2024
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
Yuheng Shi
Minjing Dong
Chang Xu
VLM
303
10
0
14 Nov 2024
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation
Yan Li
Weiwei Guo
Songyuan Li
Ning Liao
Shaofeng Zhang
Yi Yu
Wenxian Yu
Junchi Yan
ObjD
233
1
0
04 Nov 2024
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images
Neural Information Processing Systems (NeurIPS), 2024
Timing Yang
Yuanliang Ju
Li Yi
3DPC
316
14
0
31 Oct 2024