ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.11876
  4. Cited By
Open-Vocabulary DETR with Conditional Matching
v1v2 (latest)

Open-Vocabulary DETR with Conditional Matching

European Conference on Computer Vision (ECCV), 2022
22 March 2022
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
    ObjDVLM
ArXiv (abs)PDFHTML

Papers citing "Open-Vocabulary DETR with Conditional Matching"

50 / 182 papers shown
Title
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities
Tooba Tehreem Sheikh
Jean Lahoud
Rao Muhammad Anwer
Fahad Shahbaz Khan
Salman Khan
Hisham Cholakkal
ObjDMedImVLM
211
0
0
25 Nov 2025
Can a Second-View Image Be a Language? Geometric and Semantic Cross-Modal Reasoning for X-ray Prohibited Item Detection
Can a Second-View Image Be a Language? Geometric and Semantic Cross-Modal Reasoning for X-ray Prohibited Item Detection
Chuang Peng
Renshuai Tao
Zhongwei Ren
Xianglong Liu
Yunchao Wei
92
0
0
23 Nov 2025
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
Lin Li
Chuhan Zhang
Dong Zhang
Chong Sun
Chen Li
L. Chen
108
0
0
08 Nov 2025
ZING-3D: Zero-shot Incremental 3D Scene Graphs via Vision-Language Models
ZING-3D: Zero-shot Incremental 3D Scene Graphs via Vision-Language Models
Pranav Saxena
Jimmy Chiun
VLM
88
0
0
24 Oct 2025
Towards 3D Objectness Learning in an Open World
Towards 3D Objectness Learning in an Open World
Taichi Liu
Zhenyu Wang
Ruofeng Liu
Guang Wang
Desheng Zhang
3DPCVLM
85
0
0
20 Oct 2025
On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration
On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration
Yehonathan Refael
Amit Aides
Aviad Barzilai
George Leifman
Genady Beryozkin
Vered Silverman
Bolous Jaber
Tomer Shekel
ObjD
321
0
0
20 Oct 2025
CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection
CoT-PL: Visual Chain-of-Thought Reasoning Meets Pseudo-Labeling for Open-Vocabulary Object Detection
Hojun Choi
Youngsun Lim
Jaeyo Shin
Hyunjung Shim
ObjDLRMVLM
181
1
0
16 Oct 2025
Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding
Synthetic Object Compositions for Scalable and Accurate Learning in Detection, Segmentation, and Grounding
Weikai Huang
Jieyu Zhang
Taoyang Jia
Chenhao Zheng
Ziqi Gao
J. S. Park
Winson Han
Ranjay Krishna
145
0
0
10 Oct 2025
C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection
C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection
Siheng Wang
Zhengdao Li
Yanshu Li
Canran Xiao
Haibo Zhan
...
Zhikang Dong
Jifeng Shen
Junhao Dong
Qiang Sun
Piotr Koniusz
ObjDVLM
164
6
0
27 Sep 2025
Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection
Speech-to-See: End-to-End Speech-Driven Open-Set Object Detection
Wenhuan Lu
Xinyue Song
Wenjun Ke
Zhizhi Yu
Wenhao Yang
Jianguo Wei
ObjD
76
0
0
20 Sep 2025
When Language Model Guides Vision: Grounding DINO for Cattle Muzzle Detection
When Language Model Guides Vision: Grounding DINO for Cattle Muzzle Detection
Rabin Dulal
Lihong Zheng
M. A. Kabir
60
0
0
08 Sep 2025
AttriPrompt: Dynamic Prompt Composition Learning for CLIP
AttriPrompt: Dynamic Prompt Composition Learning for CLIP
Qiqi Zhan
Shiwei Li
Qingjie Liu
Yunhong Wang
VLM
100
0
0
07 Sep 2025
Object Detection with Multimodal Large Vision-Language Models: An In-depth Review
Object Detection with Multimodal Large Vision-Language Models: An In-depth ReviewInformation Fusion (Inf. Fusion), 2025
Ranjan Sapkota
Manoj Karkee
ObjDVLM
243
11
0
25 Aug 2025
Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes
Towards Open-Vocabulary Multimodal 3D Object Detection with Attributes
Xinhao Xiang
Kuan-Chuan Peng
Suhas Lohit
Michael Jeffrey Jones
Jiawei Zhang
3DPC
110
0
0
22 Aug 2025
Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception
Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception
Junjie Wang
Keyu Chen
Yulin Li
Bin Chen
Hengshuang Zhao
Xiaojuan Qi
Zhuotao Tian
CLIPVLM
98
1
0
15 Aug 2025
DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition
DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition
Haijing Liu
Tao Pu
Hefeng Wu
Keze Wang
Guanbin Li
ObjDVLM
94
0
0
07 Aug 2025
NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding
NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding
Zelin Peng
Yichen Zhao
Yu Huang
Piao Yang
Feilong Tang
Zhengqin Xu
Yunbo Wang
Wei Shen
VLM
88
0
0
06 Aug 2025
Dual-Stream Attention with Multi-Modal Queries for Object Detection in Transportation Applications
Dual-Stream Attention with Multi-Modal Queries for Object Detection in Transportation Applications
Noreen Anwar
Guillaume-Alexandre Bilodeau
W. Bouachir
65
0
0
06 Aug 2025
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
Yung-Hsu Yang
Luigi Piccinelli
Mattia Segu
Siyuan Li
Rui Huang
Yuqian Fu
Marc Pollefeys
Hermann Blum
Z. Bauer
3DPC
178
3
0
31 Jul 2025
Details Matter for Indoor Open-vocabulary 3D Instance Segmentation
Details Matter for Indoor Open-vocabulary 3D Instance Segmentation
Sanghun Jung
Jingjing Zheng
Ke Zhang
Nan Qiao
Albert Y. C. Chen
...
Xiao Zeng
Hsiang-Wei Huang
Byron Boots
Min Sun
Cheng-Hao Kuo
107
1
0
30 Jul 2025
Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries
Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries
Pengfei Cai
Yan Song
Qing Gu
Nan Jiang
Haoyu Song
Ian Mcloughlin
VLM
186
0
0
22 Jul 2025
Open World Object Detection: A Survey
Open World Object Detection: A Survey
Yiming Li
Yi Wang
Wenqian Wang
Dan Lin
Bingbing Li
Kim-Hui Yap
ObjD
312
18
0
01 Jul 2025
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models
Xuanchi Ren
Y. Lu
Tianshi Cao
Ruiyuan Gao
S. Huang
...
Jun Gao
Laura Leal-Taixe
Mike Chen
Sanja Fidler
Huan Ling
VGen
291
17
0
10 Jun 2025
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models
DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models
Chenbin Pan
Wenbin He
Zhengzhong Tu
Liu Ren
LRMVLM
443
2
0
29 May 2025
Open-Det: An Efficient Learning Framework for Open-Ended Detection
Open-Det: An Efficient Learning Framework for Open-Ended Detection
Guiping Cao
Tao Wang
Wenjian Huang
X. Lan
Jianguo Zhang
Shihong Deng
ObjDVLM
133
1
0
27 May 2025
DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
DeCLIP: Decoupled Learning for Open-Vocabulary Dense PerceptionComputer Vision and Pattern Recognition (CVPR), 2025
Junjie Wang
Bin Chen
Yulin Li
Bin Kang
Yulin Chen
Zhuotao Tian
VLM
269
4
0
07 May 2025
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
Boyuan Meng
Xinming Zhang
Peilin Li
Zhe Wu
Yiming Li
Wenkai Zhao
B. Yu
Hui-Liang Shen
ViT
611
0
0
02 May 2025
VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning
VCM: Vision Concept Modeling Based on Implicit Contrastive Learning with Vision-Language Instruction Fine-Tuning
Run Luo
Renke Shan
Longze Chen
Ziqiang Liu
Lu Wang
Min Yang
Xiaobo Xia
MLLMVLM
446
3
0
28 Apr 2025
Decoupled Global-Local Alignment for Improving Compositional Understanding
Decoupled Global-Local Alignment for Improving Compositional Understanding
Xiaoxing Hu
Kaicheng Yang
Chao Guo
Haoran Xu
Ziyong Feng
Longji Xu
VLM
642
7
0
23 Apr 2025
Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation
Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation
Yongchao Feng
Yajie Liu
Shuai Yang
Wenrui Cai
Jing Zhang
...
Jiahui Lv
Ziqiang Liu
Tengyuan Shi
Qingjie Liu
Longji Xu
MLLMVLM
258
6
0
13 Apr 2025
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective
Refining CLIP's Spatial Awareness: A Visual-Centric PerspectiveInternational Conference on Learning Representations (ICLR), 2025
Congpei Qiu
Yanhao Wu
Wei Ke
Xiuxiu Bai
Tong Zhang
VLM
235
4
0
03 Apr 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
285
2
0
26 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object DetectionInternational Conference on Learning Representations (ICLR), 2025
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjDVLM
942
3
0
14 Mar 2025
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with TransformerInternational Conference on Learning Representations (ICLR), 2025
Jinyang Li
En Yu
Sijia Chen
Wenbing Tao
311
5
0
13 Mar 2025
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
ObjDVLM
199
3
0
13 Mar 2025
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection
Chiara Cappellino
Gianluca Mancusi
Matteo Mosconi
Angelo Porrello
Simone Calderara
Rita Cucchiara
ObjDVLM
455
0
0
12 Mar 2025
Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking
Yunhao Li
Yifan Jiao
Dan Meng
Heng Fan
L. Zhang
208
0
0
11 Mar 2025
YOLOE: Real-Time Seeing Anything
YOLOE: Real-Time Seeing Anything
Ao Wang
Lihao Liu
Hui Chen
Zijia Lin
Jiawei Han
Guiguang Ding
VLMObjD
466
30
0
10 Mar 2025
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
Adrian Chow
Evelien Riddell
Yimu Wang
Sean Sedwards
Krzysztof Czarnecki
3DPC
155
0
0
09 Mar 2025
RTGen: Real-Time Generative Detection Transformer
RTGen: Real-Time Generative Detection Transformer
Chi Ruan
Jiying Zhao
Wenhu Chen
ObjDVLM
352
0
0
28 Feb 2025
InPK: Infusing Prior Knowledge into Prompt for Vision-Language Models
InPK: Infusing Prior Knowledge into Prompt for Vision-Language Models
Shuchang Zhou
Jiwei Wei
Shiyuan He
Yuyang Zhou
Chaoning Zhang
Jie Zou
Ning Xie
Yang Yang
VLMVPVLM
351
0
0
27 Feb 2025
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
Xiangyu Gao
Yu Dai
Benliu Qiu
Hongliang Li
Heqian Qiu
Hongliang Li
ObjDVLM
907
0
0
28 Jan 2025
Enhancing Novel Object Detection via Cooperative Foundational Models
Enhancing Novel Object Detection via Cooperative Foundational ModelsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Rohit K Bharadwaj
Muzammal Naseer
Salman Khan
Fahad Shahbaz Khan
ObjDVLM
714
1
0
17 Jan 2025
Leveraging Content and Context Cues for Low-Light Image Enhancement
Leveraging Content and Context Cues for Low-Light Image EnhancementIEEE transactions on multimedia (IEEE TMM), 2024
Igor Morawski
Kai He
Shusil Dangi
Winston H. Hsu
327
3
0
10 Dec 2024
Leverage Task Context for Object Affordance Ranking
Leverage Task Context for Object Affordance Ranking
Haojie Huang
Hongchen Luo
Wei-dong Zhai
Yang Cao
Zheng-jun Zha
246
0
0
25 Nov 2024
Exploiting VLM Localizability and Semantics for Open Vocabulary Action DetectionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Wentao Bao
Keqin Li
Yuxiao Chen
Deep Patel
Martin Renqiang Min
Yu Kong
VLMObjD
248
7
0
17 Nov 2024
Harnessing Vision Foundation Models for High-Performance, Training-Free
  Open Vocabulary Segmentation
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation
Yuheng Shi
Minjing Dong
Chang Xu
VLM
269
10
0
14 Nov 2024
Exploiting Unlabeled Data with Multiple Expert Teachers for Open
  Vocabulary Aerial Object Detection and Its Orientation Adaptation
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation
Yan Li
Weiwei Guo
Songyuan Li
Ning Liao
Shaofeng Zhang
Yi Yu
Wenxian Yu
Junchi Yan
ObjD
211
1
0
04 Nov 2024
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from
  Only 2D Images
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D ImagesNeural Information Processing Systems (NeurIPS), 2024
Timing Yang
Yuanliang Ju
Li Yi
3DPC
240
12
0
31 Oct 2024
Open-Vocabulary Object Detection via Language Hierarchy
Open-Vocabulary Object Detection via Language HierarchyNeural Information Processing Systems (NeurIPS), 2024
Jiaxing Huang
Jingyi Zhang
Kai Jiang
Shijian Lu
ObjDVLM
277
3
0
27 Oct 2024
1234
Next