Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2203.16513
Cited By
v1
v2 (latest)
PromptDet: Towards Open-vocabulary Detection using Uncurated Images
European Conference on Computer Vision (ECCV), 2022
30 March 2022
Chengjian Feng
Yujie Zhong
Zequn Jie
Xiangxiang Chu
Haibing Ren
Xiaolin K. Wei
Weidi Xie
Lin Ma
VPVLM
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"PromptDet: Towards Open-vocabulary Detection using Uncurated Images"
50 / 115 papers shown
State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection
Jiaying Zhou
Qingchao Chen
161
0
0
22 Nov 2025
TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models
Li Zhang
Zhongxuan Han
Xiaohua Feng
Jiaming Zhang
Yuyuan Li
Linbo Jiang
Jianan Lin
Chaochao Chen
FedML
VLM
504
1
0
20 Nov 2025
NoisyGRPO: Incentivizing Multimodal CoT Reasoning via Noise Injection and Bayesian Estimation
Longtian Qiu
Shan Ning
Jiaxuan Sun
Xuming He
NoLa
OffRL
LRM
536
4
0
24 Oct 2025
On-the-Fly OVD Adaptation with FLAME: Few-shot Localization via Active Marginal-Samples Exploration
Yehonathan Refael
Amit Aides
Aviad Barzilai
George Leifman
Genady Beryozkin
Vered Silverman
Bolous Jaber
Tomer Shekel
ObjD
543
0
0
20 Oct 2025
Cluster-Aware Prompt Ensemble Learning for Few-Shot Vision-Language Model Adaptation
Pattern Recognition (Pattern Recogn.), 2025
Zhi Chen
Xin Yu
Xiaohui Tao
Yan Li
Zi Huang
VLM
236
12
0
10 Oct 2025
Cross-View Open-Vocabulary Object Detection in Aerial Imagery
Jyoti Kini
Rohit Gupta
Mubarak Shah
ObjD
VLM
276
1
0
04 Oct 2025
Adaptive Event Stream Slicing for Open-Vocabulary Event-Based Object Detection via Vision-Language Knowledge Distillation
Jinchang Zhang
Zijun Li
Jiakai Lin
Guoyu Lu
ObjD
VLM
175
4
0
01 Oct 2025
Constrained Prompt Enhancement for Improving Zero-Shot Generalization of Vision-Language Models
Xiaojie Yin
Qilong Wang
Q. Hu
VLM
221
1
0
24 Aug 2025
Towards Open World Detection: A Survey
Andrei-Stefan Bulzan
Cosmin Cernazanu-Glavan
ObjD
VLM
265
0
0
22 Aug 2025
AME: Aligned Manifold Entropy for Robust Vision-Language Distillation
Guiming Cao
Yuming Ou
AAML
VLM
226
2
0
12 Aug 2025
Prompt-Guided Relational Reasoning for Social Behavior Understanding with Vision Foundation Models
Thinesh Thiyakesan Ponbagavathi
Chengzheng Yang
Alina Roitberg
VLM
246
1
0
11 Aug 2025
ODOV: Towards Open-Domain Open-Vocabulary Object Detection
Yupeng Zhang
Ruize Han
Fangnan Zhou
Song Wang
Wei Feng
Liang Wan
ObjD
VLM
265
1
0
02 Aug 2025
Advancing Visual Large Language Model for Multi-granular Versatile Perception
Wentao Xiang
Haoxian Tan
Cong Wei
Yujie Zhong
Dengjie Li
Yujiu Yang
VLM
345
2
0
22 Jul 2025
Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline
Shiyi Mu
Zichong Gu
Hanqi Lyu
Yilin Gao
Shugong Xu
3DPC
258
0
0
12 Jul 2025
Open World Object Detection: A Survey
Yiming Li
Yi Wang
Wenqian Wang
Dan Lin
Bingbing Li
Kim-Hui Yap
ObjD
480
27
0
01 Jul 2025
EarthGPT-X: A Spatial MLLM for Multi-level Multi-Source Remote Sensing Imagery Understanding with Visual Prompting
IEEE Transactions on Geoscience and Remote Sensing (IEEE TGRS), 2025
Wei Zhang
Miaoxin Cai
Yaqian Ning
Tianze Zhang
Yin Zhuang
He Chen
He Chen
Jun Li
Xuerui Mao
493
0
0
17 Apr 2025
Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation
Yongchao Feng
Yajie Liu
Shuai Yang
Wenrui Cai
Jing Zhang
...
Jiahui Lv
Ziqiang Liu
Tengyuan Shi
Qingjie Liu
Longji Xu
MLLM
VLM
392
14
0
13 Apr 2025
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection
Xingyu Peng
Si Liu
Chen Gao
Yan Bai
Beipeng Mu
Xiaofei Wang
Huaxia Xia
470
3
0
26 Mar 2025
Squeeze Out Tokens from Sample for Finer-Grained Data Governance
Weixiong Lin
Chen Ju
Haicheng Wang
Shengchao Hu
Shuai Xiao
...
Yuheng Jiao
Mingshuai Yao
Jinsong Lan
Qingwen Liu
Ying Chen
331
3
0
18 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
International Conference on Learning Representations (ICLR), 2025
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjD
VLM
1.2K
10
0
14 Mar 2025
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
ObjD
VLM
375
8
0
13 Mar 2025
YOLO-UniOW: Efficient Universal Open-World Object Detection
Lihao Liu
Juexiao Feng
Hui Chen
Ao Wang
Lin Song
Jiawei Han
Guiguang Ding
ObjD
VLM
372
6
0
31 Dec 2024
Style-Pro: Style-Guided Prompt Learning for Generalizable Vision-Language Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Niloufar Alipour Talemi
Hossein Kashiani
Fatemeh Afghah
CLIP
VLM
382
3
0
25 Nov 2024
Active Prompt Learning with Vision-Language Model Priors
Hoyoung Kim
Seokhee Jin
Changhwan Sung
Jaechang Kim
Jungseul Ok
VLM
234
1
0
23 Nov 2024
Efficient Transfer Learning for Video-language Foundation Models
Computer Vision and Pattern Recognition (CVPR), 2024
Haoxing Chen
Zizheng Huang
Y. Hong
Yanshuo Wang
Zhongcai Lyu
Zhuoer Xu
Jun Lan
Zhangxuan Gu
VLM
456
5
0
18 Nov 2024
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation
Yan Li
Weiwei Guo
Songyuan Li
Ning Liao
Shaofeng Zhang
Yi Yu
Wenxian Yu
Junchi Yan
ObjD
310
2
0
04 Nov 2024
OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking
Neural Information Processing Systems (NeurIPS), 2024
Haiji Liang
Ruize Han
VLM
413
6
0
23 Oct 2024
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection
ACM Multimedia (MM), 2024
Zishuo Wang
Wenhao Zhou
Jinglin Xu
Yuxin Peng
ObjD
VLM
285
9
0
08 Oct 2024
Revisiting Prompt Pretraining of Vision-Language Models
Zhenyuan Chen
Lingfeng Yang
Shuo Chen
Zhaowei Chen
Jiajun Liang
Xiang Li
MLLM
VPVLM
VLM
400
5
0
10 Sep 2024
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
European Conference on Computer Vision (ECCV), 2024
Ting Lei
Shaofeng Yin
Yuxin Peng
Yang Liu
VLM
392
33
0
05 Aug 2024
A Simple Background Augmentation Method for Object Detection with Diffusion Model
European Conference on Computer Vision (ECCV), 2024
Yuhang Li
Jun Gao
Chen Chen
Yue Zhang
Jielei Zhang
DiffM
368
20
0
01 Aug 2024
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection
Kuo Wang
Lechao Cheng
Weikai Chen
Pingping Zhang
Liang Lin
Fan Zhou
Guanbin Li
VLM
ObjD
286
12
0
31 Jul 2024
EarthMarker: Visual Prompt Learning for Region-level and Point-level Remote Sensing Imagery Comprehension
Wei Zhang
Miaoxin Cai
Tong Zhang
Jun Li
Zhuang Yin
Xuerui Mao
457
3
0
18 Jul 2024
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation
Pengfei Wang
Yuxi Wang
Shuai Li
Zhaoxiang Zhang
Zhen Lei
Lei Zhang
271
15
0
18 Jul 2024
CerberusDet: Unified Multi-Task Object Detection
Irina Tolstykh
Mikhail Chernyshov
Maksim Kuprashevich
VLM
ObjD
311
0
0
17 Jul 2024
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction
Penghui Du
Yu Wang
Yifan Sun
Luting Wang
Yue Liao
Qiang Chen
Errui Ding
Yan Wang
Jingdong Wang
Si Liu
VLM
ObjD
392
19
0
16 Jul 2024
Quantized Prompt for Efficient Generalization of Vision-Language Models
Tianxiang Hao
Xiaohan Ding
Juexiao Feng
Yuhong Yang
Hui Chen
Guiguang Ding
VLM
MQ
340
9
0
15 Jul 2024
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection
Xingyu Peng
Yan Bai
Chen Gao
Lirong Yang
Fei Xia
Beipeng Mu
Xiaofei Wang
Si Liu
ObjD
282
11
0
12 Jul 2024
Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization
Jinlong Li
Zequn Jie
Elisa Ricci
Lin Ma
Andrii Zadaianchuk
VLM
377
1
0
11 Jul 2024
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rui Qian
Shuangrui Ding
Dahua Lin
OCL
289
8
0
09 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
381
30
0
05 Jul 2024
OVMR: Open-Vocabulary Recognition with Multi-Modal References
Computer Vision and Pattern Recognition (CVPR), 2024
Zehong Ma
Shiliang Zhang
Longhui Wei
Qi Tian
VLM
457
8
0
07 Jun 2024
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPC
ObjD
498
18
0
02 Jun 2024
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
Jiaming Li
Jiacheng Zhang
Jichang Li
Ge Li
Si Liu
Liang Lin
Guanbin Li
ObjD
VLM
441
38
0
01 Jun 2024
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Fangyi Chen
Han Zhang
Zhantao Yang
Hao Chen
Kai Hu
Marios Savvides
ObjD
VLM
260
10
0
30 May 2024
Open-Vocabulary SAM3D: Understand Any 3D Scene
Hanchen Tai
Qingdong He
Jiangning Zhang
Yijie Qian
Ying Tai
Xiaobin Hu
Yabiao Wang
Yong Liu
VLM
324
1
0
24 May 2024
Open-Vocabulary Spatio-Temporal Action Detection
Tao Wu
Shuqiu Ge
Jie Qin
Gangshan Wu
Limin Wang
ObjD
280
9
0
17 May 2024
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Mingxuan Liu
Tyler L. Hayes
Elisa Ricci
G. Csurka
Riccardo Volpi
ObjD
315
17
0
16 May 2024
Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?
Hari Chandana Kuchibhotla
Sai Srinivas Kancheti
Abbavaram Gowtham Reddy
Vineeth N. Balasubramanian
VLM
388
0
0
13 May 2024
Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation
Yanhao Zheng
Kai Liu
ObjD
244
5
0
12 Apr 2024
1
2
3
Next
Page 1 of 3