2203.14940
Cited By
Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model
Computer Vision and Pattern Recognition (CVPR), 2022
28 March 2022
Yu Du
Fangyun Wei
Zihe Zhang
Miaojing Shi
Yue Gao
Guoqi Li
VPVLM
VLM
Github (181★)
Papers citing "Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model"
28 / 278 papers shown
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
321
22
0
03 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
266
32
0
02 Jan 2023
Learning to Detect and Segment for Open Vocabulary Object Detection
Computer Vision and Pattern Recognition (CVPR), 2022
Tao Wang
Nan Li
VLM
ObjD
238
31
0
23 Dec 2022
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion
International Conference on Machine Learning (ICML), 2022
Hanqing Zhao
Dianmo Sheng
Jianmin Bao
Dongdong Chen
Dong Chen
...
Ce Liu
Wenbo Zhou
Qi Chu
Weiming Zhang
Neng H. Yu
VLM
DiffM
205
59
0
07 Dec 2022
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding
Computer Vision and Pattern Recognition (CVPR), 2022
Runyu Ding
Jihan Yang
Chuhui Xue
Wenqing Zhang
Song Bai
Xiaojuan Qi
VLM
175
198
0
29 Nov 2022
SuS-X: Training-Free Name-Only Transfer of Vision-Language Models
IEEE International Conference on Computer Vision (ICCV), 2022
Vishaal Udandarao
Ankush Gupta
Samuel Albanie
VLM
MLLM
399
140
0
28 Nov 2022
Learning Object-Language Alignments for Open-Vocabulary Object Detection
International Conference on Learning Representations (ICLR), 2022
Chuang Lin
Pei Sun
Yi Jiang
Ping Luo
Zhuang Li
Gholamreza Haffari
Zehuan Yuan
Jianfei Cai
VLM
ObjD
169
115
0
27 Nov 2022
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
Computer Vision and Pattern Recognition (CVPR), 2022
Siteng Huang
Biao Gong
Yulin Pan
Jianwen Jiang
Yiliang Lv
Yuyuan Li
Xuetao Zhang
VLM
VPVLM
238
59
0
23 Nov 2022
One-Time Model Adaptation to Heterogeneous Clients: An Intra-Client and Inter-Image Attention Design
Yikai Yan
Chaoyue Niu
Fan Wu
Qinya Li
Shaojie Tang
Chengfei Lyu
Guihai Chen
144
0
0
11 Nov 2022
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models
Cheng Ma
Yang Liu
Jiankang Deng
Lingxi Xie
Weiming Dong
Changsheng Xu
VLM
VPVLM
214
59
0
04 Nov 2022
FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization
Junyan Wang
Yi Zhang
Jitao Sang
FaML
VLM
260
26
0
26 Oct 2022
Unified Vision and Language Prompt Learning
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
VLM
VPVLM
255
190
0
13 Oct 2022
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
Computer Vision and Pattern Recognition (CVPR), 2022
Feng Liang
Bichen Wu
Xiaoliang Dai
Kunpeng Li
Yinan Zhao
Hang Zhang
Peizhao Zhang
Peter Vajda
Diana Marculescu
CLIP
VLM
495
602
0
09 Oct 2022
Bayesian Prompt Learning for Image-Language Model Generalization
IEEE International Conference on Computer Vision (ICCV), 2022
Mohammad Mahdi Derakhshani
Enrique Sanchez
Adrian Bulat
Victor G. Turrisi da Costa
Cees G. M. Snoek
Georgios Tzimiropoulos
Brais Martínez
VPVLM
VLM
393
59
0
05 Oct 2022
F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models
Weicheng Kuo
Huayu Chen
Xiuye Gu
A. Piergiovanni
A. Angelova
MLLM
VLM
ObjD
379
171
0
30 Sep 2022
REST: REtrieve & Self-Train for generative action recognition
Adrian Bulat
Enrique Sanchez
Brais Martínez
Georgios Tzimiropoulos
VLM
234
4
0
29 Sep 2022
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention
AAAI Conference on Artificial Intelligence (AAAI), 2022
Ziyu Guo
Renrui Zhang
Longtian Qiu
Xianzheng Ma
Xupeng Miao
Xuming He
Tengjiao Wang
VLM
AAML
203
163
0
28 Sep 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Neural Information Processing Systems (NeurIPS), 2022
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIP
VLM
295
213
0
20 Sep 2022
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models
Neural Information Processing Systems (NeurIPS), 2022
Manli Shu
Weili Nie
De-An Huang
Zhiding Yu
Tom Goldstein
Anima Anandkumar
Chaowei Xiao
VLM
VPVLM
480
430
0
15 Sep 2022
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network
IET Computer Vision (ICV), 2022
Tiancheng Zhao
Peng Liu
Kyusong Lee
VLM
MLLM
ObjD
130
14
0
10 Sep 2022
Prompt Tuning with Soft Context Sharing for Vision-Language Models
Neurocomputing (Neurocomputing), 2022
Kun Ding
Ying Wang
Pengzhang Liu
Qiang Yu
Hao Zhang
Shiming Xiang
Chunhong Pan
VPVLM
VLM
250
20
0
29 Aug 2022
Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features
Shichao Xu
Yikang Li
Jenhao Hsiao
C. Ho
Zhuang Qi
270
11
0
19 Aug 2022
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
Neural Information Processing Systems (NeurIPS), 2022
H. Rasheed
Muhammad Maaz
Muhammad Uzair Khattak
Salman Khan
Fahad Shahbaz Khan
ObjD
VLM
297
182
0
07 Jul 2022
Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer
AAAI Conference on Artificial Intelligence (AAAI), 2022
Su He
Taian Guo
Tao Dai
Ruizhi Qiao
Bo Ren
Shutao Xia
VLM
262
65
0
05 Jul 2022
Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization
Peixian Chen
Kekai Sheng
Mengdan Zhang
Mingbao Lin
Chunjiang Ge
Shaohui Lin
Bo Ren
Ke Li
VLM
ObjD
309
31
0
22 Jun 2022
Unsupervised Prompt Learning for Vision-Language Models
Hao Huang
Jack Chu
Fangyun Wei
VPVLM
MLLM
VLM
283
156
0
07 Apr 2022
Open-Vocabulary DETR with Conditional Matching
European Conference on Computer Vision (ECCV), 2022
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
ObjD
VLM
328
258
0
22 Mar 2022
Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax
Yu Li
Tao Wang
Bingyi Kang
Sheng Tang
Chunfeng Wang
Jintao Li
Jiashi Feng
325
289
0
18 Jun 2020