ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.14940
  4. Cited By
Learning to Prompt for Open-Vocabulary Object Detection with
  Vision-Language Model

Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model

Computer Vision and Pattern Recognition (CVPR), 2022
28 March 2022
Yu Du
Fangyun Wei
Zihe Zhang
Miaojing Shi
Yue Gao
Guoqi Li
    VPVLMVLM
ArXiv (abs)PDFHTMLGithub (181★)

Papers citing "Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model"

28 / 278 papers shown
Title
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance
  Segmentation
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
321
22
0
03 Jan 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open
  Vocabulary Instance Segmentation
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Jianzong Wu
Xiangtai Li
Henghui Ding
Xia Li
Guangliang Cheng
Yu Tong
Chen Change Loy
VLM
266
32
0
02 Jan 2023
Learning to Detect and Segment for Open Vocabulary Object Detection
Learning to Detect and Segment for Open Vocabulary Object DetectionComputer Vision and Pattern Recognition (CVPR), 2022
Tao Wang
Nan Li
VLMObjD
238
31
0
23 Dec 2022
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using
  CLIP and StableDiffusion
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusionInternational Conference on Machine Learning (ICML), 2022
Hanqing Zhao
Dianmo Sheng
Jianmin Bao
Dongdong Chen
Dong Chen
...
Ce Liu
Wenbo Zhou
Qi Chu
Weiming Zhang
Neng H. Yu
VLMDiffM
205
59
0
07 Dec 2022
PLA: Language-Driven Open-Vocabulary 3D Scene Understanding
PLA: Language-Driven Open-Vocabulary 3D Scene UnderstandingComputer Vision and Pattern Recognition (CVPR), 2022
Runyu Ding
Jihan Yang
Chuhui Xue
Wenqing Zhang
Song Bai
Xiaojuan Qi
VLM
175
198
0
29 Nov 2022
SuS-X: Training-Free Name-Only Transfer of Vision-Language Models
SuS-X: Training-Free Name-Only Transfer of Vision-Language ModelsIEEE International Conference on Computer Vision (ICCV), 2022
Vishaal Udandarao
Ankush Gupta
Samuel Albanie
VLMMLLM
399
140
0
28 Nov 2022
Learning Object-Language Alignments for Open-Vocabulary Object Detection
Learning Object-Language Alignments for Open-Vocabulary Object DetectionInternational Conference on Learning Representations (ICLR), 2022
Chuang Lin
Pei Sun
Yi Jiang
Ping Luo
Zhuang Li
Gholamreza Haffari
Zehuan Yuan
Jianfei Cai
VLMObjD
169
115
0
27 Nov 2022
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal RetrievalComputer Vision and Pattern Recognition (CVPR), 2022
Siteng Huang
Biao Gong
Yulin Pan
Jianwen Jiang
Yiliang Lv
Yuyuan Li
Xuetao Zhang
VLMVPVLM
238
59
0
23 Nov 2022
One-Time Model Adaptation to Heterogeneous Clients: An Intra-Client and
  Inter-Image Attention Design
One-Time Model Adaptation to Heterogeneous Clients: An Intra-Client and Inter-Image Attention Design
Yikai Yan
Chaoyue Niu
Fan Wu
Qinya Li
Shaojie Tang
Chengfei Lyu
Guihai Chen
144
0
0
11 Nov 2022
Understanding and Mitigating Overfitting in Prompt Tuning for
  Vision-Language Models
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models
Cheng Ma
Yang Liu
Jiankang Deng
Lingxi Xie
Weiming Dong
Changsheng Xu
VLMVPVLM
214
59
0
04 Nov 2022
FairCLIP: Social Bias Elimination based on Attribute Prototype Learning
  and Representation Neutralization
FairCLIP: Social Bias Elimination based on Attribute Prototype Learning and Representation Neutralization
Junyan Wang
Yi Zhang
Jitao Sang
FaMLVLM
260
26
0
26 Oct 2022
Unified Vision and Language Prompt Learning
Unified Vision and Language Prompt Learning
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
VLMVPVLM
255
190
0
13 Oct 2022
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIPComputer Vision and Pattern Recognition (CVPR), 2022
Feng Liang
Bichen Wu
Xiaoliang Dai
Kunpeng Li
Yinan Zhao
Hang Zhang
Peizhao Zhang
Peter Vajda
Diana Marculescu
CLIPVLM
495
602
0
09 Oct 2022
Bayesian Prompt Learning for Image-Language Model Generalization
Bayesian Prompt Learning for Image-Language Model GeneralizationIEEE International Conference on Computer Vision (ICCV), 2022
Mohammad Mahdi Derakhshani
Enrique Sanchez
Adrian Bulat
Victor G. Turrisi da Costa
Cees G. M. Snoek
Georgios Tzimiropoulos
Brais Martínez
VPVLMVLM
393
59
0
05 Oct 2022
F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language
  Models
F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models
Weicheng Kuo
Huayu Chen
Xiuye Gu
A. Piergiovanni
A. Angelova
MLLMVLMObjD
379
171
0
30 Sep 2022
REST: REtrieve & Self-Train for generative action recognition
REST: REtrieve & Self-Train for generative action recognition
Adrian Bulat
Enrique Sanchez
Brais Martínez
Georgios Tzimiropoulos
VLM
234
4
0
29 Sep 2022
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free AttentionAAAI Conference on Artificial Intelligence (AAAI), 2022
Ziyu Guo
Renrui Zhang
Longtian Qiu
Xianzheng Ma
Xupeng Miao
Xuming He
Tengjiao Wang
VLMAAML
203
163
0
28 Sep 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for
  Open-world Detection
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world DetectionNeural Information Processing Systems (NeurIPS), 2022
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIPVLM
295
213
0
20 Sep 2022
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language
  Models
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language ModelsNeural Information Processing Systems (NeurIPS), 2022
Manli Shu
Weili Nie
De-An Huang
Zhiding Yu
Tom Goldstein
Anima Anandkumar
Chaowei Xiao
VLMVPVLM
480
430
0
15 Sep 2022
OmDet: Large-scale vision-language multi-dataset pre-training with
  multimodal detection network
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection networkIET Computer Vision (ICV), 2022
Tiancheng Zhao
Peng Liu
Kyusong Lee
VLMMLLMObjD
130
14
0
10 Sep 2022
Prompt Tuning with Soft Context Sharing for Vision-Language Models
Prompt Tuning with Soft Context Sharing for Vision-Language ModelsNeurocomputing (Neurocomputing), 2022
Kun Ding
Ying Wang
Pengzhang Liu
Qiang Yu
Hao Zhang
Shiming Xiang
Chunhong Pan
VPVLMVLM
250
20
0
29 Aug 2022
Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on
  Aligned Visual-Textual Features
Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features
Shichao Xu
Yikang Li
Jenhao Hsiao
C. Ho
Zhuang Qi
270
11
0
19 Aug 2022
Bridging the Gap between Object and Image-level Representations for
  Open-Vocabulary Detection
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary DetectionNeural Information Processing Systems (NeurIPS), 2022
H. Rasheed
Muhammad Maaz
Muhammad Uzair Khattak
Salman Khan
Fahad Shahbaz Khan
ObjDVLM
297
182
0
07 Jul 2022
Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge
  Transfer
Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge TransferAAAI Conference on Artificial Intelligence (AAAI), 2022
Su He
Taian Guo
Tao Dai
Ruizhi Qiao
Bo Ren
Shutao Xia
VLM
262
65
0
05 Jul 2022
Open Vocabulary Object Detection with Proposal Mining and Prediction
  Equalization
Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization
Peixian Chen
Kekai Sheng
Mengdan Zhang
Mingbao Lin
Chunjiang Ge
Shaohui Lin
Bo Ren
Ke Li
VLMObjD
309
31
0
22 Jun 2022
Unsupervised Prompt Learning for Vision-Language Models
Unsupervised Prompt Learning for Vision-Language Models
Hao Huang
Jack Chu
Fangyun Wei
VPVLMMLLMVLM
283
156
0
07 Apr 2022
Open-Vocabulary DETR with Conditional Matching
Open-Vocabulary DETR with Conditional MatchingEuropean Conference on Computer Vision (ECCV), 2022
Yuhang Zang
Wei Li
Kaiyang Zhou
Chen Huang
Chen Change Loy
ObjDVLM
328
258
0
22 Mar 2022
Overcoming Classifier Imbalance for Long-tail Object Detection with
  Balanced Group Softmax
Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax
Yu Li
Tao Wang
Bingyi Kang
Sheng Tang
Chunfeng Wang
Jintao Li
Jiashi Feng
325
289
0
18 Jun 2020
Previous
123456