Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2203.12119
Cited By
v1
v2 (latest)
Visual Prompt Tuning
European Conference on Computer Vision (ECCV), 2022
23 March 2022
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Visual Prompt Tuning"
50 / 1,271 papers shown
SelfPromer: Self-Prompt Dehazing Transformers with Depth-Consistency
AAAI Conference on Artificial Intelligence (AAAI), 2023
Cong Wang
Jin-shan Pan
Wanyu Lin
Jiangxin Dong
Xiaomei Wu
VLM
MDE
332
53
0
13 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
IEEE International Conference on Computer Vision (ICCV), 2023
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
233
2
0
13 Mar 2023
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models
IEEE International Conference on Computer Vision (ICCV), 2023
Juncheng Li
Minghe Gao
Longhui Wei
Siliang Tang
Wenqiao Zhang
Meng Li
Wei Ji
Qi Tian
Tat-Seng Chua
Yueting Zhuang
VLM
VPVLM
256
31
0
12 Mar 2023
From Visual Prompt Learning to Zero-Shot Transfer: Mapping Is All You Need
Ziqing Yang
Zeyang Sha
Michael Backes
Yang Zhang
VPVLM
VLM
179
4
0
09 Mar 2023
Rethinking Visual Prompt Learning as Masked Visual Token Modeling
Artificial Intelligence (AIJ), 2023
Ning Liao
Bowen Shi
Xiaopeng Zhang
Min Cao
Junchi Yan
Qi Tian
VLM
278
8
0
09 Mar 2023
Your representations are in the network: composable and parallel adaptation for large scale models
Neural Information Processing Systems (NeurIPS), 2023
Yonatan Dukler
Alessandro Achille
Hao Yang
Varsha Vivek
Luca Zancato
Benjamin Bowman
Avinash Ravichandran
Charless C. Fowlkes
A. Swaminathan
Stefano Soatto
297
3
0
07 Mar 2023
Multimodal Prompting with Missing Modalities for Visual Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Yi-Lun Lee
Yi-Hsuan Tsai
Wei-Chen Chiu
Chen-Yu Lee
VPVLM
267
148
0
06 Mar 2023
Dynamic Prompting: A Unified Framework for Prompt Tuning
Xianjun Yang
Wei Cheng
Xujiang Zhao
Wenchao Yu
Linda R. Petzold
Haifeng Chen
VLM
316
20
0
06 Mar 2023
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
Computer Vision and Pattern Recognition (CVPR), 2023
Xiaoping Han
Xiatian Zhu
Licheng Yu
Li Zhang
Yi-Zhe Song
Tao Xiang
VLM
179
63
0
04 Mar 2023
Decision Transformer under Random Frame Dropping
International Conference on Learning Representations (ICLR), 2023
Kaizhe Hu
Rachel Zheng
Yang Gao
Huazhe Xu
OffRL
237
16
0
03 Mar 2023
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving
Computer Vision and Pattern Recognition (CVPR), 2023
Xiwen Liang
Minzhe Niu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
VLM
244
20
0
03 Mar 2023
Learning to Grow Pretrained Models for Efficient Transformer Training
International Conference on Learning Representations (ICLR), 2023
Peihao Wang
Yikang Shen
Lucas Torroba Hennigen
P. Greengard
Leonid Karlinsky
Rogerio Feris
David D. Cox
Zinan Lin
Yoon Kim
199
70
0
02 Mar 2023
Enhancing General Face Forgery Detection via Vision Transformer with Low-Rank Adaptation
Conference on Multimedia Information Processing and Retrieval (MIPR), 2023
Chen Kong
Haoliang Li
Shiqi Wang
ViT
CVBM
201
21
0
02 Mar 2023
Rethinking Efficient Tuning Methods from a Unified Perspective
Zeyinzi Jiang
Chaojie Mao
Ziyuan Huang
Yiliang Lv
Deli Zhao
Jingren Zhou
231
15
0
01 Mar 2023
Convolutional Visual Prompt for Robust Visual Perception
Neural Information Processing Systems (NeurIPS), 2023
Yun-Yun Tsai
Chengzhi Mao
Junfeng Yang
VLM
VPVLM
341
20
0
01 Mar 2023
Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning
International Conference on Learning Representations (ICLR), 2023
Ivona Najdenkoska
Xiantong Zhen
Marcel Worring
VLM
192
30
0
28 Feb 2023
Benchmarking Deepart Detection
Yabin Wang
Zhiwu Huang
Xiaopeng Hong
194
14
0
28 Feb 2023
Adapter Incremental Continual Learning of Efficient Audio Spectrogram Transformers
Interspeech (Interspeech), 2023
Nithish Muthuchamy Selvaraj
Xiaobao Guo
A. Kong
Bingquan Shen
Alex C. Kot
CLL
148
12
0
28 Feb 2023
Boosting Adversarial Transferability using Dynamic Cues
International Conference on Learning Representations (ICLR), 2023
Muzammal Naseer
Ahmad A Mahmood
Salman Khan
Fahad Shahbaz Khan
AAML
178
6
0
23 Feb 2023
Entity-Level Text-Guided Image Manipulation
Yikai Wang
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Wei Zhang
Yanwei Fu
VGen
134
3
0
22 Feb 2023
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey
Machine Intelligence Research (MIR), 2023
Tianlin Li
Guangyao Chen
Guangwu Qian
Pengcheng Gao
Xiaoyong Wei
Yaowei Wang
Yonghong Tian
Wen Gao
AI4CE
VLM
467
272
0
20 Feb 2023
StyLIP: Multi-Scale Style-Conditioned Prompt Learning for CLIP-based Domain Generalization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Shirsha Bose
Ankit Jha
Enrico Fini
Mainak Singha
Elisa Ricci
Biplab Banerjee
VLM
286
45
0
18 Feb 2023
LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation
Jiaxin Cheng
Xiao Liang
Xingjian Shi
Tong He
Tianjun Xiao
Mu Li
DiffM
177
85
0
16 Feb 2023
Towards Efficient Visual Adaption via Structural Re-parameterization
Gen Luo
Minglang Huang
Weihao Ye
Xiaoshuai Sun
Guannan Jiang
Zhiyu Wang
Rongrong Ji
VLM
VPVLM
324
103
0
16 Feb 2023
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Minsu Kim
Hyungil Kim
Y. Ro
VLM
227
30
0
16 Feb 2023
À-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting
Computer Vision and Pattern Recognition (CVPR), 2023
Benjamin Bowman
Alessandro Achille
Luca Zancato
Matthew Trager
Pramuditha Perera
Giovanni Paolini
Stefano Soatto
VPVLM
173
19
0
15 Feb 2023
Generalized Few-Shot Continual Learning with Contrastive Mixture of Adapters
Yawen Cui
Zitong Yu
Rizhao Cai
Xuna Wang
Alex C. Kot
Tianpeng Liu
CLL
206
8
0
12 Feb 2023
Cross-Modal Fine-Tuning: Align then Refine
International Conference on Machine Learning (ICML), 2023
Junhong Shen
Liam Li
Lucio Dery
Corey Staten
M. Khodak
Graham Neubig
Ameet Talwalkar
230
58
0
11 Feb 2023
Flexible-modal Deception Detection with Audio-Visual Adapter
Zhaoxu Li
Zitong Yu
Nithish Muthuchamy Selvaraj
Xiaobao Guo
Bingquan Shen
A. Kong
Alex C. Kot
146
9
0
11 Feb 2023
AIM: Adapting Image Models for Efficient Video Action Recognition
International Conference on Learning Representations (ICLR), 2023
Taojiannan Yang
Yi Zhu
Yusheng Xie
Aston Zhang
Chong Chen
Mu Li
ViT
412
220
0
06 Feb 2023
CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets
International Conference on Machine Learning (ICML), 2023
Cheng-i Wang
Julian McAuley
Zachary Chase Lipton
Saurabh Garg
VLM
382
114
0
06 Feb 2023
On the Efficacy of Differentially Private Few-shot Image Classification
Marlon Tobaben
Aliaksandra Shysheya
J. Bronskill
Andrew Paverd
Shruti Tople
Santiago Zanella Béguelin
Richard Turner
Antti Honkela
412
16
0
02 Feb 2023
Boosting Low-Data Instance Segmentation by Unsupervised Pre-training with Saliency Prompt
Computer Vision and Pattern Recognition (CVPR), 2023
Hao Li
Dingwen Zhang
Nian Liu
Lechao Cheng
Yalun Dai
Chaoxi Zhang
Xinggang Wang
Junwei Han
149
21
0
02 Feb 2023
A Survey on Efficient Training of Transformers
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Bohan Zhuang
Jing Liu
Zizheng Pan
Haoyu He
Yuetian Weng
Chunhua Shen
399
73
0
02 Feb 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffM
VLM
237
137
0
30 Jan 2023
ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts
Kwanyoung Kim
Y. Oh
Jong Chul Ye
VLM
OT
CLIP
248
24
0
28 Jan 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Computer Vision and Pattern Recognition (CVPR), 2023
Shaofei Cai
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
243
46
0
21 Jan 2023
Vision Learners Meet Web Image-Text Pairs
Bingchen Zhao
Quan Cui
Hao Wu
Osamu Yoshie
Cheng Yang
Oisin Mac Aodha
VLM
183
6
0
17 Jan 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
Computer Vision and Pattern Recognition (CVPR), 2023
Zhiqiu Lin
Samuel Yu
Zhiyi Kuang
Deepak Pathak
Deva Ramana
VLM
456
152
0
16 Jan 2023
See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning
Zhenfang Chen
Qinhong Zhou
Songlin Yang
Yining Hong
Hao Zhang
Chuang Gan
LRM
VLM
269
54
0
12 Jan 2023
Exploring Efficient Few-shot Adaptation for Vision Transformers
C. Xu
Siqian Yang
Yabiao Wang
Zhanxiong Wang
Yanwei Fu
Xiangyang Xue
192
23
0
06 Jan 2023
Unleashing the Power of Visual Prompting At the Pixel Level
Junyang Wu
Xianhang Li
Chen Wei
Huiyu Wang
Alan Yuille
Yuyin Zhou
Cihang Xie
VPVLM
VLM
242
48
0
20 Dec 2022
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
International Conference on Learning Representations (ICLR), 2022
Runpei Dong
Zekun Qi
Linfeng Zhang
Junbo Zhang
Jian‐Yuan Sun
Zheng Ge
Li Yi
Kaisheng Ma
ViT
3DPC
299
137
0
16 Dec 2022
Understanding Zero-Shot Adversarial Robustness for Large-Scale Models
International Conference on Learning Representations (ICLR), 2022
Chengzhi Mao
Scott Geng
Junfeng Yang
Xin Eric Wang
Carl Vondrick
VLM
280
109
0
14 Dec 2022
Doubly Right Object Recognition: A Why Prompt for Visual Rationales
Computer Vision and Pattern Recognition (CVPR), 2022
Chengzhi Mao
Revant Teotia
Amrutha Sundar
Sachit Menon
Junfeng Yang
Xin Eric Wang
Carl Vondrick
262
34
0
12 Dec 2022
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery
Computer Vision and Pattern Recognition (CVPR), 2022
Shengxiang Zhang
Salman Khan
Zhiqiang Shen
Muzammal Naseer
Guangyi Chen
Fahad Shahbaz Khan
CLL
VLM
239
106
0
11 Dec 2022
PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Roei Herzig
Ofir Abramovich
Elad Ben-Avraham
Assaf Arbelle
Leonid Karlinsky
Ariel Shamir
Trevor Darrell
Amir Globerson
368
20
0
08 Dec 2022
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval
Computer Vision and Image Understanding (CVIU), 2022
Mustafa Shukor
Nicolas Thome
Matthieu Cord
CLIP
CoGe
272
15
0
08 Dec 2022
Learning Domain Invariant Prompt for Vision-Language Models
IEEE Transactions on Image Processing (IEEE TIP), 2022
Cairong Zhao
Yubin Wang
Xinyang Jiang
Yifei Shen
Kaitao Song
Dongsheng Li
Duoqian Miao
VLM
VPVLM
337
49
0
08 Dec 2022
Decorate the Newcomers: Visual Domain Prompt for Continual Test Time Adaptation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yulu Gan
Yan Bai
Yihang Lou
Xianzheng Ma
Renrui Zhang
Nian Shi
Lin Luo
OOD
VLM
333
130
0
08 Dec 2022
Previous
1
2
3
...
23
24
25
26
Next