Visual Prompt Tuning
European Conference on Computer Vision (ECCV), 2022
23 March 2022
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
    VLMVPVLM

Papers citing "Visual Prompt Tuning"

50 / 1,271 papers shown
SelfPromer: Self-Prompt Dehazing Transformers with Depth-Consistency
AAAI Conference on Artificial Intelligence (AAAI), 2023
Cong Wang
Jin-shan Pan
Wanyu Lin
Jiangxin Dong
Xiaomei Wu
VLMMDE
332
53
0
13 Mar 2023
ViM: Vision Middleware for Unified Downstream Transferring
IEEE International Conference on Computer Vision (ICCV), 2023
Yutong Feng
Biao Gong
Jianwen Jiang
Yiliang Lv
Yujun Shen
Deli Zhao
Jingren Zhou
233
2
0
13 Mar 2023
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models
IEEE International Conference on Computer Vision (ICCV), 2023
Juncheng Li
Minghe Gao
Longhui Wei
Siliang Tang
Wenqiao Zhang
Meng Li
Wei Ji
Qi Tian
Tat-Seng Chua
Yueting Zhuang
VLMVPVLM
256
31
0
12 Mar 2023
From Visual Prompt Learning to Zero-Shot Transfer: Mapping Is All You Need
Ziqing Yang
Zeyang Sha
Michael Backes
Yang Zhang
VPVLMVLM
179
4
0
09 Mar 2023
Rethinking Visual Prompt Learning as Masked Visual Token Modeling
Artificial Intelligence (AIJ), 2023
Ning Liao
Bowen Shi
Xiaopeng Zhang
Min Cao
Junchi Yan
Qi Tian
VLM
278
8
0
09 Mar 2023
Your representations are in the network: composable and parallel adaptation for large scale models
Neural Information Processing Systems (NeurIPS), 2023
Yonatan Dukler
Alessandro Achille
Hao Yang
Varsha Vivek
Luca Zancato
Benjamin Bowman
Avinash Ravichandran
Charless C. Fowlkes
A. Swaminathan
Stefano Soatto
297
3
0
07 Mar 2023
Multimodal Prompting with Missing Modalities for Visual Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Yi-Lun Lee
Yi-Hsuan Tsai
Wei-Chen Chiu
Chen-Yu Lee
VPVLM
267
148
0
06 Mar 2023
Dynamic Prompting: A Unified Framework for Prompt Tuning
Xianjun Yang
Wei Cheng
Xujiang Zhao
Wenchao Yu
Linda R. Petzold
Haifeng Chen
VLM
316
20
0
06 Mar 2023
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
Computer Vision and Pattern Recognition (CVPR), 2023
Xiaoping Han
Xiatian Zhu
Licheng Yu
Li Zhang
Yi-Zhe Song
Tao Xiang
VLM
179
63
0
04 Mar 2023
Decision Transformer under Random Frame Dropping
International Conference on Learning Representations (ICLR), 2023
Kaizhe Hu
Rachel Zheng
Yang Gao
Huazhe Xu
OffRL
237
16
0
03 Mar 2023
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving
Computer Vision and Pattern Recognition (CVPR), 2023
Xiwen Liang
Minzhe Niu
Jianhua Han
Hang Xu
Chunjing Xu
Xiaodan Liang
VLM
244
20
0
03 Mar 2023
Learning to Grow Pretrained Models for Efficient Transformer Training
International Conference on Learning Representations (ICLR), 2023
Peihao Wang
Yikang Shen
Lucas Torroba Hennigen
P. Greengard
Leonid Karlinsky
Rogerio Feris
David D. Cox
Zinan Lin
Yoon Kim
199
70
0
02 Mar 2023
Enhancing General Face Forgery Detection via Vision Transformer with Low-Rank Adaptation
Conference on Multimedia Information Processing and Retrieval (MIPR), 2023
Chen Kong
Haoliang Li
Shiqi Wang
ViTCVBM
201
21
0
02 Mar 2023
Rethinking Efficient Tuning Methods from a Unified Perspective
Zeyinzi Jiang
Chaojie Mao
Ziyuan Huang
Yiliang Lv
Deli Zhao
Jingren Zhou
231
15
0
01 Mar 2023
Convolutional Visual Prompt for Robust Visual Perception
Neural Information Processing Systems (NeurIPS), 2023
Yun-Yun Tsai
Chengzhi Mao
Junfeng Yang
VLMVPVLM
341
20
0
01 Mar 2023
Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning
International Conference on Learning Representations (ICLR), 2023
Ivona Najdenkoska
Xiantong Zhen
Marcel Worring
VLM
192
30
0
28 Feb 2023
Benchmarking Deepart Detection
Yabin Wang
Zhiwu Huang
Xiaopeng Hong
194
14
0
28 Feb 2023
Adapter Incremental Continual Learning of Efficient Audio Spectrogram Transformers
Interspeech (Interspeech), 2023
Nithish Muthuchamy Selvaraj
Xiaobao Guo
A. Kong
Bingquan Shen
Alex C. Kot
CLL
148
12
0
28 Feb 2023
Boosting Adversarial Transferability using Dynamic Cues
International Conference on Learning Representations (ICLR), 2023
Muzammal Naseer
Ahmad A Mahmood
Salman Khan
Fahad Shahbaz Khan
AAML
178
6
0
23 Feb 2023
Entity-Level Text-Guided Image Manipulation
Yikai Wang
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Wei Zhang
Yanwei Fu
VGen
134
3
0
22 Feb 2023
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey
Machine Intelligence Research (MIR), 2023
Tianlin Li
Guangyao Chen
Guangwu Qian
Pengcheng Gao
Xiaoyong Wei
Yaowei Wang
Yonghong Tian
Wen Gao
AI4CEVLM
467
272
0
20 Feb 2023
StyLIP: Multi-Scale Style-Conditioned Prompt Learning for CLIP-based Domain Generalization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Shirsha Bose
Ankit Jha
Enrico Fini
Mainak Singha
Elisa Ricci
Biplab Banerjee
VLM
286
45
0
18 Feb 2023
LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation
Jiaxin Cheng
Xiao Liang
Xingjian Shi
Tong He
Tianjun Xiao
Mu Li
DiffM
177
85
0
16 Feb 2023
Towards Efficient Visual Adaption via Structural Re-parameterization
Gen Luo
Minglang Huang
Weihao Ye
Xiaoshuai Sun
Guannan Jiang
Zhiyu Wang
Rongrong Ji
VLMVPVLM
324
103
0
16 Feb 2023
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Minsu Kim
Hyungil Kim
Y. Ro
VLM
227
30
0
16 Feb 2023
À-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting
Computer Vision and Pattern Recognition (CVPR), 2023
Benjamin Bowman
Alessandro Achille
Luca Zancato
Matthew Trager
Pramuditha Perera
Giovanni Paolini
Stefano Soatto
VPVLM
173
19
0
15 Feb 2023
Generalized Few-Shot Continual Learning with Contrastive Mixture of Adapters
Yawen Cui
Zitong Yu
Rizhao Cai
Xuna Wang
Alex C. Kot
Tianpeng Liu
CLL
206
8
0
12 Feb 2023
Cross-Modal Fine-Tuning: Align then Refine
International Conference on Machine Learning (ICML), 2023
Junhong Shen
Liam Li
Lucio Dery
Corey Staten
M. Khodak
Graham Neubig
Ameet Talwalkar
230
58
0
11 Feb 2023
Flexible-modal Deception Detection with Audio-Visual Adapter
Zhaoxu Li
Zitong Yu
Nithish Muthuchamy Selvaraj
Xiaobao Guo
Bingquan Shen
A. Kong
Alex C. Kot
146
9
0
11 Feb 2023
AIM: Adapting Image Models for Efficient Video Action Recognition
International Conference on Learning Representations (ICLR), 2023
Taojiannan Yang
Yi Zhu
Yusheng Xie
Aston Zhang
Chong Chen
Mu Li
ViT
412
220
0
06 Feb 2023
CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets
International Conference on Machine Learning (ICML), 2023
Cheng-i Wang
Julian McAuley
Zachary Chase Lipton
Saurabh Garg
VLM
382
114
0
06 Feb 2023
On the Efficacy of Differentially Private Few-shot Image Classification
Marlon Tobaben
Aliaksandra Shysheya
J. Bronskill
Andrew Paverd
Shruti Tople
Santiago Zanella Béguelin
Richard Turner
Antti Honkela
412
16
0
02 Feb 2023
Boosting Low-Data Instance Segmentation by Unsupervised Pre-training with Saliency Prompt
Computer Vision and Pattern Recognition (CVPR), 2023
Hao Li
Dingwen Zhang
Nian Liu
Lechao Cheng
Yalun Dai
Chaoxi Zhang
Xinggang Wang
Junwei Han
149
21
0
02 Feb 2023
A Survey on Efficient Training of Transformers
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Bohan Zhuang
Jing Liu
Zizheng Pan
Haoyu He
Yuetian Weng
Chunhua Shen
399
73
0
02 Feb 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffMVLM
237
137
0
30 Jan 2023
ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts
Kwanyoung Kim
Y. Oh
Jong Chul Ye
VLMOTCLIP
248
24
0
28 Jan 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Computer Vision and Pattern Recognition (CVPR), 2023
Shaofei Cai
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
243
46
0
21 Jan 2023
Vision Learners Meet Web Image-Text Pairs
Bingchen Zhao
Quan Cui
Hao Wu
Osamu Yoshie
Cheng Yang
Oisin Mac Aodha
VLM
183
6
0
17 Jan 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
Computer Vision and Pattern Recognition (CVPR), 2023
Zhiqiu Lin
Samuel Yu
Zhiyi Kuang
Deepak Pathak
Deva Ramanan
VLM
456
152
0
16 Jan 2023
See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning
Zhenfang Chen
Qinhong Zhou
Songlin Yang
Yining Hong
Hao Zhang
Chuang Gan
LRMVLM
269
54
0
12 Jan 2023
Exploring Efficient Few-shot Adaptation for Vision Transformers
C. Xu
Siqian Yang
Yabiao Wang
Zhanxiong Wang
Yanwei Fu
Xiangyang Xue
192
23
0
06 Jan 2023
Unleashing the Power of Visual Prompting At the Pixel Level
Junyang Wu
Xianhang Li
Chen Wei
Huiyu Wang
Alan Yuille
Yuyin Zhou
Cihang Xie
VPVLMVLM
242
48
0
20 Dec 2022
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
International Conference on Learning Representations (ICLR), 2022
Runpei Dong
Zekun Qi
Linfeng Zhang
Junbo Zhang
Jian‐Yuan Sun
Zheng Ge
Li Yi
Kaisheng Ma
ViT3DPC
299
137
0
16 Dec 2022
Understanding Zero-Shot Adversarial Robustness for Large-Scale Models
International Conference on Learning Representations (ICLR), 2022
Chengzhi Mao
Scott Geng
Junfeng Yang
Xin Eric Wang
Carl Vondrick
VLM
280
109
0
14 Dec 2022
Doubly Right Object Recognition: A Why Prompt for Visual Rationales
Computer Vision and Pattern Recognition (CVPR), 2022
Chengzhi Mao
Revant Teotia
Amrutha Sundar
Sachit Menon
Junfeng Yang
Xin Eric Wang
Carl Vondrick
262
34
0
12 Dec 2022
PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery
Computer Vision and Pattern Recognition (CVPR), 2022
Shengxiang Zhang
Salman Khan
Zhiqiang Shen
Muzammal Naseer
Guangyi Chen
Fahad Shahbaz Khan
CLLVLM
239
106
0
11 Dec 2022
PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Roei Herzig
Ofir Abramovich
Elad Ben-Avraham
Assaf Arbelle
Leonid Karlinsky
Ariel Shamir
Trevor Darrell
Amir Globerson
368
20
0
08 Dec 2022
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval
Computer Vision and Image Understanding (CVIU), 2022
Mustafa Shukor
Nicolas Thome
Matthieu Cord
CLIPCoGe
272
15
0
08 Dec 2022
Learning Domain Invariant Prompt for Vision-Language Models
IEEE Transactions on Image Processing (IEEE TIP), 2022
Cairong Zhao
Yubin Wang
Xinyang Jiang
Yifei Shen
Kaitao Song
Dongsheng Li
Duoqian Miao
VLMVPVLM
337
49
0
08 Dec 2022
Decorate the Newcomers: Visual Domain Prompt for Continual Test Time Adaptation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Yulu Gan
Yan Bai
Yihang Lou
Xianzheng Ma
Renrui Zhang
Nian Shi
Lin Luo
OODVLM
333
130
0
08 Dec 2022