Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2209.07511
Cited By
Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models
Neural Information Processing Systems (NeurIPS), 2022
15 September 2022
Manli Shu
Weili Nie
De-An Huang
Zhiding Yu
Tom Goldstein
Anima Anandkumar
Chaowei Xiao
VLM
VPVLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models"
50 / 257 papers shown
CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive Loss
Neural Information Processing Systems (NeurIPS), 2023
R. S. Srinivasa
Jaejin Cho
Chouchang Yang
Yashas Malur Saidutta
Ching Hua Lee
Yilin Shen
Hongxia Jin
VLM
202
17
0
26 Sep 2023
GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph
Neural Information Processing Systems (NeurIPS), 2023
Xin Li
Dongze Lian
Zhihe Lu
Jiawang Bai
Zhibo Chen
Xinchao Wang
VLM
274
102
0
24 Sep 2023
Improving CLIP Robustness with Knowledge Distillation and Self-Training
Clement Laroudie
Andrei Bursuc
Mai Lan Ha
Gianni Franchi
VLM
207
6
0
19 Sep 2023
SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
International Journal of Computer Vision (IJCV), 2023
Henry Hengyuan Zhao
Pichao Wang
Yuyang Zhao
Hao Luo
F. Wang
Mike Zheng Shou
ViT
493
21
0
15 Sep 2023
Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yaoting Wang
Weisong Liu
Guangyao Li
Jian Ding
Di Hu
Xi Li
VLM
306
38
0
13 Sep 2023
Language Models as Black-Box Optimizers for Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2023
Shihong Liu
Zhiqiu Lin
Samuel Yu
Ryan Lee
Tiffany Ling
Deepak Pathak
Deva Ramanan
VLM
405
41
0
12 Sep 2023
BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning
British Machine Vision Conference (BMVC), 2023
Yi Zhang
Ce Zhang
Zihan Liao
Yushun Tang
Zhihai He
BDL
VLM
272
11
0
03 Sep 2023
Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization
Hanqiu Deng
Zhaoxiang Zhang
Jinan Bao
Xingyu Li
VLM
327
10
0
30 Aug 2023
Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification with Cross-Modal Retrieval
Seong-Hoon Eom
Namgyu Ho
Jaehoon Oh
Se-Young Yun
CLIP
VLM
228
2
0
29 Aug 2023
Fine-tuning can cripple your foundation model; preserving features may be the solution
Jishnu Mukhoti
Y. Gal
Juil Sock
P. Dokania
CLL
385
71
0
25 Aug 2023
Unsupervised Prototype Adapter for Vision-Language Models
Chinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023
Yi Zhang
Ce Zhang
Xue-mei Hu
Z. He
VLM
285
8
0
22 Aug 2023
DPL: Decoupled Prompt Learning for Vision-Language Models
C. Xu
Yuhan Zhu
Guozhen Zhang
Haocheng Shen
Yixuan Liao
Xiaoxin Chen
Gangshan Wu
Limin Wang
VLM
119
5
0
19 Aug 2023
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
Julio Silva-Rodríguez
H. Chakor
Riadh Kobbi
Jose Dolz
Ismail Ben Ayed
VLM
MedIm
396
82
0
15 Aug 2023
Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
IEEE International Conference on Computer Vision (ICCV), 2023
Chun-Mei Feng
Kai Yu
Yong Liu
Salman Khan
W. Zuo
VLM
279
150
0
11 Aug 2023
Improving Generalization of Image Captioning with Unsupervised Prompt Learning
Hongchen Wei
Zhenzhong Chen
VLM
190
4
0
05 Aug 2023
PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts
International Conference on Learning Representations (ICLR), 2023
Bang An
Sicheng Zhu
Michael-Andrei Panaitescu-Liess
Chaithanya Kumar Mummadi
Furong Huang
VLM
214
14
0
02 Aug 2023
Cross-Modal Concept Learning and Inference for Vision-Language Models
Neurocomputing (Neurocomputing), 2023
Yi Zhang
Ce Zhang
Yushun Tang
Z. He
VLM
MLLM
CLIP
196
20
0
28 Jul 2023
Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?
IEEE International Conference on Computer Vision (ICCV), 2023
Cheng-En Wu
Yu Tian
Haichao Yu
Heng Wang
Pedro Morgado
Yu Hen Hu
Linjie Yang
NoLa
VPVLM
VLM
133
26
0
22 Jul 2023
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Chaoyang Zhu
Long Chen
ObjD
VLM
511
72
0
18 Jul 2023
Self-regulating Prompts: Foundational Model Adaptation without Forgetting
IEEE International Conference on Computer Vision (ICCV), 2023
Muhammad Uzair Khattak
Syed Talal Wasim
Muzammal Naseer
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
VLM
388
309
0
13 Jul 2023
Neural Priming for Sample-Efficient Adaptation
Neural Information Processing Systems (NeurIPS), 2023
Matthew Wallingford
Vivek Ramanujan
Alex Fang
Aditya Kusupati
Roozbeh Mottaghi
Aniruddha Kembhavi
Ludwig Schmidt
Ali Farhadi
VLM
486
19
0
16 Jun 2023
What can a cook in Italy teach a mechanic in India? Action Recognition Generalisation Over Scenarios and Locations
IEEE International Conference on Computer Vision (ICCV), 2023
Chiara Plizzari
Toby Perrett
Barbara Caputo
Dima Damen
EgoV
369
20
0
14 Jun 2023
Waffling around for Performance: Visual Classification with Random Words and Broad Concepts
IEEE International Conference on Computer Vision (ICCV), 2023
Karsten Roth
Jae Myung Kim
A. Sophia Koepke
Oriol Vinyals
Cordelia Schmid
Zeynep Akata
VLM
252
111
0
12 Jun 2023
How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models?
International Journal of Computer Vision (IJCV), 2023
Yifei Ming
Shouqing Yang
OODD
VLM
295
54
0
09 Jun 2023
Enhancing CLIP with CLIP: Exploring Pseudolabeling for Limited-Label Prompt Tuning
Neural Information Processing Systems (NeurIPS), 2023
Cristina Menghini
Andrew T. Delworth
Stephen H. Bach
VLM
423
33
0
02 Jun 2023
Vocabulary-free Image Classification
Neural Information Processing Systems (NeurIPS), 2023
Alessandro Conti
Enrico Fini
Goran Frehse
Paolo Rota
Yiming Wang
Elisa Ricci
VLM
465
34
0
01 Jun 2023
LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections
Neural Information Processing Systems (NeurIPS), 2023
M. Jehanzeb Mirza
Leonid Karlinsky
Wei Lin
Mateusz Koziñski
Horst Possegger
Rogerio Feris
Horst Bischof
VLM
349
48
0
29 May 2023
Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models
International Conference on Learning Representations (ICLR), 2023
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
VLM
317
39
0
29 May 2023
DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Sitian Shen
Zilin Zhu
Linqian Fan
Harry Zhang
Xinxiao Wu
DiffM
347
37
0
25 May 2023
Continual Vision-Language Representation Learning with Off-Diagonal Information
International Conference on Machine Learning (ICML), 2023
Zixuan Ni
Longhui Wei
Siliang Tang
Yueting Zhuang
Qi Tian
VLM
CLL
376
35
0
11 May 2023
Visual Tuning
ACM Computing Surveys (ACM Comput. Surv.), 2023
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
438
60
0
10 May 2023
Progressive Visual Prompt Learning with Contrastive Feature Re-formation
International Journal of Computer Vision (IJCV), 2023
C. Xu
Yuhan Zhu
Haocheng Shen
Fengyuan Shi
Boheng Chen
Yixuan Liao
Xiaoxin Chen
Limin Wang
VLM
297
47
0
17 Apr 2023
APPLeNet: Visual Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIP
Mainak Singha
Ankit Jha
Bhupendra S. Solanki
Shirsha Bose
Biplab Banerjee
VLM
171
43
0
12 Apr 2023
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition
Neural Information Processing Systems (NeurIPS), 2023
Shuhuai Ren
Aston Zhang
Yi Zhu
Shuai Zhang
Shuai Zheng
Mu Li
Alexander J. Smola
Xu Sun
VPVLM
VLM
237
41
0
10 Apr 2023
Defense-Prefix for Preventing Typographic Attacks on CLIP
Hiroki Azuma
Yusuke Matsui
VLM
AAML
289
25
0
10 Apr 2023
Black Box Few-Shot Adaptation for Vision-Language models
IEEE International Conference on Computer Vision (ICCV), 2023
Yassine Ouali
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
VLM
249
45
0
04 Apr 2023
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement
IEEE International Conference on Computer Vision (ICCV), 2023
Xiang-yu Zhu
Renrui Zhang
Bowei He
A-Long Zhou
Dong Wang
Bingyan Zhao
Shiyang Feng
VLM
250
107
0
03 Apr 2023
Vision-Language Models for Vision Tasks: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Jingyi Zhang
Jiaxing Huang
Sheng Jin
Shijian Lu
VLM
501
1,044
0
03 Apr 2023
A Comprehensive Survey on Test-Time Adaptation under Distribution Shifts
International Journal of Computer Vision (IJCV), 2023
Jian Liang
Ran He
Tien-Ping Tan
OOD
VLM
TTA
318
389
0
27 Mar 2023
Robust Test-Time Adaptation in Dynamic Scenarios
Computer Vision and Pattern Recognition (CVPR), 2023
Longhui Yuan
Binhui Xie
Shuangliang Li
TTA
334
187
0
24 Mar 2023
Challenges and Practices of Deep Learning Model Reengineering: A Case Study on Computer Vision
Empirical Software Engineering (EMSE), 2023
Wenxin Jiang
Vishnu Banna
Naveen Vivek
Abhinav Goel
Nicholas Synovic
George K. Thiruvathukal
James C. Davis
VLM
186
28
0
13 Mar 2023
Dynamic Prompting: A Unified Framework for Prompt Tuning
Xianjun Yang
Wei Cheng
Xujiang Zhao
Wenchao Yu
Linda R. Petzold
Haifeng Chen
VLM
322
20
0
06 Mar 2023
Temporal Coherent Test-Time Optimization for Robust Video Classification
International Conference on Learning Representations (ICLR), 2023
Chenyu Yi
Siyuan Yang
Yufei Wang
Haoliang Li
Yap-Peng Tan
Alex C. Kot
TTA
215
16
0
28 Feb 2023
Test-Time Distribution Normalization for Contrastively Learned Vision-language Models
Neural Information Processing Systems (NeurIPS), 2023
Yi Zhou
Juntao Ren
Fengyu Li
Ramin Zabih
Ser-Nam Lim
VLM
245
21
0
22 Feb 2023
StyLIP: Multi-Scale Style-Conditioned Prompt Learning for CLIP-based Domain Generalization
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Shirsha Bose
Ankit Jha
Enrico Fini
Mainak Singha
Elisa Ricci
Biplab Banerjee
VLM
298
46
0
18 Feb 2023
A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models
International Conference on Machine Learning (ICML), 2023
J. Allingham
Jie Jessie Ren
Michael W. Dusenberry
Xiuye Gu
Huayu Chen
Dustin Tran
J. Liu
Balaji Lakshminarayanan
LLMAG
VLM
278
56
0
13 Feb 2023
CLIPood: Generalizing CLIP to Out-of-Distributions
International Conference on Machine Learning (ICML), 2023
Yang Shu
Xingzhuo Guo
Jialong Wu
Ximei Wang
Jianmin Wang
Mingsheng Long
OODD
VLM
379
109
0
02 Feb 2023
Understanding Zero-Shot Adversarial Robustness for Large-Scale Models
International Conference on Learning Representations (ICLR), 2022
Chengzhi Mao
Scott Geng
Junfeng Yang
Xin Eric Wang
Carl Vondrick
VLM
283
112
0
14 Dec 2022
Improving Zero-shot Generalization and Robustness of Multi-modal Models
Computer Vision and Pattern Recognition (CVPR), 2022
Yunhao Ge
Jie Jessie Ren
Andrew Gallagher
Yuxiao Wang
Ming-Hsuan Yang
Hartwig Adam
Laurent Itti
Balaji Lakshminarayanan
Jiaping Zhao
VLM
324
55
0
04 Dec 2022
SuS-X: Training-Free Name-Only Transfer of Vision-Language Models
IEEE International Conference on Computer Vision (ICCV), 2022
Vishaal Udandarao
Ankush Gupta
Samuel Albanie
VLM
MLLM
460
143
0
28 Nov 2022
Previous
1
2
3
4
5
6
Next
Page 5 of 6