Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

Neural Information Processing Systems (NeurIPS), 2022

15 September 2022

De-An Huang

Papers citing "Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models"

50 / 257 papers shown

CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive LossNeural Information Processing Systems (NeurIPS), 2023

R. S. Srinivasa

Jaejin Cho

Chouchang Yang

Yashas Malur Saidutta

202

26 Sep 2023

GraphAdapter: Tuning Vision-Language Models With Dual Knowledge GraphNeural Information Processing Systems (NeurIPS), 2023

Xin Li

274

102

24 Sep 2023

Improving CLIP Robustness with Knowledge Distillation and Self-Training

207

19 Sep 2023

SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient ChannelsInternational Journal of Computer Vision (IJCV), 2023

493

15 Sep 2023

Prompting Segmentation with Sound Is Generalizable Audio-Visual Source LocalizerAAAI Conference on Artificial Intelligence (AAAI), 2023

Xi Li

306

13 Sep 2023

Language Models as Black-Box Optimizers for Vision-Language ModelsComputer Vision and Pattern Recognition (CVPR), 2023

405

12 Sep 2023

BDC-Adapter: Brownian Distance Covariance for Better Vision-Language ReasoningBritish Machine Vision Conference (BMVC), 2023

272

03 Sep 2023

Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization

327

30 Aug 2023

Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification with Cross-Modal Retrieval

228

29 Aug 2023

Fine-tuning can cripple your foundation model; preserving features may be the solution

385

25 Aug 2023

Unsupervised Prototype Adapter for Vision-Language ModelsChinese Conference on Pattern Recognition and Computer Vision (CPRCV), 2023

285

22 Aug 2023

DPL: Decoupled Prompt Learning for Vision-Language Models

Yuhan Zhu

Gangshan Wu

119

19 Aug 2023

A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision

Julio Silva-Rodríguez

Jose Dolz

396

15 Aug 2023

Diverse Data Augmentation with Diffusions for Effective Test-time Prompt TuningIEEE International Conference on Computer Vision (ICCV), 2023

Salman Khan

279

150

11 Aug 2023

Improving Generalization of Image Captioning with Unsupervised Prompt Learning

Hongchen Wei

Zhenzhong Chen

VLM

190

05 Aug 2023

PerceptionCLIP: Visual Classification by Inferring and Conditioning on ContextsInternational Conference on Learning Representations (ICLR), 2023

Bang An

Sicheng Zhu

Michael-Andrei Panaitescu-Liess

Chaithanya Kumar Mummadi

Furong Huang

VLM

214

02 Aug 2023

Cross-Modal Concept Learning and Inference for Vision-Language ModelsNeurocomputing (Neurocomputing), 2023

196

28 Jul 2023

Why Is Prompt Tuning for Vision-Language Models Robust to Noisy Labels?IEEE International Conference on Computer Vision (ICCV), 2023

Heng Wang

Linjie Yang

133

22 Jul 2023

A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and FutureIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Chaoyang Zhu

Long Chen

ObjD VLM

511

18 Jul 2023

Self-regulating Prompts: Foundational Model Adaptation without ForgettingIEEE International Conference on Computer Vision (ICCV), 2023

Muhammad Uzair Khattak

Salman Khan

388

309

13 Jul 2023

Neural Priming for Sample-Efficient AdaptationNeural Information Processing Systems (NeurIPS), 2023

486

16 Jun 2023

What can a cook in Italy teach a mechanic in India? Action Recognition Generalisation Over Scenarios and LocationsIEEE International Conference on Computer Vision (ICCV), 2023

Dima Damen

369

14 Jun 2023

Waffling around for Performance: Visual Classification with Random Words and Broad ConceptsIEEE International Conference on Computer Vision (ICCV), 2023

A. Sophia Koepke

252

111

12 Jun 2023

How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models?International Journal of Computer Vision (IJCV), 2023

Yifei Ming

Shouqing Yang

OODD VLM

295

09 Jun 2023

Enhancing CLIP with CLIP: Exploring Pseudolabeling for Limited-Label Prompt TuningNeural Information Processing Systems (NeurIPS), 2023

423

02 Jun 2023

Vocabulary-free Image ClassificationNeural Information Processing Systems (NeurIPS), 2023

465

01 Jun 2023

LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image CollectionsNeural Information Processing Systems (NeurIPS), 2023

349

29 May 2023

Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language ModelsInternational Conference on Learning Representations (ICLR), 2023

317

29 May 2023

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D ClassificationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

347

25 May 2023

Continual Vision-Language Representation Learning with Off-Diagonal InformationInternational Conference on Machine Learning (ICML), 2023

376

11 May 2023

Visual TuningACM Computing Surveys (ACM Comput. Surv.), 2023

...

438

10 May 2023

Progressive Visual Prompt Learning with Contrastive Feature Re-formationInternational Journal of Computer Vision (IJCV), 2023

Yuhan Zhu

297

17 Apr 2023

APPLeNet: Visual Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIP

Mainak Singha

Ankit Jha

Bhupendra S. Solanki

Shirsha Bose

Biplab Banerjee

VLM

171

12 Apr 2023

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual RecognitionNeural Information Processing Systems (NeurIPS), 2023

237

10 Apr 2023

Defense-Prefix for Preventing Typographic Attacks on CLIP

Hiroki Azuma

Yusuke Matsui

VLM AAML

289

10 Apr 2023

Black Box Few-Shot Adaptation for Vision-Language modelsIEEE International Conference on Computer Vision (ICCV), 2023

Yassine Ouali

Adrian Bulat

Brais Martínez

Georgios Tzimiropoulos

VLM

249

04 Apr 2023

Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior RefinementIEEE International Conference on Computer Vision (ICCV), 2023

250

107

03 Apr 2023

Vision-Language Models for Vision Tasks: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

501

1,044

03 Apr 2023

A Comprehensive Survey on Test-Time Adaptation under Distribution ShiftsInternational Journal of Computer Vision (IJCV), 2023

Jian Liang

Ran He

Tien-Ping Tan

OOD VLM TTA

318

389

27 Mar 2023

Robust Test-Time Adaptation in Dynamic ScenariosComputer Vision and Pattern Recognition (CVPR), 2023

334

187

24 Mar 2023

Challenges and Practices of Deep Learning Model Reengineering: A Case Study on Computer VisionEmpirical Software Engineering (EMSE), 2023

George K. Thiruvathukal

James C. Davis

VLM

186

13 Mar 2023

Dynamic Prompting: A Unified Framework for Prompt Tuning

322

06 Mar 2023

Temporal Coherent Test-Time Optimization for Robust Video ClassificationInternational Conference on Learning Representations (ICLR), 2023

215

28 Feb 2023

Test-Time Distribution Normalization for Contrastively Learned Vision-language ModelsNeural Information Processing Systems (NeurIPS), 2023

Ser-Nam Lim

245

22 Feb 2023

StyLIP: Multi-Scale Style-Conditioned Prompt Learning for CLIP-based Domain GeneralizationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Ankit Jha

Mainak Singha

Biplab Banerjee

298

18 Feb 2023

A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image ModelsInternational Conference on Machine Learning (ICML), 2023

J. Allingham

Jie Jessie Ren

Michael W. Dusenberry

Balaji Lakshminarayanan

LLMAG VLM

278

13 Feb 2023

CLIPood: Generalizing CLIP to Out-of-DistributionsInternational Conference on Machine Learning (ICML), 2023

Ximei Wang

379

109

02 Feb 2023

Understanding Zero-Shot Adversarial Robustness for Large-Scale ModelsInternational Conference on Learning Representations (ICLR), 2022

Scott Geng

Carl Vondrick

283

112

14 Dec 2022

Improving Zero-shot Generalization and Robustness of Multi-modal ModelsComputer Vision and Pattern Recognition (CVPR), 2022

Balaji Lakshminarayanan

Jiaping Zhao

VLM

324

04 Dec 2022

SuS-X: Training-Free Name-Only Transfer of Vision-Language ModelsIEEE International Conference on Computer Vision (ICCV), 2022

460

143

28 Nov 2022