2209.03320
What does a platypus look like? Generating customized prompts for zero-shot image classification
IEEE International Conference on Computer Vision (ICCV), 2023
7 September 2022
Sarah M Pratt
Ian Covert
Rosanne Liu
Ali Farhadi
VLM
Github (191★)
Papers citing
"What does a platypus look like? Generating customized prompts for zero-shot image classification"
50 / 194 papers shown
SeMoBridge: Semantic Modality Bridge for Efficient Few-Shot Adaptation of CLIP
Christoph Timmermann
Hyunse Lee
Woojin Lee
VLM
192
2
0
10 Apr 2026
AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning
Zheng Li
Yibing Song
Xin Zhang
L. Luo
Xiang Li
J. Yang
213
0
0
26 Nov 2025
DiVE-k: Differential Visual Reasoning for Fine-grained Image Recognition
Raja Kumar
Arka Sadhu
Ram Nevatia
VLM
262
0
0
23 Nov 2025
Culture in Action: Evaluating Text-to-Image Models through Social Activities
Sina Malakouti
Boqing Gong
Adriana Kovashka
EGVM
VLM
424
1
0
07 Nov 2025
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts
Weihao Bo
Yanpeng Sun
Y. Wang
X. Zhang
Zechao Li
FedML
VLM
329
0
0
01 Nov 2025
Model Inversion with Layer-Specific Modeling and Alignment for Data-Free Continual Learning
Ruilin Tong
Haodong Lu
Yuhang Liu
Dong Gong
CLL
285
1
0
30 Oct 2025
Free-Grained Hierarchical Visual Recognition
Seulki Park
Zilin Wang
Stella X. Yu
NoLa
188
1
0
16 Oct 2025
Cluster-Aware Prompt Ensemble Learning for Few-Shot Vision-Language Model Adaptation
Pattern Recognition (Pattern Recogn.), 2025
Zhi Chen
Xin Yu
Xiaohui Tao
Yan Li
Zi Huang
VLM
236
12
0
10 Oct 2025
Conditional Representation Learning for Customized Tasks
Honglin Liu
Chao Sun
Peng Hu
Yunfan Li
Xi Peng
203
2
0
06 Oct 2025
microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
Sathira Silva
Eman Ali
Chetan Arora
Muhammad Haris Khan
VLM
189
0
0
02 Oct 2025
Hierarchical Representation Matching for CLIP-based Class-Incremental Learning
Zhen-Hao Wen
Yan Wang
Ji Feng
Han-Jia Ye
De-Chuan Zhan
Da-Wei Zhou
CLL
VLM
206
1
0
26 Sep 2025
No Labels Needed: Zero-Shot Image Classification with Collaborative Self-Learning
Matheus Vinícius Todescato
Joel Luís Carbonera
VLM
160
0
0
23 Sep 2025
3D Aware Region Prompted Vision Language Model
A. Cheng
Yang Fu
Yukang Chen
Zhijian Liu
X. Li
...
Jan Kautz
Pavlo Molchanov
Hongxu Yin
Xiaolong Wang
Sifei Liu
167
20
0
16 Sep 2025
Causality-guided Prompt Learning for Vision-language Models via Visual Granulation
Mengyu Gao
Qiulei Dong
VLM
338
2
0
04 Sep 2025
Domain Generalization in-the-Wild: Disentangling Classification from Domain-Aware Representations
Ha Min Son
Zhe Zhao
Shahbaz Rezaei
Xin Liu
323
0
0
29 Aug 2025
Constrained Prompt Enhancement for Improving Zero-Shot Generalization of Vision-Language Models
Xiaojie Yin
Qilong Wang
Q. Hu
VLM
222
1
0
24 Aug 2025
Adapting Vision-Language Models Without Labels: A Comprehensive Survey
Hao Dong
Lijun Sheng
Jian Liang
Ran He
Eleni Chatzi
Olga Fink
OffRL
VLM
258
5
0
07 Aug 2025
Decoupling Continual Semantic Segmentation
Yifu Guo
Y. Lu
Wentao Zhang
Zishan Xu
Dexia Chen
Siyu Zhang
Yizhe Zhang
Ruixuan Wang
CLL
280
3
0
07 Aug 2025
Open-Vocabulary HOI Detection with Interaction-aware Prompt and Concept Calibration
Ting Lei
Shaofeng Yin
Qingchao Chen
Yuxin Peng
Yang Liu
VLM
164
2
0
05 Aug 2025
Causal Disentanglement and Cross-Modal Alignment for Enhanced Few-Shot Learning
Tianjiao Jiang
Zhen Zhang
Y. Liu
J. Q. Shi
251
2
0
05 Aug 2025
Multi-Cache Enhanced Prototype Learning for Test-Time Generalization of Vision-Language Models
Xinyu Chen
Haotian Zhai
Can Zhang
Xiupeng Shi
Ruirui Li
VLM
328
3
0
02 Aug 2025
Evading Data Provenance in Deep Neural Networks
Hongyu Zhu
Sichu Liang
Wenwen Wang
Zhuomeng Zhang
Fangqi Li
Shi-Lin Wang
AAML
321
4
0
01 Aug 2025
Vocabulary-free Fine-grained Visual Recognition via Enriched Contextually Grounded Vision-Language Model
Dmitry Demidov
Zaigham Zaheer
Omkar Thawakar
Salman Khan
Fahad Shahbaz Khan
VLM
147
1
0
30 Jul 2025
Beyond Class Tokens: LLM-guided Dominant Property Mining for Few-shot Classification
Wei Zhuo
Runjie Luo
Wufeng Xue
Linlin Shen
402
0
0
28 Jul 2025
Prototype-Guided Pseudo-Labeling with Neighborhood-Aware Consistency for Unsupervised Adaptation
Eman Ali
Chetan Arora
Muhammad Haris Khan
VLM
263
0
0
22 Jul 2025
Dynamic Multimodal Prototype Learning in Vision-Language Models
Xingyu Zhu
Shuo Wang
B. Zhu
Miaoge Li
Yunfan Li
Junfeng Fang
Zhicai Wang
Dongsheng Wang
Hanwang Zhang
VLM
298
12
0
04 Jul 2025
Enabling Validation for Robust Few-Shot Recognition
Hanxin Wang
Tian Liu
Shu Kong
VLM
590
1
0
05 Jun 2025
Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Computer Vision and Pattern Recognition (CVPR), 2025
Xianing Chen
Si Huo
Borui Jiang
Hailin Hu
Xinghao Chen
OOD
406
5
0
22 May 2025
From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection
Lincan Cai
Jingxuan Kang
Shuang Li
Wenxuan Ma
Binhui Xie
Zhida Qin
Jian Liang
VLM
397
4
0
19 May 2025
FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models
Mainak Singha
Subhankar Roy
Sarthak Mehrotra
Ankit Jha
Moloud Abdar
Biplab Banerjee
Elisa Ricci
VLM
VPVLM
670
2
0
29 Apr 2025
FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
Yasser Benigmim
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Raoul de Charette
VLM
526
3
0
14 Apr 2025
SCRAMBLe: Enhancing Multimodal LLM Compositionality with Synthetic Preference Data
Samarth Mishra
Kate Saenko
Venkatesh Saligrama
CoGe
LRM
382
0
0
07 Apr 2025
Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification
Shijian Wang
Linxin Song
Ryotaro Shimizu
M. Goto
Hanqian Wu
VLM
278
1
0
06 Apr 2025
Self-Evolving Visual Concept Library using Vision-Language Critics
Computer Vision and Pattern Recognition (CVPR), 2025
Atharva Sehgal
Patrick Yuan
Ziniu Hu
Yisong Yue
Jennifer J. Sun
Swarat Chaudhuri
VLM
286
2
0
31 Mar 2025
Attribute-formed Class-specific Concept Space: Endowing Language Bottleneck Model with Better Interpretability and Scalability
Computer Vision and Pattern Recognition (CVPR), 2025
Jianyang Zhang
Qianli Luo
Guowu Yang
Wenjing Yang
Weide Liu
Guosheng Lin
Fengmao Lv
348
1
0
26 Mar 2025
Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Deepayan Das
Davide Talon
Yiming Wang
Goran Frehse
Elisa Ricci
VLM
LRM
611
5
0
24 Mar 2025
Mitigating Cache Noise in Test-Time Adaptation for Large Vision-Language Models
Haotian Zhai
Xinyu Chen
Can Zhang
Tianming Sha
Ruirui Li
BDL
VLM
525
3
0
24 Mar 2025
An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection
Louis Y. Kim
Michelle Karker
Victoria Valledor
Seiyoung C. Lee
Karl F. Brzoska
Margaret Duff
Anthony Palladino
VLM
ObjD
275
1
0
21 Mar 2025
Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance
Hui Liu
Wenya Wang
Kecheng Chen
Jie Liu
Yibing Liu
Tiexin Qin
Peisong He
Xinghao Jiang
Haoliang Li
BDL
VLM
1.0K
0
0
20 Mar 2025
OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP
Computer Vision and Pattern Recognition (CVPR), 2025
M. Cui
Divyam Gupta
Mainak Singha
Sai Bhargav Rongali
Ankit Jha
Muhammad Haris Khan
Biplab Banerjee
VLM
430
6
0
20 Mar 2025
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification
Ans Munir
Faisal Z. Qureshi
M. H. Khan
Mohsen Ali
VLM
464
1
0
15 Mar 2025
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images
IEEE Journal of Biomedical and Health Informatics (JBHI), 2025
M. Rahaman
Ewan K. A. Millar
Erik H. W. Meijering
VLM
359
4
0
13 Mar 2025
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification
Xiangyan Qu
Jing Yu
Jiamin Zhuang
Gaopeng Gou
Gang Xiong
Qi Wu
VLM
462
2
0
10 Mar 2025
Locally Explaining Prediction Behavior via Gradual Interventions and Measuring Property Gradients
Niklas Penzel
Joachim Denzler
FAtt
438
0
0
07 Mar 2025
Making Better Mistakes in CLIP-Based Zero-Shot Classification with Hierarchy-Aware Language Prompts
Tong Liang
Jim Davis
VLM
395
2
0
04 Mar 2025
A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models
Seyed Mohamad Ali Tousi
Ramy M. A. Farag
Jacket Demby's
Gbenga Omotara
John A. Lory
Guilherme N. DeSouza
1.0K
3
0
03 Mar 2025
ProAPO: Progressively Automatic Prompt Optimization for Visual Classification
Computer Vision and Pattern Recognition (CVPR), 2025
Xiangyan Qu
Gaopeng Gou
Jiamin Zhuang
Jing Yu
Kun Song
Qihao Wang
Yili Li
Gang Xiong
VLM
811
15
0
27 Feb 2025
SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2025
Kevin Miller
Samarth Mishra
Aditya Gangrade
Kate Saenko
Venkatesh Saligrama
VLM
385
1
0
24 Feb 2025
Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition
International Conference on Learning Representations (ICLR), 2025
Xinyu Tian
Shu Zou
Zhaoyuan Yang
Mengqi He
Jing Zhang
VLM
345
7
0
19 Feb 2025
DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery
Computer Vision and Pattern Recognition (CVPR), 2025
Utkarsh Mall
Cheng Perng Phoo
Mia Chiquier
Bharath Hariharan
Kavita Bala
Carl Vondrick
549
4
0
17 Feb 2025