2110.04544
Cited By
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
9 October 2021
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
VLM
CLIP
Papers citing
"CLIP-Adapter: Better Vision-Language Models with Feature Adapters"
50 / 637 papers shown
Title
Pre-Trained Vision-Language Models as Partial Annotators
Qian-Wei Wang
Yuqiu Xie
Letian Zhang
Zimo Liu
Shu-Tao Xia
VLM
22
2
0
23 May 2024
MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise
Ruiqi Wu
Chenran Zhang
Jianle Zhang
Yi Zhou
Tao Zhou
Huazhu Fu
24
8
0
20 May 2024
Open-Vocabulary Object Detection via Neighboring Region Attention Alignment
Sunyuan Qiang
Xianfei Li
Yanyan Liang
Wenlong Liao
Tao He
Pai Peng
ObjD
14
0
0
14 May 2024
Navigating the Future of Federated Recommendation Systems with Foundation Models
Zhiwei Li
Guodong Long
Chunxu Zhang
Honglei Zhang
Jing Jiang
Chengqi Zhang
103
0
0
12 May 2024
TAI++: Text as Image for Multi-Label Image Classification by Co-Learning Transferable Prompt
Xiangyu Wu
Qingjun Jiang
Yang Yang
Yifeng Wu
Qingguo Chen
Jianfeng Lu
VLM
VPVLM
24
6
0
11 May 2024
VLSM-Adapter: Finetuning Vision-Language Segmentation Efficiently with Lightweight Blocks
Manish Dhakal
Rabin Adhikari
Safal Thapaliya
Bishesh Khanal
VLM
19
3
0
10 May 2024
FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation
Xuehai He
Jian Zheng
Jacob Zhiyuan Fang
Robinson Piramuthu
Mohit Bansal
Vicente Ordonez
Gunnar A. Sigurdsson
Nanyun Peng
Xin Eric Wang
DiffM
43
1
0
08 May 2024
Dual-Image Enhanced CLIP for Zero-Shot Anomaly Detection
Zhaoxiang Zhang
Hanqiu Deng
Jinan Bao
Xingyu Li
VLM
25
1
0
08 May 2024
On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?
Maxime Zanella
Ismail Ben Ayed
VLM
MLLM
35
22
0
03 May 2024
Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models
Yifei Ming
Yixuan Li
VLM
23
7
0
02 May 2024
Revisiting the Adversarial Robustness of Vision Language Models: a Multimodal Perspective
Wanqi Zhou
Shuanghao Bai
Qibin Zhao
Badong Chen
VLM
AAML
39
5
0
30 Apr 2024
Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment
Tengjun Huang
31
0
0
28 Apr 2024
What Foundation Models can Bring for Robot Learning in Manipulation : A Survey
Dingzhe Li
Yixiang Jin
A. Yong
Hongze Yu
Jun Shi
Xiaoshuai Hao
Peng Hao
Huaping Liu
Fuchun Sun
Bin Fang
AI4CE
LM&Ro
64
12
0
28 Apr 2024
Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Xiao Wang
Qian Zhu
Jiandong Jin
Jun Zhu
Futian Wang
Bowei Jiang
Yaowei Wang
Yonghong Tian
ViT
23
3
0
27 Apr 2024
Training-Free Unsupervised Prompt for Vision-Language Models
Sifan Long
Linbin Wang
Zhen Zhao
Zichang Tan
Yiming Wu
Shengsheng Wang
Jingdong Wang
VLM
VPVLM
36
1
0
25 Apr 2024
Improving Multi-label Recognition using Class Co-Occurrence Probabilities
Samyak Rawlekar
Shubhang Bhatnagar
Vishnuvardhan Pogunulu Srinivasulu
Narendra Ahuja
VLM
27
5
0
24 Apr 2024
Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP) for Enhanced Breast Cancer Diagnosis with Multi-view Mammography
Xuxin Chen
Yuheng Li
Mingzhe Hu
Ella Salari
Xiaoqian Chen
Richard L. J. Qiu
Bin Zheng
Xiaofeng Yang
VLM
25
6
0
24 Apr 2024
Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering
Jiawei Yao
Qi Qian
Juhua Hu
22
14
0
24 Apr 2024
ECOR: Explainable CLIP for Object Recognition
Ali Rasekh
Sepehr Kazemi Ranjbar
Milad Heidari
Wolfgang Nejdl
VLM
33
4
0
19 Apr 2024
Cross-Modal Adapter: Parameter-Efficient Transfer Learning Approach for Vision-Language Models
Juncheng Yang
Zuchao Li
Shuai Xie
Weiping Zhu
Wei Yu
Shijun Li
VLM
11
2
0
19 Apr 2024
Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models
Shouwei Ruan
Yinpeng Dong
Hanqing Liu
Yao Huang
Hang Su
Xingxing Wei
VLM
37
1
0
18 Apr 2024
Progressive Multi-modal Conditional Prompt Tuning
Xiaoyu Qiu
Hao Feng
Yuechen Wang
Wen-gang Zhou
Houqiang Li
VLM
29
1
0
18 Apr 2024
Optimization of Prompt Learning via Multi-Knowledge Representation for Vision-Language Models
Enming Zhang
Bingke Zhu
Yingying Chen
Qinghai Miao
Ming Tang
Jinqiao Wang
VLM
34
0
0
16 Apr 2024
Conditional Prototype Rectification Prompt Learning
Haoxing Chen
Yaohui Li
Zizheng Huang
Yan Hong
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Huijia Zhu
Weiqiang Wang
VLM
29
3
0
15 Apr 2024
The Devil is in the Few Shots: Iterative Visual Knowledge Completion for Few-shot Learning
Yaohui Li
Qifeng Zhou
Haoxing Chen
Jianbing Zhang
Xinyu Dai
Hao Zhou
VLM
31
0
0
15 Apr 2024
PracticalDG: Perturbation Distillation on Vision-Language Models for Hybrid Domain Generalization
Zining Chen
Weiqiu Wang
Zhicheng Zhao
Fei Su
Aidong Men
Hongying Meng
VLM
30
7
0
13 Apr 2024
AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning
Yuwei Tang
Zhenyi Lin
Qilong Wang
Pengfei Zhu
Qinghua Hu
26
11
0
13 Apr 2024
PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification
Zhenwei Wang
Qiule Sun
Bingbing Zhang
Pengfei Wang
Jianxin Zhang
Qiang Zhang
VLM
38
1
0
13 Apr 2024
On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
Agneet Chatterjee
Tejas Gokhale
Chitta Baral
Yezhou Yang
VLM
25
2
0
12 Apr 2024
Improving Continuous Sign Language Recognition with Adapted Image Models
Lianyu Hu
Tongkai Shi
Liqing Gao
Zekang Liu
Wei Feng
VLM
18
5
0
12 Apr 2024
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
Sina Hajimiri
Ismail Ben Ayed
Jose Dolz
VLM
31
22
0
12 Apr 2024
Transductive Zero-Shot and Few-Shot CLIP
Ségolène Martin
Yunshi Huang
Fereshteh Shakeri
J. Pesquet
Ismail Ben Ayed
BDL
VLM
23
13
0
08 Apr 2024
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection
Xiaofan Li
Zhizhong Zhang
Xin Tan
Chengwei Chen
Yanyun Qu
Yuan Xie
Lizhuang Ma
VLM
47
35
0
08 Apr 2024
WorDepth: Variational Language Prior for Monocular Depth Estimation
Ziyao Zeng
Daniel Wang
Fengyu Yang
Hyoungseob Park
Yangchao Wu
Stefano Soatto
Byung-Woo Hong
Dong Lao
Alex Wong
MDE
38
26
0
04 Apr 2024
Learning Transferable Negative Prompts for Out-of-Distribution Detection
Tianqi Li
Guansong Pang
Xiaolong Bai
Wenjun Miao
Jingyi Zheng
VLM
42
12
0
04 Apr 2024
LP++: A Surprisingly Strong Linear Probe for Few-Shot CLIP
Yunshi Huang
Fereshteh Shakeri
Jose Dolz
Malik Boudiaf
Houda Bahig
Ismail Ben Ayed
19
19
0
02 Apr 2024
Unknown Prompt, the only Lacuna: Unveiling CLIP's Potential for Open Domain Generalization
Mainak Singha
Ankit Jha
Shirsha Bose
Ashwin Nair
Moloud Abdar
Biplab Banerjee
VLM
35
10
0
31 Mar 2024
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
35
2
0
31 Mar 2024
Bayesian Exploration of Pre-trained Models for Low-shot Image Classification
Yibo Miao
Yu Lei
Feng Zhou
Zhijie Deng
VLM
UQCV
BDL
38
1
0
30 Mar 2024
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization
Anna Kukleva
Fadime Sener
Edoardo Remelli
Bugra Tekin
Eric Sauser
Bernt Schiele
Shugao Ma
VLM
EgoV
29
1
0
28 Mar 2024
CLAP4CLIP: Continual Learning with Probabilistic Finetuning for Vision-Language Models
Saurav Jha
Dong Gong
Lina Yao
CLIP
VLM
33
7
0
28 Mar 2024
Efficient Test-Time Adaptation of Vision-Language Models
Adilbek Karmanov
Dayan Guan
Shijian Lu
Abdulmotaleb El Saddik
Eric P. Xing
TTA
VLM
14
37
0
27 Mar 2024
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Yabin Zhang
Wen-Qing Zhu
Hui Tang
Zhiyuan Ma
Kaiyang Zhou
Lei Zhang
VLM
29
21
0
26 Mar 2024
FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs
Sepehr Dehdashtian
Lan Wang
Vishnu Naresh Boddeti
VLM
22
11
0
22 Mar 2024
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models
Qiong Wu
Weihao Ye
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
MoE
25
1
0
22 Mar 2024
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Zeyu Han
Chao Gao
Jinyang Liu
Jeff Zhang
Sai Qian Zhang
136
301
0
21 Mar 2024
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
Elaine Sui
Xiaohan Wang
Serena Yeung-Levy
VLM
14
5
0
19 Mar 2024
Learning Cross-view Visual Geo-localization without Ground Truth
Haoyuan Li
Chang Xu
Wen Yang
Huai Yu
Gui-Song Xia
28
8
0
19 Mar 2024
Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
Chaoqin Huang
Aofan Jiang
Jinghao Feng
Ya-Qin Zhang
Xinchao Wang
Yanfeng Wang
MedIm
28
24
0
19 Mar 2024
Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters
Jiazuo Yu
Yunzhi Zhuge
Lu Zhang
Ping Hu
Dong Wang
Huchuan Lu
You He
VLM
KELM
CLL
OODD
108
67
0
18 Mar 2024