ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.04544
  4. Cited By
CLIP-Adapter: Better Vision-Language Models with Feature Adapters

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

9 October 2021
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
    VLM
    CLIP
ArXivPDFHTML

Papers citing "CLIP-Adapter: Better Vision-Language Models with Feature Adapters"

50 / 637 papers shown
Title
Pre-Trained Vision-Language Models as Partial Annotators
Pre-Trained Vision-Language Models as Partial Annotators
Qian-Wei Wang
Yuqiu Xie
Letian Zhang
Zimo Liu
Shu-Tao Xia
VLM
22
2
0
23 May 2024
MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus
  Image-Text Expertise
MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise
Ruiqi Wu
Chenran Zhang
Jianle Zhang
Yi Zhou
Tao Zhou
Huazhu Fu
24
8
0
20 May 2024
Open-Vocabulary Object Detection via Neighboring Region Attention
  Alignment
Open-Vocabulary Object Detection via Neighboring Region Attention Alignment
Sunyuan Qiang
Xianfei Li
Yanyan Liang
Wenlong Liao
Tao He
Pai Peng
ObjD
14
0
0
14 May 2024
Navigating the Future of Federated Recommendation Systems with Foundation Models
Navigating the Future of Federated Recommendation Systems with Foundation Models
Zhiwei Li
Guodong Long
Chunxu Zhang
Honglei Zhang
Jing Jiang
Chengqi Zhang
103
0
0
12 May 2024
TAI++: Text as Image for Multi-Label Image Classification by Co-Learning
  Transferable Prompt
TAI++: Text as Image for Multi-Label Image Classification by Co-Learning Transferable Prompt
Xiangyu Wu
Qingjun Jiang
Yang Yang
Yifeng Wu
Qingguo Chen
Jianfeng Lu
VLM
VPVLM
24
6
0
11 May 2024
VLSM-Adapter: Finetuning Vision-Language Segmentation Efficiently with
  Lightweight Blocks
VLSM-Adapter: Finetuning Vision-Language Segmentation Efficiently with Lightweight Blocks
Manish Dhakal
Rabin Adhikari
Safal Thapaliya
Bishesh Khanal
VLM
19
3
0
10 May 2024
FlexEControl: Flexible and Efficient Multimodal Control for
  Text-to-Image Generation
FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation
Xuehai He
Jian Zheng
Jacob Zhiyuan Fang
Robinson Piramuthu
Mohit Bansal
Vicente Ordonez
Gunnar A. Sigurdsson
Nanyun Peng
Xin Eric Wang
DiffM
43
1
0
08 May 2024
Dual-Image Enhanced CLIP for Zero-Shot Anomaly Detection
Dual-Image Enhanced CLIP for Zero-Shot Anomaly Detection
Zhaoxiang Zhang
Hanqiu Deng
Jinan Bao
Xingyu Li
VLM
25
1
0
08 May 2024
On the test-time zero-shot generalization of vision-language models: Do
  we really need prompt learning?
On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?
Maxime Zanella
Ismail Ben Ayed
VLM
MLLM
35
22
0
03 May 2024
Understanding Retrieval-Augmented Task Adaptation for Vision-Language
  Models
Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models
Yifei Ming
Yixuan Li
VLM
23
7
0
02 May 2024
Revisiting the Adversarial Robustness of Vision Language Models: a
  Multimodal Perspective
Revisiting the Adversarial Robustness of Vision Language Models: a Multimodal Perspective
Wanqi Zhou
Shuanghao Bai
Qibin Zhao
Badong Chen
VLM
AAML
39
5
0
30 Apr 2024
Efficient Remote Sensing with Harmonized Transfer Learning and Modality
  Alignment
Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment
Tengjun Huang
31
0
0
28 Apr 2024
What Foundation Models can Bring for Robot Learning in Manipulation : A
  Survey
What Foundation Models can Bring for Robot Learning in Manipulation : A Survey
Dingzhe Li
Yixiang Jin
A. Yong
Hongze Yu
Jun Shi
Xiaoshuai Hao
Peng Hao
Huaping Liu
Fuchun Sun
Bin Fang
AI4CE
LM&Ro
64
12
0
28 Apr 2024
Spatio-Temporal Side Tuning Pre-trained Foundation Models for
  Video-based Pedestrian Attribute Recognition
Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition
Xiao Wang
Qian Zhu
Jiandong Jin
Jun Zhu
Futian Wang
Bowei Jiang
Yaowei Wang
Yonghong Tian
ViT
23
3
0
27 Apr 2024
Training-Free Unsupervised Prompt for Vision-Language Models
Training-Free Unsupervised Prompt for Vision-Language Models
Sifan Long
Linbin Wang
Zhen Zhao
Zichang Tan
Yiming Wu
Shengsheng Wang
Jingdong Wang
VLM
VPVLM
36
1
0
25 Apr 2024
Improving Multi-label Recognition using Class Co-Occurrence
  Probabilities
Improving Multi-label Recognition using Class Co-Occurrence Probabilities
Samyak Rawlekar
Shubhang Bhatnagar
Vishnuvardhan Pogunulu Srinivasulu
Narendra Ahuja
VLM
27
5
0
24 Apr 2024
Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP)
  for Enhanced Breast Cancer Diagnosis with Multi-view Mammography
Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP) for Enhanced Breast Cancer Diagnosis with Multi-view Mammography
Xuxin Chen
Yuheng Li
Mingzhe Hu
Ella Salari
Xiaoqian Chen
Richard L. J. Qiu
Bin Zheng
Xiaofeng Yang
VLM
25
6
0
24 Apr 2024
Multi-Modal Proxy Learning Towards Personalized Visual Multiple
  Clustering
Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering
Jiawei Yao
Qi Qian
Juhua Hu
22
14
0
24 Apr 2024
ECOR: Explainable CLIP for Object Recognition
ECOR: Explainable CLIP for Object Recognition
Ali Rasekh
Sepehr Kazemi Ranjbar
Milad Heidari
Wolfgang Nejdl
VLM
33
4
0
19 Apr 2024
Cross-Modal Adapter: Parameter-Efficient Transfer Learning Approach for
  Vision-Language Models
Cross-Modal Adapter: Parameter-Efficient Transfer Learning Approach for Vision-Language Models
Juncheng Yang
Zuchao Li
Shuai Xie
Weiping Zhu
Wei Yu
Shijun Li
VLM
11
2
0
19 Apr 2024
Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language
  Pre-training Models
Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models
Shouwei Ruan
Yinpeng Dong
Hanqing Liu
Yao Huang
Hang Su
Xingxing Wei
VLM
37
1
0
18 Apr 2024
Progressive Multi-modal Conditional Prompt Tuning
Progressive Multi-modal Conditional Prompt Tuning
Xiaoyu Qiu
Hao Feng
Yuechen Wang
Wen-gang Zhou
Houqiang Li
VLM
29
1
0
18 Apr 2024
Optimization of Prompt Learning via Multi-Knowledge Representation for
  Vision-Language Models
Optimization of Prompt Learning via Multi-Knowledge Representation for Vision-Language Models
Enming Zhang
Bingke Zhu
Yingying Chen
Qinghai Miao
Ming Tang
Jinqiao Wang
VLM
34
0
0
16 Apr 2024
Conditional Prototype Rectification Prompt Learning
Conditional Prototype Rectification Prompt Learning
Haoxing Chen
Yaohui Li
Zizheng Huang
Yan Hong
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Huijia Zhu
Weiqiang Wang
VLM
29
3
0
15 Apr 2024
The Devil is in the Few Shots: Iterative Visual Knowledge Completion for
  Few-shot Learning
The Devil is in the Few Shots: Iterative Visual Knowledge Completion for Few-shot Learning
Yaohui Li
Qifeng Zhou
Haoxing Chen
Jianbing Zhang
Xinyu Dai
Hao Zhou
VLM
31
0
0
15 Apr 2024
PracticalDG: Perturbation Distillation on Vision-Language Models for
  Hybrid Domain Generalization
PracticalDG: Perturbation Distillation on Vision-Language Models for Hybrid Domain Generalization
Zining Chen
Weiqiu Wang
Zhicheng Zhao
Fei Su
Aidong Men
Hongying Meng
VLM
30
7
0
13 Apr 2024
AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning
AMU-Tuning: Effective Logit Bias for CLIP-based Few-shot Learning
Yuwei Tang
Zhenyi Lin
Qilong Wang
Pengfei Zhu
Qinghua Hu
26
11
0
13 Apr 2024
PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical
  Image Classification
PM2: A New Prompting Multi-modal Model Paradigm for Few-shot Medical Image Classification
Zhenwei Wang
Qiule Sun
Bingbing Zhang
Pengfei Wang
Jianxin Zhang
Qiang Zhang
VLM
38
1
0
13 Apr 2024
On the Robustness of Language Guidance for Low-Level Vision Tasks:
  Findings from Depth Estimation
On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
Agneet Chatterjee
Tejas Gokhale
Chitta Baral
Yezhou Yang
VLM
25
2
0
12 Apr 2024
Improving Continuous Sign Language Recognition with Adapted Image Models
Improving Continuous Sign Language Recognition with Adapted Image Models
Lianyu Hu
Tongkai Shi
Liqing Gao
Zekang Liu
Wei Feng
VLM
18
5
0
12 Apr 2024
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic
  Segmentation
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation
Sina Hajimiri
Ismail Ben Ayed
Jose Dolz
VLM
31
22
0
12 Apr 2024
Transductive Zero-Shot and Few-Shot CLIP
Transductive Zero-Shot and Few-Shot CLIP
Ségolène Martin
Yunshi Huang
Fereshteh Shakeri
J. Pesquet
Ismail Ben Ayed
BDL
VLM
23
13
0
08 Apr 2024
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly
  Detection
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection
Xiaofan Li
Zhizhong Zhang
Xin Tan
Chengwei Chen
Yanyun Qu
Yuan Xie
Lizhuang Ma
VLM
47
35
0
08 Apr 2024
WorDepth: Variational Language Prior for Monocular Depth Estimation
WorDepth: Variational Language Prior for Monocular Depth Estimation
Ziyao Zeng
Daniel Wang
Fengyu Yang
Hyoungseob Park
Yangchao Wu
Stefano Soatto
Byung-Woo Hong
Dong Lao
Alex Wong
MDE
38
26
0
04 Apr 2024
Learning Transferable Negative Prompts for Out-of-Distribution Detection
Learning Transferable Negative Prompts for Out-of-Distribution Detection
Tianqi Li
Guansong Pang
Xiaolong Bai
Wenjun Miao
Jingyi Zheng
VLM
42
12
0
04 Apr 2024
LP++: A Surprisingly Strong Linear Probe for Few-Shot CLIP
LP++: A Surprisingly Strong Linear Probe for Few-Shot CLIP
Yunshi Huang
Fereshteh Shakeri
Jose Dolz
Malik Boudiaf
Houda Bahig
Ismail Ben Ayed
19
19
0
02 Apr 2024
Unknown Prompt, the only Lacuna: Unveiling CLIP's Potential for Open
  Domain Generalization
Unknown Prompt, the only Lacuna: Unveiling CLIP's Potential for Open Domain Generalization
Mainak Singha
Ankit Jha
Shirsha Bose
Ashwin Nair
Moloud Abdar
Biplab Banerjee
VLM
35
10
0
31 Mar 2024
Training-Free Semantic Segmentation via LLM-Supervision
Training-Free Semantic Segmentation via LLM-Supervision
Wenfang Sun
Yingjun Du
Gaowen Liu
Ramana Rao Kompella
Cees G. M. Snoek
VLM
35
2
0
31 Mar 2024
Bayesian Exploration of Pre-trained Models for Low-shot Image
  Classification
Bayesian Exploration of Pre-trained Models for Low-shot Image Classification
Yibo Miao
Yu Lei
Feng Zhou
Zhijie Deng
VLM
UQCV
BDL
38
1
0
30 Mar 2024
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action
  Generalization
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization
Anna Kukleva
Fadime Sener
Edoardo Remelli
Bugra Tekin
Eric Sauser
Bernt Schiele
Shugao Ma
VLM
EgoV
29
1
0
28 Mar 2024
CLAP4CLIP: Continual Learning with Probabilistic Finetuning for
  Vision-Language Models
CLAP4CLIP: Continual Learning with Probabilistic Finetuning for Vision-Language Models
Saurav Jha
Dong Gong
Lina Yao
CLIP
VLM
33
7
0
28 Mar 2024
Efficient Test-Time Adaptation of Vision-Language Models
Efficient Test-Time Adaptation of Vision-Language Models
Adilbek Karmanov
Dayan Guan
Shijian Lu
Abdulmotaleb El Saddik
Eric P. Xing
TTA
VLM
14
37
0
27 Mar 2024
Dual Memory Networks: A Versatile Adaptation Approach for
  Vision-Language Models
Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Yabin Zhang
Wen-Qing Zhu
Hui Tang
Zhiyuan Ma
Kaiyang Zhou
Lei Zhang
VLM
29
21
0
26 Mar 2024
FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in
  RKHSs
FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs
Sepehr Dehdashtian
Lan Wang
Vishnu Naresh Boddeti
VLM
22
11
0
22 Mar 2024
Not All Attention is Needed: Parameter and Computation Efficient
  Transfer Learning for Multi-modal Large Language Models
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models
Qiong Wu
Weihao Ye
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
MoE
25
1
0
22 Mar 2024
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Zeyu Han
Chao Gao
Jinyang Liu
Jeff Zhang
Sai Qian Zhang
136
301
0
21 Mar 2024
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization
  with Vision-Language Models
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models
Elaine Sui
Xiaohan Wang
Serena Yeung-Levy
VLM
14
5
0
19 Mar 2024
Learning Cross-view Visual Geo-localization without Ground Truth
Learning Cross-view Visual Geo-localization without Ground Truth
Haoyuan Li
Chang Xu
Wen Yang
Huai Yu
Gui-Song Xia
28
8
0
19 Mar 2024
Adapting Visual-Language Models for Generalizable Anomaly Detection in
  Medical Images
Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
Chaoqin Huang
Aofan Jiang
Jinghao Feng
Ya-Qin Zhang
Xinchao Wang
Yanfeng Wang
MedIm
28
24
0
19 Mar 2024
Boosting Continual Learning of Vision-Language Models via
  Mixture-of-Experts Adapters
Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters
Jiazuo Yu
Yunzhi Zhuge
Lu Zhang
Ping Hu
Dong Wang
Huchuan Lu
You He
VLM
KELM
CLL
OODD
108
67
0
18 Mar 2024
Previous
123...567...111213
Next