Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.04544
Cited By
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
9 October 2021
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
VLM
CLIP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CLIP-Adapter: Better Vision-Language Models with Feature Adapters"
50 / 635 papers shown
Title
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models
Taha Koleilat
Hojat Asgariandehkordi
H. Rivaz
Yiming Xiao
VLM
100
0
0
21 Nov 2024
Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition
Hanyu Guo
Wanchuan Yu
Suzhou Que
Kaiwen Du
Yan Yan
Hanzi Wang
68
1
0
18 Nov 2024
Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Wentao Bao
K. Li
Yuxiao Chen
Deep Patel
Martin Renqiang Min
Yu Kong
VLM
ObjD
32
2
0
17 Nov 2024
UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models
Jiachen Liang
Ruibing Hou
Minyang Hu
Hong Chang
Shiguang Shan
Xilin Chen
VLM
36
1
0
11 Nov 2024
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving
Tao Ma
Hongbin Zhou
Qiusheng Huang
Xuemeng Yang
Jianfei Guo
Bo Zhang
Min Dou
Yu Qiao
Botian Shi
Hongsheng Li
21
1
0
08 Nov 2024
Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability
Bharat Yalavarthi
N. Ratha
25
0
0
06 Nov 2024
Identifying Implicit Social Biases in Vision-Language Models
Kimia Hamidieh
Haoran Zhang
Walter Gerych
Thomas Hartvigsen
Marzyeh Ghassemi
VLM
28
11
0
01 Nov 2024
Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification
Shengxun Wei
Zan Gao
Yibo Zhao
Weili Guan
Weili Guan
Shengyong Chen
44
1
0
01 Nov 2024
Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization
Xiao Guo
Xiaohong Liu
I. Masi
Xiaoming Liu
90
9
0
31 Oct 2024
Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier
Kai Wang
Fei Yang
Bogdan Raducanu
Joost van de Weijer
26
1
0
29 Oct 2024
Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models
Lu Yu
Haiyang Zhang
Changsheng Xu
AAML
VLM
21
3
0
29 Oct 2024
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization
Wanhua Li
Zibin Meng
Jiawei Zhou
D. Wei
Chuang Gan
Hanspeter Pfister
LRM
VLM
22
5
0
28 Oct 2024
Domain Adaptation with a Single Vision-Language Embedding
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Patrick Pérez
Raoul de Charette
VLM
16
0
0
28 Oct 2024
Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Dongliang Guo
Mengxuan Hu
Zihan Guan
Junfeng Guo
Thomas Hartvigsen
Sheng R. Li
AAML
16
0
0
23 Oct 2024
MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images
Pablo Meseguer
Rocío del Amor
Valery Naranjo
VLM
14
0
0
21 Oct 2024
BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping
Taolin Zhang
J. T. Wang
Hang Guo
Tao Dai
B. Chen
Shu-Tao Xia
VLM
TTA
14
0
0
20 Oct 2024
Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Ce Zhang
Simon Stepputtis
Katia P. Sycara
Yaqi Xie
VLM
30
5
0
16 Oct 2024
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Yiwei Guo
Shaobin Zhuang
Kunchang Li
Yu Qiao
Yali Wang
VLM
CLIP
21
0
0
16 Oct 2024
A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem
Kun Ding
Ying Wang
Gaofeng Meng
Shiming Xiang
VLM
27
0
0
15 Oct 2024
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
Hossein Abdi
Mingfei Sun
Andi Zhang
Samuel Kaski
Wei Pan
23
0
0
15 Oct 2024
Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification
Jiaxiang Gou
Luping Ji
Pei Liu
Mao Ye
VLM
28
0
0
14 Oct 2024
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
Jianqi Chen
Panwen Hu
Xiaojun Chang
Z. Shi
Michael C. Kampffmeyer
Xiaodan Liang
46
5
0
14 Oct 2024
Understanding Robustness of Parameter-Efficient Tuning for Image Classification
Jiacheng Ruan
Xian Gao
Suncheng Xiang
Mingye Xie
Ting Liu
Yuzhuo Fu
AAML
VLM
19
0
0
13 Oct 2024
Deep Transfer Learning: Model Framework and Error Analysis
Yuling Jiao
Huazhen Lin
Yuchen Luo
Jerry Zhijian Yang
34
1
0
12 Oct 2024
Debiasing Vison-Language Models with Text-Only Training
Yunfan Yang
Chaoquan Jiang
Zhiyu Lin
Jinlin Xiao
Jiaming Zhang
Jitao Sang
VLM
23
1
0
12 Oct 2024
Contrastive Learning for Implicit Social Factors in Social Media Popularity Prediction
Zhizhen Zhang
Ruihong Qiu
Xiaohui Xie
18
0
0
12 Oct 2024
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection
H. Li
Rui Zhang
Hantao Yao
X. Zhang
Yifan Hao
Xinkai Song
Xiaqing Li
Yongwei Zhao
Ling Li
Yunji Chen
ObjD
VLM
23
3
0
11 Oct 2024
Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation
Kun Ding
Qiang Yu
Haojian Zhang
Gaofeng Meng
Shiming Xiang
VLM
18
0
0
11 Oct 2024
LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts
Anh-Quan Cao
M. Jaritz
Matthieu Guillaumin
Raoul de Charette
Loris Bazzani
VLM
CLIP
32
2
0
10 Oct 2024
HeGraphAdapter: Tuning Multi-Modal Vision-Language Models with Heterogeneous Graph Adapter
Yumiao Zhao
Bo Jiang
Xiao Wang
Qin Xu
Jin Tang
VLM
23
0
0
10 Oct 2024
FLIER: Few-shot Language Image Models Embedded with Latent Representations
Zhinuo Zhou
Peng Zhou
Xiaoyong Pan
VLM
19
0
0
10 Oct 2024
Language-Assisted Human Part Motion Learning for Skeleton-Based Temporal Action Segmentation
Bowen Chen
Haoyu Ji
Zhiyong Wang
Benjamin Filtjens
C. Wang
Weihong Ren
Bart Vanrumste
Honghai Liu
43
0
0
08 Oct 2024
ModalPrompt:Dual-Modality Guided Prompt for Continual Learning of Large Multimodal Models
Fanhu Zeng
Fei Zhu
Haiyang Guo
Xu-Yao Zhang
Cheng-Lin Liu
VLM
CLL
18
6
0
08 Oct 2024
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection
Zishuo Wang
Wenhao Zhou
Jinglin Xu
Yuxin Peng
ObjD
VLM
11
1
0
08 Oct 2024
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Ziyao Zeng
Yangchao Wu
Hyoungseob Park
Daniel Wang
Fengyu Yang
Stefano Soatto
Dong Lao
Byung-Woo Hong
Alex Wong
MDE
16
7
0
03 Oct 2024
Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models
Shuoyuan Wang
Yixuan Li
Hongxin Wei
VLM
36
2
0
03 Oct 2024
Robust Imitation Learning for Mobile Manipulator Focusing on Task-Related Viewpoints and Regions
Yutaro Ishida
Yuki Noguchi
Takayuki Kanai
Kazuhiro Shintani
Hiroshi Bito
19
1
0
02 Oct 2024
Rethinking Misalignment in Vision-Language Model Adaptation from a Causal Perspective
Yanan Zhang
Jiangmeng Li
Lixiang Liu
Wenwen Qiang
VLM
18
1
0
01 Oct 2024
SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition
Shu Yang
Zhiyuan Cai
Luyang Luo
Ning Ma
Shuchang Xu
Hao Chen
16
0
0
30 Sep 2024
FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification
Kexue Fu
Xiaoyuan Luo
Linhao Qu
Shuo Wang
Ying Xiong
Ilias Maglogiannis
Longxiang Gao
Manning Wang
26
1
0
29 Sep 2024
DOTA: Distributional Test-Time Adaptation of Vision-Language Models
Zongbo Han
Jialong Yang
Junfan Li
Qinghua Hu
Qianli Xu
Mike Zheng Shou
Changqing Zhang
TTA
VLM
41
2
0
28 Sep 2024
Learning to Obstruct Few-Shot Image Classification over Restricted Classes
Amber Yijia Zheng
Chiao-An Yang
Raymond A. Yeh
19
1
0
28 Sep 2024
Cascade Prompt Learning for Vision-Language Model Adaptation
Ge Wu
Xin Zhang
Zheng Li
Zhaowei Chen
Jiajun Liang
Jian Yang
Xiang Li
VLM
19
6
0
26 Sep 2024
Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications
Nghia Nguyen
Minh Nhat Vu
Tung D. Ta
Baoru Huang
T. Vo
Ngan Le
Anh Nguyen
VLM
CLIP
38
3
0
26 Sep 2024
Global-Local Medical SAM Adaptor Based on Full Adaption
Meng Wang
Yarong Feng
Yongwei Tang
Tian Zhang
Yuxin Liang
Chao Lv
MedIm
30
0
0
26 Sep 2024
Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification
Ming Li
J. Zhong
Chenxin Li
Liuzhuozheng Li
Nie Lin
Masashi Sugiyama
CLIP
VLM
18
2
0
25 Sep 2024
MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification
Siddhant Bikram Shah
Shuvam Shiwakoti
Maheep Chaudhary
Haohan Wang
VLM
17
8
0
23 Sep 2024
TSCLIP: Robust CLIP Fine-Tuning for Worldwide Cross-Regional Traffic Sign Recognition
Guoyang Zhao
Fulong Ma
Weiqing Qi
Chenguang Zhang
Yuxuan Liu
Ming Liu
Jun Ma
VLM
CLIP
36
3
0
23 Sep 2024
PromptTA: Prompt-driven Text Adapter for Source-free Domain Generalization
Haoran Zhang
Shuanghao Bai
Wanqi Zhou
Jingwen Fu
Badong Chen
VLM
OOD
TTA
18
2
0
21 Sep 2024
Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning
Cong Yang
Zuchao Li
Hongzan Jiao
Zhi Gao
Lefei Zhang
32
1
0
19 Sep 2024
Previous
1
2
3
4
5
6
...
11
12
13
Next