ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.04544
  4. Cited By
CLIP-Adapter: Better Vision-Language Models with Feature Adapters

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

9 October 2021
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
    VLM
    CLIP
ArXivPDFHTML

Papers citing "CLIP-Adapter: Better Vision-Language Models with Feature Adapters"

50 / 635 papers shown
Title
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models
BiomedCoOp: Learning to Prompt for Biomedical Vision-Language Models
Taha Koleilat
Hojat Asgariandehkordi
H. Rivaz
Yiming Xiao
VLM
100
0
0
21 Nov 2024
Video-to-Task Learning via Motion-Guided Attention for Few-Shot Action Recognition
Hanyu Guo
Wanchuan Yu
Suzhou Que
Kaiwen Du
Yan Yan
Hanzi Wang
68
1
0
18 Nov 2024
Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
Wentao Bao
K. Li
Yuxiao Chen
Deep Patel
Martin Renqiang Min
Yu Kong
VLM
ObjD
32
2
0
17 Nov 2024
UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language
  Models
UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models
Jiachen Liang
Ruibing Hou
Minyang Hu
Hong Chang
Shiguang Shan
Xilin Chen
VLM
36
1
0
11 Nov 2024
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for
  Autonomous Driving
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving
Tao Ma
Hongbin Zhou
Qiusheng Huang
Xuemeng Yang
Jianfei Guo
Bo Zhang
Min Dou
Yu Qiao
Botian Shi
Hongsheng Li
21
1
0
08 Nov 2024
Aligning Characteristic Descriptors with Images for Human-Expert-like
  Explainability
Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability
Bharat Yalavarthi
N. Ratha
25
0
0
06 Nov 2024
Identifying Implicit Social Biases in Vision-Language Models
Identifying Implicit Social Biases in Vision-Language Models
Kimia Hamidieh
Haoran Zhang
Walter Gerych
Thomas Hartvigsen
Marzyeh Ghassemi
VLM
28
11
0
01 Nov 2024
Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification
Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification
Shengxun Wei
Zan Gao
Yibo Zhao
Weili Guan
Weili Guan
Shengyong Chen
44
1
0
01 Nov 2024
Language-guided Hierarchical Fine-grained Image Forgery Detection and
  Localization
Language-guided Hierarchical Fine-grained Image Forgery Detection and Localization
Xiao Guo
Xiaohong Liu
I. Masi
Xiaoming Liu
90
9
0
31 Oct 2024
Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic
  Classifier
Multi-Class Textual-Inversion Secretly Yields a Semantic-Agnostic Classifier
Kai Wang
Fei Yang
Bogdan Raducanu
Joost van de Weijer
26
1
0
29 Oct 2024
Text-Guided Attention is All You Need for Zero-Shot Robustness in
  Vision-Language Models
Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models
Lu Yu
Haiyang Zhang
Changsheng Xu
AAML
VLM
21
3
0
29 Oct 2024
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy
  Segment Optimization
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization
Wanhua Li
Zibin Meng
Jiawei Zhou
D. Wei
Chuang Gan
Hanspeter Pfister
LRM
VLM
22
5
0
28 Oct 2024
Domain Adaptation with a Single Vision-Language Embedding
Domain Adaptation with a Single Vision-Language Embedding
Mohammad Fahes
Tuan-Hung Vu
Andrei Bursuc
Patrick Pérez
Raoul de Charette
VLM
16
0
0
28 Oct 2024
Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained
  Models via Model Editing
Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Dongliang Guo
Mengxuan Hu
Zihan Guan
Junfeng Guo
Thomas Hartvigsen
Sheng R. Li
AAML
16
0
0
23 Oct 2024
MI-VisionShot: Few-shot adaptation of vision-language models for
  slide-level classification of histopathological images
MI-VisionShot: Few-shot adaptation of vision-language models for slide-level classification of histopathological images
Pablo Meseguer
Rocío del Amor
Valery Naranjo
VLM
14
0
0
21 Oct 2024
BoostAdapter: Improving Vision-Language Test-Time Adaptation via
  Regional Bootstrapping
BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping
Taolin Zhang
J. T. Wang
Hang Guo
Tao Dai
B. Chen
Shu-Tao Xia
VLM
TTA
14
0
0
20 Oct 2024
Dual Prototype Evolving for Test-Time Generalization of Vision-Language
  Models
Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models
Ce Zhang
Simon Stepputtis
Katia P. Sycara
Yaqi Xie
VLM
30
5
0
16 Oct 2024
TransAgent: Transfer Vision-Language Foundation Models with
  Heterogeneous Agent Collaboration
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
Yiwei Guo
Shaobin Zhuang
Kunchang Li
Yu Qiao
Yali Wang
VLM
CLIP
21
0
0
16 Oct 2024
A Survey of Low-shot Vision-Language Model Adaptation via Representer
  Theorem
A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem
Kun Ding
Ying Wang
Gaofeng Meng
Shiming Xiang
VLM
27
0
0
15 Oct 2024
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
LoKO: Low-Rank Kalman Optimizer for Online Fine-Tuning of Large Models
Hossein Abdi
Mingfei Sun
Andi Zhang
Samuel Kaski
Wei Pan
23
0
0
15 Oct 2024
Queryable Prototype Multiple Instance Learning with Vision-Language
  Models for Incremental Whole Slide Image Classification
Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification
Jiaxiang Gou
Luping Ji
Pei Liu
Mao Ye
VLM
28
0
0
14 Oct 2024
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes
Jianqi Chen
Panwen Hu
Xiaojun Chang
Z. Shi
Michael C. Kampffmeyer
Xiaodan Liang
46
5
0
14 Oct 2024
Understanding Robustness of Parameter-Efficient Tuning for Image
  Classification
Understanding Robustness of Parameter-Efficient Tuning for Image Classification
Jiacheng Ruan
Xian Gao
Suncheng Xiang
Mingye Xie
Ting Liu
Yuzhuo Fu
AAML
VLM
19
0
0
13 Oct 2024
Deep Transfer Learning: Model Framework and Error Analysis
Deep Transfer Learning: Model Framework and Error Analysis
Yuling Jiao
Huazhen Lin
Yuchen Luo
Jerry Zhijian Yang
34
1
0
12 Oct 2024
Debiasing Vison-Language Models with Text-Only Training
Debiasing Vison-Language Models with Text-Only Training
Yunfan Yang
Chaoquan Jiang
Zhiyu Lin
Jinlin Xiao
Jiaming Zhang
Jitao Sang
VLM
23
1
0
12 Oct 2024
Contrastive Learning for Implicit Social Factors in Social Media
  Popularity Prediction
Contrastive Learning for Implicit Social Factors in Social Media Popularity Prediction
Zhizhen Zhang
Ruihong Qiu
Xiaohui Xie
18
0
0
12 Oct 2024
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object
  Detection
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection
H. Li
Rui Zhang
Hantao Yao
X. Zhang
Yifan Hao
Xinkai Song
Xiaqing Li
Yongwei Zhao
Ling Li
Yunji Chen
ObjD
VLM
23
3
0
11 Oct 2024
Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation
Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation
Kun Ding
Qiang Yu
Haojian Zhang
Gaofeng Meng
Shiming Xiang
VLM
18
0
0
11 Oct 2024
LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts
LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts
Anh-Quan Cao
M. Jaritz
Matthieu Guillaumin
Raoul de Charette
Loris Bazzani
VLM
CLIP
32
2
0
10 Oct 2024
HeGraphAdapter: Tuning Multi-Modal Vision-Language Models with
  Heterogeneous Graph Adapter
HeGraphAdapter: Tuning Multi-Modal Vision-Language Models with Heterogeneous Graph Adapter
Yumiao Zhao
Bo Jiang
Xiao Wang
Qin Xu
Jin Tang
VLM
23
0
0
10 Oct 2024
FLIER: Few-shot Language Image Models Embedded with Latent
  Representations
FLIER: Few-shot Language Image Models Embedded with Latent Representations
Zhinuo Zhou
Peng Zhou
Xiaoyong Pan
VLM
19
0
0
10 Oct 2024
Language-Assisted Human Part Motion Learning for Skeleton-Based Temporal
  Action Segmentation
Language-Assisted Human Part Motion Learning for Skeleton-Based Temporal Action Segmentation
Bowen Chen
Haoyu Ji
Zhiyong Wang
Benjamin Filtjens
C. Wang
Weihong Ren
Bart Vanrumste
Honghai Liu
43
0
0
08 Oct 2024
ModalPrompt:Dual-Modality Guided Prompt for Continual Learning of Large
  Multimodal Models
ModalPrompt:Dual-Modality Guided Prompt for Continual Learning of Large Multimodal Models
Fanhu Zeng
Fei Zhu
Haiyang Guo
Xu-Yao Zhang
Cheng-Lin Liu
VLM
CLL
18
6
0
08 Oct 2024
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in
  Open-Vocabulary Detection
SIA-OVD: Shape-Invariant Adapter for Bridging the Image-Region Gap in Open-Vocabulary Detection
Zishuo Wang
Wenhao Zhou
Jinglin Xu
Yuxin Peng
ObjD
VLM
11
1
0
08 Oct 2024
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through
  Language Descriptions
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Ziyao Zeng
Yangchao Wu
Hyoungseob Park
Daniel Wang
Fengyu Yang
Stefano Soatto
Dong Lao
Byung-Woo Hong
Alex Wong
MDE
16
7
0
03 Oct 2024
Understanding and Mitigating Miscalibration in Prompt Tuning for
  Vision-Language Models
Understanding and Mitigating Miscalibration in Prompt Tuning for Vision-Language Models
Shuoyuan Wang
Yixuan Li
Hongxin Wei
VLM
36
2
0
03 Oct 2024
Robust Imitation Learning for Mobile Manipulator Focusing on
  Task-Related Viewpoints and Regions
Robust Imitation Learning for Mobile Manipulator Focusing on Task-Related Viewpoints and Regions
Yutaro Ishida
Yuki Noguchi
Takayuki Kanai
Kazuhiro Shintani
Hiroshi Bito
19
1
0
02 Oct 2024
Rethinking Misalignment in Vision-Language Model Adaptation from a
  Causal Perspective
Rethinking Misalignment in Vision-Language Model Adaptation from a Causal Perspective
Yanan Zhang
Jiangmeng Li
Lixiang Liu
Wenwen Qiang
VLM
18
1
0
01 Oct 2024
SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning
  for Surgical Phase Recognition
SurgPETL: Parameter-Efficient Image-to-Surgical-Video Transfer Learning for Surgical Phase Recognition
Shu Yang
Zhiyuan Cai
Luyang Luo
Ning Ma
Shuchang Xu
Hao Chen
16
0
0
30 Sep 2024
FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image
  Classification
FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification
Kexue Fu
Xiaoyuan Luo
Linhao Qu
Shuo Wang
Ying Xiong
Ilias Maglogiannis
Longxiang Gao
Manning Wang
26
1
0
29 Sep 2024
DOTA: Distributional Test-Time Adaptation of Vision-Language Models
DOTA: Distributional Test-Time Adaptation of Vision-Language Models
Zongbo Han
Jialong Yang
Junfan Li
Qinghua Hu
Qianli Xu
Mike Zheng Shou
Changqing Zhang
TTA
VLM
41
2
0
28 Sep 2024
Learning to Obstruct Few-Shot Image Classification over Restricted
  Classes
Learning to Obstruct Few-Shot Image Classification over Restricted Classes
Amber Yijia Zheng
Chiao-An Yang
Raymond A. Yeh
19
1
0
28 Sep 2024
Cascade Prompt Learning for Vision-Language Model Adaptation
Cascade Prompt Learning for Vision-Language Model Adaptation
Ge Wu
Xin Zhang
Zheng Li
Zhaowei Chen
Jiajun Liang
Jian Yang
Xiang Li
VLM
19
6
0
26 Sep 2024
Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications
Robotic-CLIP: Fine-tuning CLIP on Action Data for Robotic Applications
Nghia Nguyen
Minh Nhat Vu
Tung D. Ta
Baoru Huang
T. Vo
Ngan Le
Anh Nguyen
VLM
CLIP
38
3
0
26 Sep 2024
Global-Local Medical SAM Adaptor Based on Full Adaption
Global-Local Medical SAM Adaptor Based on Full Adaption
Meng Wang
Yarong Feng
Yongwei Tang
Tian Zhang
Yuxin Liang
Chao Lv
MedIm
30
0
0
26 Sep 2024
Vision-Language Model Fine-Tuning via Simple Parameter-Efficient
  Modification
Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification
Ming Li
J. Zhong
Chenxin Li
Liuzhuozheng Li
Nie Lin
Masashi Sugiyama
CLIP
VLM
18
2
0
25 Sep 2024
MemeCLIP: Leveraging CLIP Representations for Multimodal Meme
  Classification
MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification
Siddhant Bikram Shah
Shuvam Shiwakoti
Maheep Chaudhary
Haohan Wang
VLM
17
8
0
23 Sep 2024
TSCLIP: Robust CLIP Fine-Tuning for Worldwide Cross-Regional Traffic Sign Recognition
TSCLIP: Robust CLIP Fine-Tuning for Worldwide Cross-Regional Traffic Sign Recognition
Guoyang Zhao
Fulong Ma
Weiqing Qi
Chenguang Zhang
Yuxuan Liu
Ming Liu
Jun Ma
VLM
CLIP
36
3
0
23 Sep 2024
PromptTA: Prompt-driven Text Adapter for Source-free Domain
  Generalization
PromptTA: Prompt-driven Text Adapter for Source-free Domain Generalization
Haoran Zhang
Shuanghao Bai
Wanqi Zhou
Jingwen Fu
Badong Chen
VLM
OOD
TTA
18
2
0
21 Sep 2024
Enhancing Perception of Key Changes in Remote Sensing Image Change
  Captioning
Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning
Cong Yang
Zuchao Li
Hongzan Jiao
Zhi Gao
Lefei Zhang
32
1
0
19 Sep 2024
Previous
123456...111213
Next