ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.04544
  4. Cited By
CLIP-Adapter: Better Vision-Language Models with Feature Adapters

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

9 October 2021
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
    VLM
    CLIP
ArXivPDFHTML

Papers citing "CLIP-Adapter: Better Vision-Language Models with Feature Adapters"

50 / 635 papers shown
Title
Guide Your Agent with Adaptive Multimodal Rewards
Guide Your Agent with Adaptive Multimodal Rewards
Changyeon Kim
Younggyo Seo
Hao Liu
Lisa Lee
Jinwoo Shin
Honglak Lee
Kimin Lee
16
9
0
19 Sep 2023
CLIP-based Synergistic Knowledge Transfer for Text-based Person
  Retrieval
CLIP-based Synergistic Knowledge Transfer for Text-based Person Retrieval
Yating Liu
Yaowei Li
Zimo Liu
Wenming Yang
Yaowei Wang
Qingmin Liao
VLM
21
11
0
18 Sep 2023
Efficient Pyramid Channel Attention Network for Pathological Myopia
  Recognition
Efficient Pyramid Channel Attention Network for Pathological Myopia Recognition
Xiaoqing Zhang
Jilu Zhao
Yan Li
Hao Wu
Xiangtian Zhou
Jiang Liu
10
1
0
17 Sep 2023
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video
  Transfer Learning
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Yingya Zhang
Changxin Gao
Deli Zhao
Nong Sang
16
18
0
14 Sep 2023
DePT: Decoupled Prompt Tuning
DePT: Decoupled Prompt Tuning
Ji Zhang
Shihan Wu
Lianli Gao
Hengtao Shen
Jingkuan Song
VLM
16
27
0
14 Sep 2023
TAP: Targeted Prompting for Task Adaptive Generation of Textual Training
  Instances for Visual Classification
TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification
M. Jehanzeb Mirza
Leonid Karlinsky
Wei Lin
Horst Possegger
Rogerio Feris
Horst Bischof
VLM
27
6
0
13 Sep 2023
Efficient Adaptive Human-Object Interaction Detection with
  Concept-guided Memory
Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory
Ting Lei
Fabian Caba
Qingchao Chen
Hailin Jin
Yuxin Peng
Yang Liu
VLM
34
17
0
07 Sep 2023
Parameter and Computation Efficient Transfer Learning for
  Vision-Language Pre-trained Models
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models
Qiong Wu
Wei Yu
Yiyi Zhou
Shubin Huang
Xiaoshuai Sun
R. Ji
VLM
6
6
0
04 Sep 2023
BDC-Adapter: Brownian Distance Covariance for Better Vision-Language
  Reasoning
BDC-Adapter: Brownian Distance Covariance for Better Vision-Language Reasoning
Yi Zhang
Ce Zhang
Zihan Liao
Yushun Tang
Zhihai He
BDL
VLM
13
10
0
03 Sep 2023
LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for
  Vision-Language Models
LoGoPrompt: Synthetic Text Images Can Be Good Visual Prompts for Vision-Language Models
Cheng Shi
Sibei Yang
VLM
11
21
0
03 Sep 2023
Big-model Driven Few-shot Continual Learning
Big-model Driven Few-shot Continual Learning
Ziqi Gu
Chunyan Xu
Zihan Lu
Xin Liu
Anbo Dai
Zhen Cui
CLL
22
1
0
02 Sep 2023
Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot
  Anomaly Localization
Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization
Hanqiu Deng
Zhaoxiang Zhang
Jinan Bao
Xingyu Li
VLM
14
4
0
30 Aug 2023
Read-only Prompt Optimization for Vision-Language Few-shot Learning
Read-only Prompt Optimization for Vision-Language Few-shot Learning
Dongjun Lee
Seokwon Song
Jihee G. Suh
Joonmyeong Choi
S. Lee
Hyunwoo J.Kim
VLM
29
39
0
29 Aug 2023
Referring Image Segmentation Using Text Supervision
Referring Image Segmentation Using Text Supervision
Fang Liu
Yuhao Liu
Yuqiu Kong
Ke Xu
L. Zhang
Baocai Yin
Gerhard Hancke
Rynson W. H. Lau
27
25
0
28 Aug 2023
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient
  Parameter and Memory
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
Haiwen Diao
Bo Wan
Y. Zhang
Xuecong Jia
Huchuan Lu
Long Chen
VLM
23
17
0
28 Aug 2023
Fine-tuning can cripple your foundation model; preserving features may
  be the solution
Fine-tuning can cripple your foundation model; preserving features may be the solution
Jishnu Mukhoti
Y. Gal
Philip H. S. Torr
P. Dokania
CLL
24
29
0
25 Aug 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic
  Alignment
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
Shengxiang Zhang
Muzammal Naseer
Guangyi Chen
Zhiqiang Shen
Salman Khan
Kun Zhang
F. Khan
VLM
56
4
0
24 Aug 2023
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text
  Retrieval
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval
Yuan. Yuan
Yangfan Zhan
Zhitong Xiong
VLM
23
38
0
24 Aug 2023
CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No
CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No
Hualiang Wang
Yi Li
Huifeng Yao
X. Li
VLM
OODD
24
92
0
23 Aug 2023
GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised
  Learning
GOPro: Generate and Optimize Prompts in CLIP using Self-Supervised Learning
Mainak Singha
Ankit Jha
Biplab Banerjee
VLM
17
4
0
22 Aug 2023
Unsupervised Prototype Adapter for Vision-Language Models
Unsupervised Prototype Adapter for Vision-Language Models
Yi Zhang
Ce Zhang
Xue-mei Hu
Z. He
VLM
17
4
0
22 Aug 2023
ViLLA: Fine-Grained Vision-Language Representation Learning from
  Real-World Data
ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World Data
M. Varma
Jean-Benoit Delbrouck
Sarah Hooper
Akshay S. Chaudhari
C. Langlotz
VLM
CoGe
40
5
0
22 Aug 2023
An Examination of the Compositionality of Large Generative
  Vision-Language Models
An Examination of the Compositionality of Large Generative Vision-Language Models
Teli Ma
Rong Li
Junwei Liang
CoGe
19
2
0
21 Aug 2023
COCA: Classifier-Oriented Calibration via Textual Prototype for
  Source-Free Universal Domain Adaptation
COCA: Classifier-Oriented Calibration via Textual Prototype for Source-Free Universal Domain Adaptation
Xinghong Liu
Yi Zhou
Tao Zhou
Chun-Mei Feng
Ling Shao
VLM
17
2
0
21 Aug 2023
An Empirical Study of CLIP for Text-based Person Search
An Empirical Study of CLIP for Text-based Person Search
Min Cao
Yang Bai
Ziyin Zeng
Mang Ye
Min Zhang
VLM
36
36
0
19 Aug 2023
Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud
  Recognition
Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition
Xuanyu Yi
Jiajun Deng
Qianru Sun
Xiansheng Hua
J. Lim
Hanwang Zhang
3DPC
9
14
0
18 Aug 2023
The Unreasonable Effectiveness of Large Language-Vision Models for
  Source-free Video Domain Adaptation
The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation
Giacomo Zara
Alessandro Conti
Subhankar Roy
Stéphane Lathuilière
Paolo Rota
Elisa Ricci
25
11
0
17 Aug 2023
Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
Guangyi Chen
Xiao Liu
Guangrun Wang
Kun Zhang
Philip H.S.Torr
Xiaoping Zhang
Yansong Tang
17
17
0
16 Aug 2023
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
A Foundation Language-Image Model of the Retina (FLAIR): Encoding Expert Knowledge in Text Supervision
Julio Silva-Rodríguez
H. Chakor
Riadh Kobbi
Jose Dolz
Ismail Ben Ayed
VLM
MedIm
53
32
0
15 Aug 2023
ICPC: Instance-Conditioned Prompting with Contrastive Learning for
  Semantic Segmentation
ICPC: Instance-Conditioned Prompting with Contrastive Learning for Semantic Segmentation
Chaohui Yu
Qiang-feng Zhou
Zhibin Wang
Fan Wang
VLM
20
1
0
14 Aug 2023
Orthogonal Temporal Interpolation for Zero-Shot Video Recognition
Orthogonal Temporal Interpolation for Zero-Shot Video Recognition
Yan Zhu
Junbao Zhuo
B. Ma
Jiajia Geng
Xiaoming Wei
Xiaolin K. Wei
Shuhui Wang
VLM
17
5
0
14 Aug 2023
Foundation Model is Efficient Multimodal Multitask Model Selector
Foundation Model is Efficient Multimodal Multitask Model Selector
Fanqing Meng
Wenqi Shao
Zhanglin Peng
Chong Jiang
Kaipeng Zhang
Yu Qiao
Ping Luo
17
13
0
11 Aug 2023
Diverse Data Augmentation with Diffusions for Effective Test-time Prompt
  Tuning
Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
Chun-Mei Feng
Kai Yu
Yong Liu
Salman Khan
W. Zuo
VLM
17
75
0
11 Aug 2023
Exploring Part-Informed Visual-Language Learning for Person Re-Identification
Exploring Part-Informed Visual-Language Learning for Person Re-Identification
Y. Lin
Cong Liu
Yehansen Chen
Jinshui Hu
Bing Yin
Baocai Yin
Zengfu Wang
60
6
0
04 Aug 2023
DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition
  with Limited Annotations
DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition with Limited Annotations
Ping Hu
Ximeng Sun
Stan Sclaroff
Kate Saenko
VLM
24
21
0
03 Aug 2023
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge
  using Vision-Language Pre-Training Model
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
Ka Leong Cheng
Wenpo Song
Zheng Ma
Wenhao Zhu
Zi-Yue Zhu
Jianbing Zhang
CLIP
VLM
22
10
0
02 Aug 2023
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning
  in End-to-End Autonomous Driving
DriveAdapter: Breaking the Coupling Barrier of Perception and Planning in End-to-End Autonomous Driving
Xiaosong Jia
Yulu Gao
Li Chen
Junchi Yan
Patrick Langechuan Liu
Hongyang Li
9
64
0
01 Aug 2023
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for
  Complex Visual Reasoning Tasks
Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks
Kousik Rajesh
Mrigank Raman
M. A. Karim
Pranit Chawla
VLM
23
2
0
31 Jul 2023
Cross-Modal Concept Learning and Inference for Vision-Language Models
Cross-Modal Concept Learning and Inference for Vision-Language Models
Yi Zhang
Ce Zhang
Yushun Tang
Z. He
VLM
MLLM
CLIP
15
15
0
28 Jul 2023
Improving Social Media Popularity Prediction with Multiple Post
  Dependencies
Improving Social Media Popularity Prediction with Multiple Post Dependencies
Zhizhen Zhang
Xiao-Zhu Xie
Meng Yang
Ye Tian
Yong-jia Jiang
Yong Cui
19
5
0
28 Jul 2023
PromptStyler: Prompt-driven Style Generation for Source-free Domain
  Generalization
PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization
Junhyeong Cho
Gilhyun Nam
Sungyeon Kim
Hunmin Yang
Suha Kwak
VLM
OOD
TTA
16
47
0
27 Jul 2023
Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained
  Vision-Language Models
Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models
Kecheng Zheng
Wei Wu
Ruili Feng
Kai Zhu
Jiawei Liu
Deli Zhao
Zhengjun Zha
Wei Chen
Yujun Shen
VLM
6
8
0
27 Jul 2023
Selective Perception: Optimizing State Descriptions with Reinforcement
  Learning for Language Model Actors
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Kolby Nottingham
Yasaman Razeghi
Kyungmin Kim
JB Lanier
Pierre Baldi
Roy Fox
Sameer Singh
10
8
0
21 Jul 2023
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Mayug Maniparambil
Chris Vorster
D. Molloy
N. Murphy
Kevin McGuinness
Noel E. O'Connor
CLIP
VLM
MLLM
8
51
0
21 Jul 2023
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for
  Referring Image Segmentation
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
Zunnan Xu
Zhihong Chen
Yong Zhang
Yibing Song
Xiang Wan
Guanbin Li
VLM
9
47
0
21 Jul 2023
UP-DP: Unsupervised Prompt Learning for Data Pre-Selection with
  Vision-Language Models
UP-DP: Unsupervised Prompt Learning for Data Pre-Selection with Vision-Language Models
Xin Li
Sima Behpour
T. Doan
Wenbin He
Liangke Gou
Liu Ren
VLM
16
3
0
20 Jul 2023
Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Actor-agnostic Multi-label Action Recognition with Multi-modal Query
Anindya Mondal
Sauradip Nag
J. Prada
Xiatian Zhu
Anjan Dutta
11
7
0
20 Jul 2023
Pre-train, Adapt and Detect: Multi-Task Adapter Tuning for Camouflaged
  Object Detection
Pre-train, Adapt and Detect: Multi-Task Adapter Tuning for Camouflaged Object Detection
Yinghui Xing
Dexuan Kong
Shizhou Zhang
Geng Chen
Lingyan Ran
Peng Wang
Yanning Zhang
31
4
0
20 Jul 2023
PatchCT: Aligning Patch Set and Label Set with Conditional Transport for
  Multi-Label Image Classification
PatchCT: Aligning Patch Set and Label Set with Conditional Transport for Multi-Label Image Classification
Miaoge Li
Dongsheng Wang
Xinyang Liu
Zequn Zeng
Ruiying Lu
Bo Chen
Mingyuan Zhou
VLM
OT
11
15
0
18 Jul 2023
Improving Zero-Shot Generalization for CLIP with Synthesized Prompts
Improving Zero-Shot Generalization for CLIP with Synthesized Prompts
Z. Wang
Jian Liang
R. He
Nana Xu
Zilei Wang
Tien-Ping Tan
VLM
19
47
0
14 Jul 2023
Previous
123...1011121389
Next