ResearchTrend.AI

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

9 October 2021
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
    VLM
    CLIP
arXiv: 2110.04544

Papers citing "CLIP-Adapter: Better Vision-Language Models with Feature Adapters"

50 / 637 papers shown
Robust Calibration of Large Vision-Language Adapters
Balamurali Murugesan
Julio Silva-Rodríguez
Ismail Ben Ayed
Jose Dolz
OODD
VLM
24
6
0
18 Jul 2024
ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion Map
Yilin Ye
Shishi Xiao
Xingchen Zeng
Wei Zeng
36
2
0
17 Jul 2024
Cross-Modal Augmentation for Few-Shot Multimodal Fake News Detection
Ye Jiang
Taihang Wang
Xiaoman Xu
Yimin Wang
Xingyi Song
Diana Maynard
22
2
0
16 Jul 2024
SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models
Yang Zhou
Yongjian Wu
Jiya Saiyin
Bingzheng Wei
Maode Lai
Eric Chang
Yan Xu
VLM
30
0
0
16 Jul 2024
MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration
Yulin Ren
Xin Li
Bingchen Li
Xingrui Wang
Mengxi Guo
Shijie Zhao
Li Zhang
Zhibo Chen
DiffM
36
7
0
15 Jul 2024
Quantized Prompt for Efficient Generalization of Vision-Language Models
Tianxiang Hao
Xiaohan Ding
Juexiao Feng
Yuhong Yang
Hui Chen
Guiguang Ding
VLM
MQ
16
5
0
15 Jul 2024
NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning
Yi Zhang
Chun-Wun Cheng
Ke Yu
Zhihai He
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
VLM
39
2
0
11 Jul 2024
Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization
Jinlong Li
Zequn Jie
Elisa Ricci
Lin Ma
N. Sebe
VLM
34
1
0
11 Jul 2024
AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization
Shixiong Xu
Chenghao Zhang
Lubin Fan
Gaofeng Meng
Shiming Xiang
Jieping Ye
VLM
35
4
0
11 Jul 2024
LEMoN: Label Error Detection using Multimodal Neighbors
Haoran Zhang
Aparna Balagopalan
Nassim Oufattole
Hyewon Jeong
Yan Wu
Jiacheng Zhu
Marzyeh Ghassemi
42
0
0
10 Jul 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
Haiwen Diao
Bo Wan
Xu Jia
Yunzhi Zhuge
Ying Zhang
Huchuan Lu
Long Chen
VLM
37
4
0
10 Jul 2024
Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification
Jiaying Shi
Xuetong Xue
Shenghui Xu
VLM
24
0
0
08 Jul 2024
Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models
Longxiang Tang
Zhuotao Tian
Kai Li
Chunming He
Hantao Zhou
Hengshuang Zhao
Xiu Li
Jiaya Jia
CLL
VLM
34
18
0
07 Jul 2024
CLIPVQA: Video Quality Assessment via CLIP
Fengchuang Xing
Mingjie Li
Yuan-Gen Wang
Guopu Zhu
Xiaochun Cao
CLIP
ViT
36
4
0
06 Jul 2024
AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Yuhan Zhu
Yuyang Ji
Zhiyu Zhao
Gangshan Wu
Limin Wang
VLM
39
7
0
05 Jul 2024
Dude: Dual Distribution-Aware Context Prompt Learning For Large Vision-Language Model
D. M. Nguyen
An T. Le
Trung Q. Nguyen
N. T. Diep
Tai Nguyen
D. Duong-Tran
Jan Peters
Li Shen
Mathias Niepert
Daniel Sonntag
VLM
26
2
0
05 Jul 2024
Do Generalised Classifiers really work on Human Drawn Sketches?
Hmrishav Bandyopadhyay
Pinaki Nath Chowdhury
Aneeshan Sain
Subhadeep Koley
Tao Xiang
A. Bhunia
Yi-Zhe Song
VLM
31
2
0
04 Jul 2024
Robust Adaptation of Foundation Models with Black-Box Visual Prompting
Changdae Oh
Gyeongdeok Seo
Geunyoung Jung
Zhi-Qi Cheng
Hosik Choi
Jiyoung Jung
Kyungwoo Song
VLM
26
1
0
04 Jul 2024
SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
Lukas Mauch
Marzieh Edraki
Aaron Courville
OODD
CLL
VLM
52
3
0
03 Jul 2024
Knowledge Composition using Task Vectors with Learned Anisotropic Scaling
Frederic Z. Zhang
Paul Albert
Cristian Rodriguez-Opazo
Anton van den Hengel
Ehsan Abbasnejad
MoMe
37
7
0
03 Jul 2024
Conceptual Codebook Learning for Vision-Language Models
Yi Zhang
Ke Yu
Siqi Wu
Zhihai He
VLM
32
2
0
02 Jul 2024
GalLoP: Learning Global and Local Prompts for Vision-Language Models
Marc Lafon
Elias Ramzi
Clément Rambour
Nicolas Audebert
Nicolas Thome
VLM
29
7
0
01 Jul 2024
CPT: Consistent Proxy Tuning for Black-box Optimization
Yuanyang He
Zitong Huang
Xinxing Xu
Rick Siow Mong Goh
Salman Khan
W. Zuo
Yong Liu
Chun-Mei Feng
30
0
0
01 Jul 2024
Embedded Visual Prompt Tuning
Wenqiang Zu
Shenghao Xie
Qing Zhao
Guoqi Li
Lei Ma
VLM
MedIm
44
9
0
01 Jul 2024
CLIP3D-AD: Extending CLIP for 3D Few-Shot Anomaly Detection with Multi-View Images Generation
Zuo Zuo
Jiahao Dong
Yao Wu
Yanyun Qu
Zongze Wu
24
3
0
27 Jun 2024
Advancing Cross-domain Discriminability in Continual Learning of Vision-Language Models
Yicheng Xu
Yuxin Chen
Jiahao Nie
Yusong Wang
Huiping Zhuang
Manabu Okumura
VLM
CLL
41
5
0
27 Jun 2024
Efficient and Long-Tailed Generalization for Pre-trained Vision-Language Model
Jiang-Xin Shi
Chi Zhang
Tong Wei
Yu-Feng Li
VLM
14
2
0
18 Jun 2024
MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning
Shuo Xu
Sai Wang
Xinyue Hu
Yutian Lin
Bo Du
Yu Wu
CoGe
46
0
0
18 Jun 2024
Mining Open Semantics from CLIP: A Relation Transition Perspective for Few-Shot Learning
Cilin Yan
Haochen Wang
Xiaolong Jiang
Yao Hu
Xu Tang
Guoliang Kang
E. Gavves
VLM
24
0
0
17 Jun 2024
Few-Shot Recognition via Stage-Wise Retrieval-Augmented Finetuning
Tian Liu
Huixin Zhang
Shubham Parashar
Shu Kong
21
2
0
17 Jun 2024
Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP
Shuyang Lin
Tong Jia
Hao Wang
Bowen Ma
Mingyuan Li
Dongyue Chen
VLM
ObjD
23
0
0
16 Jun 2024
Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings
Keno Moenck
Duc Trung Thieu
Julian Koch
Thorsten Schuppstuhl
VLM
27
0
0
14 Jun 2024
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams
Haoji Zhang
Yiqin Wang
Yansong Tang
Yong-Jin Liu
Jiashi Feng
Jifeng Dai
Xiaojie Jin
32
37
0
12 Jun 2024
Regularized Training with Generated Datasets for Name-Only Transfer of Vision-Language Models
Minho Park
S. Park
Jooyeol Yun
Jaegul Choo
VLM
22
0
0
08 Jun 2024
Boosting Vision-Language Models with Transduction
Maxime Zanella
Benoît Gérin
Ismail Ben Ayed
VLM
40
5
0
03 Jun 2024
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding
Thanh-Dat Truong
Utsav Prabhu
Dongyi Wang
Bhiksha Raj
Susan Gauch
J. Subbiah
Khoa Luu
43
2
0
03 Jun 2024
DP-IQA: Utilizing Diffusion Prior for Blind Image Quality Assessment in the Wild
Honghao Fu
Yufei Wang
Wenhan Yang
Bihan Wen
27
2
0
30 May 2024
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models
Yutao Zhu
Zhaoheng Huang
Zhicheng Dou
Ji-Rong Wen
RALM
45
5
0
30 May 2024
ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions
Honglin Lin
Siyu Li
Gu Nan
Chaoyue Tang
Xueting Wang
...
Yankai Rong
Zhili Zhou
Yutong Gao
Qimei Cui
Xiaofeng Tao
25
0
0
29 May 2024
Low-Rank Few-Shot Adaptation of Vision-Language Models
Maxime Zanella
Ismail Ben Ayed
OffRL
VLM
46
22
0
28 May 2024
WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization
Jiawei Ma
Yulei Niu
Shiyuan Huang
G. Han
Shih-Fu Chang
VLM
27
1
0
28 May 2024
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation
Ya Lu
Jishnu Jaykumar
Yunhui Guo
Nicholas Ruozzi
Yu Xiang
VLM
ISeg
48
4
0
28 May 2024
Synergy and Diversity in CLIP: Enhancing Performance Through Adaptive Backbone Ensembling
Cristian Rodriguez-Opazo
Ehsan Abbasnejad
Damien Teney
Edison Marrese-Taylor
Hamed Damirchi
A. Hengel
VLM
20
1
0
27 May 2024
CapS-Adapter: Caption-based MultiModal Adapter in Zero-Shot Classification
Qijie Wang
Guandu Liu
Bin Wang
VLM
19
2
0
26 May 2024
CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD Detection
Lin Zhu
Yifeng Yang
Qinying Gu
Xinbing Wang
Cheng Zhou
Nanyang Ye
VLM
22
2
0
26 May 2024
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
Yuchi Wang
Junliang Guo
Jianhong Bai
Runyi Yu
Tianyu He
Xu Tan
Xu Sun
Jiang Bian
DiffM
33
9
0
24 May 2024
Disease-informed Adaptation of Vision-Language Models
Jiajin Zhang
Ge Wang
M. Kalra
P. Yan
VLM
34
2
0
24 May 2024
CLIP model is an Efficient Online Lifelong Learner
Leyuan Wang
Liuyu Xiang
Yujie Wei
Yunlong Wang
Zhaofeng He
VLM
CLL
27
2
0
24 May 2024
Learning Invariant Causal Mechanism from Vision-Language Models
Zeen Song
Siyu Zhao
Xingyu Zhang
Jiangmeng Li
Changwen Zheng
Wenwen Qiang
CML
BDL
VLM
30
0
0
24 May 2024
What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models
Abdelrahman Abdelhamed
Mahmoud Afifi
Alec Go
MLLM
VLM
21
3
0
24 May 2024