ResearchTrend.AI

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

9 October 2021
Peng Gao
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Yongfeng Zhang
Hongsheng Li
Yu Qiao
    VLM
    CLIP

Papers citing "CLIP-Adapter: Better Vision-Language Models with Feature Adapters"

50 / 635 papers shown
LARE: Latent Augmentation using Regional Embedding with Vision-Language Model
Kosuke Sakurai
Tatsuya Ishii
Ryotaro Shimizu
Linxin Song
Masayuki Goto
VLM
19
0
0
19 Sep 2024
CLIP Adaptation by Intra-modal Overlap Reduction
A. Kravets
V. Namboodiri
VLM
24
0
0
17 Sep 2024
CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera
Jingpei Lu
Zekai Liang
Tristin Xie
Florian Richter
Shan Lin
Sainan Liu
Michael C. Yip
29
4
0
16 Sep 2024
MFCLIP: Multi-modal Fine-grained CLIP for Generalizable Diffusion Face Forgery Detection
Yaning Zhang
Tianyi Wang
Zitong Yu
Zan Gao
Linlin Shen
Shengyong Chen
DiffM
65
3
0
15 Sep 2024
QTG-VQA: Question-Type-Guided Architectural for VideoQA Systems
Zhixian He
Pengcheng Zhao
Fuwei Zhang
Shujin Lin
31
0
0
14 Sep 2024
Generalization Boosted Adapter for Open-Vocabulary Segmentation
Wenhao Xu
Changwei Wang
Xuxiang Feng
Rongtao Xu
Longzhao Huang
Zherui Zhang
Li Guo
Shibiao Xu
VLM
31
2
0
13 Sep 2024
Rethinking Prompting Strategies for Multi-Label Recognition with Partial Annotations
Samyak Rawlekar
Shubhang Bhatnagar
Narendra Ahuja
VLM
18
1
0
12 Sep 2024
SPARK: Self-supervised Personalized Real-time Monocular Face Capture
Kelian Baert
Shrisha Bharadwaj
Fabien Castan
Benoit Maujean
Marc Christie
Victoria Fernandez-Abrevaya
A. Boukhayma
CVBM
3DH
42
2
0
12 Sep 2024
Self-Masking Networks for Unsupervised Adaptation
Alfonso Taboada Warmerdam
Mathilde Caron
Yuki M. Asano
29
1
0
11 Sep 2024
Revisiting Prompt Pretraining of Vision-Language Models
Zhenyuan Chen
Lingfeng Yang
Shuo Chen
Zhaowei Chen
Jiajun Liang
Xiang Li
MLLM
VPVLM
VLM
33
1
0
10 Sep 2024
Few-shot Adaptation of Medical Vision-Language Models
Fereshteh Shakeri
Yunshi Huang
Julio Silva-Rodríguez
Houda Bahig
An Tang
Jose Dolz
Ismail Ben Ayed
VLM
31
2
0
05 Sep 2024
Multi-Modal Adapter for Vision-Language Models
Dominykas Seputis
Serghei Mihailov
Soham Chatterjee
Zehao Xiao
VLM
14
1
0
03 Sep 2024
Affordance-based Robot Manipulation with Flow Matching
Fan Zhang
Michael Gienger
31
5
0
02 Sep 2024
COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation
Munish Monga
Sachin Kumar Giroh
Ankit Jha
Mainak Singha
Biplab Banerjee
Jocelyn Chanussot
35
2
0
31 Aug 2024
Adapting Vision-Language Models to Open Classes via Test-Time Prompt Tuning
Zhengqing Gao
Xiang Ao
Xu-Yao Zhang
Cheng-Lin Liu
VLM
VPVLM
26
0
0
29 Aug 2024
Hierarchical Visual Categories Modeling: A Joint Representation Learning and Density Estimation Framework for Out-of-Distribution Detection
Jinglun Li
Xinyu Zhou
Pinxue Guo
Yixuan Sun
Yiwen Huang
Weifeng Ge
Wenqiang Zhang
20
2
0
28 Aug 2024
HPT++: Hierarchically Prompting Vision-Language Models with Multi-Granularity Knowledge Generation and Improved Structure Modeling
Yubin Wang
Xinyang Jiang
De Cheng
Wenli Sun
Dongsheng Li
Cairong Zhao
VLM
27
0
0
27 Aug 2024
Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models
Shuai Fu
Xiequn Wang
Qiushi Huang
Yu Zhang
VLM
29
2
0
26 Aug 2024
Online Zero-Shot Classification with CLIP
Qi Qian
Juhua Hu
VLM
24
4
0
23 Aug 2024
Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition
Bozheng Li
Mushui Liu
Gaoang Wang
Yunlong Yu
13
5
0
22 Aug 2024
CLIPCleaner: Cleaning Noisy Labels with CLIP
Chen Feng
Georgios Tzimiropoulos
Ioannis Patras
VLM
24
1
0
19 Aug 2024
MePT: Multi-Representation Guided Prompt Tuning for Vision-Language Model
Xinyang Wang
Yi Yang
Minfeng Zhu
Kecheng Zheng
Shi Liu
Wei Chen
VPVLM
MLLM
VLM
34
1
0
19 Aug 2024
NoRA: Nested Low-Rank Adaptation for Efficient Fine-Tuning Large Models
Cheng Lin
Lujun Li
Dezhi Li
Jie Zou
Wei Xue
Yike Guo
AI4TS
28
4
0
18 Aug 2024
Segment Anything with Multiple Modalities
Aoran Xiao
Weihao Xuan
Heli Qi
Yun Xing
Naoto Yokoya
Shijian Lu
VLM
16
7
0
17 Aug 2024
DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models
Eman Ali
Sathira Silva
Muhammad Haris Khan
VLM
16
0
0
16 Aug 2024
Snuffy: Efficient Whole Slide Image Classifier
Hossein Jafarinia
Alireza Alipanah
Danial Hamdi
Saeed Razavi
Nahal Mirzaie
M. Rohban
3DH
36
1
0
15 Aug 2024
Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection
Yixin Guo
Yu Liu
Jianghao Li
Weimin Wang
Qi Jia
VLM
27
2
0
12 Aug 2024
Freehand Sketch Generation from Mechanical Components
Zhichao Liao
Di Huang
Heming Fang
Yue Ma
Fengyuan Piao
Xinghui Li
Long Zeng
Pingfa Feng
30
2
0
12 Aug 2024
Benchmarking In-the-wild Multimodal Disease Recognition and A Versatile Baseline
Tianqi Wei
Zhi Chen
Zi Huang
Xin Yu
12
6
0
06 Aug 2024
FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning
Zhi Chen
Zecheng Zhao
Yadan Luo
Zi Huang
DiffM
35
4
0
06 Aug 2024
Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection
Ting Lei
Shaofeng Yin
Yuxin Peng
Yang Liu
VLM
27
5
0
05 Aug 2024
A Survey of Mamba
Shuwei Shi
Shibing Chu
Rui An
Wenqi Fan
Yuee Xie
Hui Liu
Yuanping Chen
Qing Li
AI4CE
35
26
0
02 Aug 2024
Exploiting the Semantic Knowledge of Pre-trained Text-Encoders for Continual Learning
Lu Yu
Hesong Li
Ying Fu
J. Weijer
Changsheng Xu
CLL
44
1
0
02 Aug 2024
Multi-Modal Parameter-Efficient Fine-tuning via Graph Neural Network
Bin Cheng
Jiaxuan Lu
16
0
0
01 Aug 2024
Task-Adapter: Task-specific Adaptation of Image Models for Few-shot Action Recognition
Congqi Cao
Guibiao Liao
Yating Yu
Kanglin Liu
Lingtong Min
Yanning Zhang
30
3
0
01 Aug 2024
CC-SAM: SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation
Shreyank N. Gowda
David A. Clifton
MedIm
26
1
0
31 Jul 2024
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment
Anurag Das
Xinting Hu
Li Jiang
Bernt Schiele
VLM
29
3
0
31 Jul 2024
Image Re-Identification: Where Self-supervision Meets Vision-Language Learning
Bin Wang
Yuying Liang
Lei Cai
Huakun Huang
Huanqiang Zeng
VLM
LRM
19
0
0
30 Jul 2024
Advancing Prompt Learning through an External Layer
Fangming Cui
Xun Yang
Chao Wu
Liang Xiao
Xinmei Tian
VLM
29
1
0
29 Jul 2024
Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation
Tz-Ying Wu
Kyle Min
Subarna Tripathi
Nuno Vasconcelos
EgoV
51
0
0
28 Jul 2024
Multi-modal Data Binding for Survival Analysis Modeling with Incomplete Data and Annotations
Linhao Qu
Dan Huang
Shaoting Zhang
Xiaosong Wang
14
2
0
25 Jul 2024
Selective Vision-Language Subspace Projection for Few-shot CLIP
Xingyu Zhu
Beier Zhu
Yi Tan
Shuo Wang
Yanbin Hao
H. Zhang
VLM
33
3
0
24 Jul 2024
Zero-Shot Embeddings Inform Learning and Forgetting with Vision-Language Encoders
Laura Niss
Kevin Vogt-Lowell
Theodoros Tsiligkaridis
VLM
22
0
0
22 Jul 2024
Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning
Jingchen Sun
Rohan Sharma
Vishnu Suresh Lokhande
Changyou Chen
28
0
0
22 Jul 2024
Rethinking Domain Adaptation and Generalization in the Era of CLIP
Ruoyu Feng
Tao Yu
Xin Jin
Xiaoyuan Yu
Lei Xiao
Zhibo Chen
VLM
26
1
0
21 Jul 2024
Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning
Chen Shen
Chunfeng Lian
Wanqing Zhang
Fan Wang
Jianhua Zhang
...
Hongshu Mu
Hao Wu
Xinggong Liang
Jianhua Ma
Zhenyuan Wang
26
0
0
20 Jul 2024
Class-Incremental Learning with CLIP: Adaptive Representation Adjustment and Parameter Fusion
Linlan Huang
Xusheng Cao
Haori Lu
Xialei Liu
CLL
40
11
0
19 Jul 2024
Improving Representation of High-frequency Components for Medical Visual Foundation Models
Yuetan Chu
Yilan Zhang
Zhongyi Han
Changchun Yang
Longxi Zhou
Gongning Luo
Chao Huang
Xin Gao
MedIm
24
1
0
19 Jul 2024
Robust Calibration of Large Vision-Language Adapters
Balamurali Murugesan
Julio Silva-Rodríguez
Ismail Ben Ayed
Jose Dolz
OODD
VLM
24
6
0
18 Jul 2024
ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion Map
Yilin Ye
Shishi Xiao
Xingchen Zeng
Wei Zeng
33
2
0
17 Jul 2024