Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions

Computer Vision and Pattern Recognition (CVPR), 2024
4 January 2024
Oindrila Saha
Grant Van Horn
Subhransu Maji
    VLM
arXiv: 2401.02460 | GitHub (39★)

Papers citing "Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions"

33 / 33 papers shown
Not All Birds Look The Same: Identity-Preserving Generation For Birds
Aaron Sun
Oindrila Saha
Subhransu Maji
163
0
0
04 Dec 2025
Culture in Action: Evaluating Text-to-Image Models through Social Activities
Sina Malakouti
Boqing Gong
Adriana Kovashka
EGVM, VLM
431
1
0
07 Nov 2025
[De|Re]constructing VLMs' Reasoning in Counting
Simone Alghisi
Gabriel Roccabruna
Massimo Rizzoli
Seyed Mahed Mousavi
Giuseppe Riccardi
ReLM, LRM, VLM
312
4
0
22 Oct 2025
CaMiT: A Time-Aware Car Model Dataset for Classification and Generation
Frédéric LIN
Biruk Abere Ambaw
Adrian Daniel Popescu
Hejer Ammar
Romaric Audigier
Hervé Le Borgne
VLM, AI4TS
338
0
0
20 Oct 2025
Free-Grained Hierarchical Visual Recognition
Seulki Park
Zilin Wang
Stella X. Yu
NoLa
191
1
0
16 Oct 2025
Zero-Shot Fine-Grained Image Classification Using Large Vision-Language Models
Md. Atabuzzaman
Andrew Zhang
Chris Thomas
MLLM, VLM
189
1
0
04 Oct 2025
No Labels Needed: Zero-Shot Image Classification with Collaborative Self-Learning
Matheus Vinícius Todescato
Joel Luís Carbonera
VLM
160
0
0
23 Sep 2025
Decomposing Visual Classification: Assessing Tree-Based Reasoning in VLMs
Sary Elmansoury
Islam Mesabah
Gerrit Großmann
Peter Neigel
Raj Bhalwankar
Daniel Kondermann
Sebastian Vollmer
VLM, ReLM
184
0
0
10 Sep 2025
An Explainable Deep Neural Network with Frequency-Aware Channel and Spatial Refinement for Flood Prediction in Sustainable Cities
Sustainable Cities and Society (SCS), 2025
Shahid Shafi Dar
Bharat Kaurav
Arnav Jain
Chandravardhan Singh Raghaw
Mohammad Zia Ur Rehman
Nagendra Kumar
AI4CE
329
5
0
07 Sep 2025
Object Detection with Multimodal Large Vision-Language Models: An In-depth Review
Information Fusion (Inf. Fusion), 2025
Ranjan Sapkota
Manoj Karkee
ObjD, VLM
355
26
0
25 Aug 2025
Unsupervised Urban Tree Biodiversity Mapping from Street-Level Imagery Using Spatially-Aware Visual Clustering
Diaa Addeen Abuhani
Marco Seccaroni
Martina Mazzarello
Imran Zualkernan
Fábio Duarte
C. Ratti
159
0
0
19 Aug 2025
Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment
Lubin Gan
Jing Zhang
Linhao Qu
Y. X. R. Wang
Siying Wu
Xiaoyan Sun
MedIm
285
5
0
03 Aug 2025
Edge-Based Multimodal Sensor Data Fusion with Vision Language Models (VLMs) for Real-time Autonomous Vehicle Accident Avoidance
Fengze Yang
Bo Yu
Yang Zhou
Xuewen Luo
Zhengzhong Tu
Chenxi Liu
351
0
0
01 Aug 2025
FedVLM: Scalable Personalized Vision-Language Models through Federated Learning
Arkajyoti Mitra
Afia Anjum
Paul Agbaje
Mert D. Pesé
Habeeb Olufowobi
VLM
264
2
0
23 Jul 2025
An Empirical Study of Bugs in Data Visualization Libraries
Weiqi Lu
Yongqiang Tian
Xiaohan Zhong
Haoyang Ma
Zhenyang Xu
Shing-Chi Cheung
Chengnian Sun
150
2
0
18 Jun 2025
An Evaluation of a Visual Question Answering Strategy for Zero-shot Facial Expression Recognition in Still Images
Modesto Castrillón-Santana
Oliverio J. Santana
David Freire-Obregón
Daniel Hernández-Sosa
J. Lorenzo-Navarro
376
0
0
30 Apr 2025
EcoWikiRS: Learning Ecological Representation of Satellite Images from Weak Supervision with Species Observations and Wikipedia
Valerie Zermatten
J. Castillo-Navarro
Pallavi Jain
D. Tuia
Diego Marcos
367
8
0
28 Apr 2025
PVLM: Parsing-Aware Vision Language Model with Dynamic Contrastive Learning for Zero-Shot Deepfake Attribution
Yaning Zhang
Jiahe Zhang
Chunjie Ma
Weili Guan
Tian Gan
Zan Gao
300
0
0
19 Apr 2025
Self-Evolving Visual Concept Library using Vision-Language Critics
Computer Vision and Pattern Recognition (CVPR), 2025
Atharva Sehgal
Patrick Yuan
Ziniu Hu
Yisong Yue
Jennifer J. Sun
Swarat Chaudhuri
VLM
289
2
0
31 Mar 2025
Evolution-based Region Adversarial Prompt Learning for Robustness Enhancement in Vision-Language Models
Xiaojun Jia
Sensen Gao
Simeng Qin
Ke Ma
Xianrui Li
Yihao Huang
Wei Dong
Yang Liu
Xiaochun Cao
AAML, VLM
582
3
0
17 Mar 2025
A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models
Seyed Mohamad Ali Tousi
Ramy M. A. Farag
Jacket Demby's
Gbenga Omotara
John A. Lory
Guilherme N. DeSouza
1.0K
3
0
03 Mar 2025
ProAPO: Progressively Automatic Prompt Optimization for Visual Classification
Computer Vision and Pattern Recognition (CVPR), 2025
Xiangyan Qu
Gaopeng Gou
Jiamin Zhuang
Jing Yu
Kun Song
Qihao Wang
Yili Li
Gang Xiong
VLM
815
17
0
27 Feb 2025
DesCLIP: Robust Continual Learning via General Attribute Descriptions for VLM-Based Visual Recognition
Chiyuan He
Zihuan Qiu
Fanman Meng
Linfeng Xu
Qi Wu
Haoyang Li
CLL, VLM
632
0
0
02 Feb 2025
LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation
Huadong Tang
Youpeng Zhao
Y. Huang
Min Xu
Jun Wang
Qiang Wu
MLLM, VLM
485
1
0
30 Nov 2024
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections
Mohamed Fazli Mohamed Imam
Rufael Fedaku Marew
Jameel Hassan
Mustansar Fiaz
Alham Fikri Aji
Hisham Cholakkal
VLM
1.3K
8
0
28 Nov 2024
A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem
Kun Ding
Ying Wang
Gaofeng Meng
Shiming Xiang
VLM
334
0
0
15 Oct 2024
Designing Interfaces for Multimodal Vector Search Applications
Owen Pendrigh Elliott
Tom Hamer
Jesse Clark
237
0
0
18 Sep 2024
Enabling Small Models for Zero-Shot Selection and Reuse through Model Label Learning
Jia Zhang
Zhi Zhou
Lan-Zhe Guo
Yu-Feng Li
VLM
476
0
0
21 Aug 2024
Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts
European Conference on Computer Vision (ECCV), 2024
Andong Tan
Fengtao Zhou
Hao Chen
VLM
257
21
0
05 Aug 2024
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
Sandeep Mishra
Oindrila Saha
A. Bovik
246
0
0
24 Jun 2024
Few-Shot Recognition via Stage-Wise Retrieval-Augmented Finetuning
Tian Liu
Huixin Zhang
Shubham Parashar
Shu Kong
423
2
0
17 Jun 2024
Test-Time Multimodal Backdoor Detection by Contrastive Prompting
Yuwei Niu
Shuo He
Qinglai Wei
Z. Wu
Feng Liu
Bingquan Shen
AAML
498
4
0
24 May 2024
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
International Journal of Computer Vision (IJCV), 2021
Shiyang Feng
Shijie Geng
Renrui Zhang
Teli Ma
Rongyao Fang
Zelong Li
Jiaming Song
Yu Qiao
VLM, CLIP
1.4K
1,614
0
09 Oct 2021