v1v2 (latest)

Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions

Computer Vision and Pattern Recognition (CVPR), 2024

4 January 2024

ArXiv (abs)PDF HTML Github (39★)

Papers citing "Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions"

33 / 33 papers shown

Not All Birds Look The Same: Identity-Preserving Generation For Birds

Aaron Sun

Oindrila Saha

Subhransu Maji

163

04 Dec 2025

Culture in Action: Evaluating Text-to-Image Models through Social Activities

431

07 Nov 2025

[De|Re]constructing VLMs' Reasoning in Counting

312

22 Oct 2025

CaMiT: A Time-Aware Car Model Dataset for Classification and Generation

Frédéric LIN

Biruk Abere Ambaw

Adrian Daniel Popescu

338

20 Oct 2025

Free-Grained Hierarchical Visual Recognition

191

16 Oct 2025

Zero-Shot Fine-Grained Image Classification Using Large Vision-Language Models

189

04 Oct 2025

No Labels Needed: Zero-Shot Image Classification with Collaborative Self-Learning

Matheus Vinícius Todescato

Joel Luís Carbonera

VLM

160

23 Sep 2025

Decomposing Visual Classification: Assessing Tree-Based Reasoning in VLMs

184

10 Sep 2025

An Explainable Deep Neural Network with Frequency-Aware Channel and Spatial Refinement for Flood Prediction in Sustainable CitiesSustainable cities and society (SCS), 2025

Shahid Shafi Dar

Bharat Kaurav

Arnav Jain

Chandravardhan Singh Raghaw

Mohammad Zia Ur Rehman

Nagendra Kumar

AI4CE

329

07 Sep 2025

Object Detection with Multimodal Large Vision-Language Models: An In-depth ReviewInformation Fusion (Inf. Fusion), 2025

Ranjan Sapkota

Manoj Karkee

ObjD VLM

355

25 Aug 2025

Unsupervised Urban Tree Biodiversity Mapping from Street-Level Imagery Using Spatially-Aware Visual Clustering

159

19 Aug 2025

Enhancing Zero-Shot Brain Tumor Subtype Classification via Fine-Grained Patch-Text Alignment

285

03 Aug 2025

Edge-Based Multimodal Sensor Data Fusion with Vision Language Models (VLMs) for Real-time Autonomous Vehicle Accident Avoidance

351

01 Aug 2025

FedVLM: Scalable Personalized Vision-Language Models through Federated Learning

264

23 Jul 2025

An Empirical Study of Bugs in Data Visualization Libraries

150

18 Jun 2025

An Evaluation of a Visual Question Answering Strategy for Zero-shot Facial Expression Recognition in Still Images

Modesto Castrillón-Santana

Oliverio J. Santana

David Freire-Obregón

Daniel Hernández-Sosa

J. Lorenzo-Navarro

376

30 Apr 2025

EcoWikiRS: Learning Ecological Representation of Satellite Images from Weak Supervision with Species Observations and Wikipedia

367

28 Apr 2025

PVLM: Parsing-Aware Vision Language Model with Dynamic Contrastive Learning for Zero-Shot Deepfake Attribution

300

19 Apr 2025

Self-Evolving Visual Concept Library using Vision-Language CriticsComputer Vision and Pattern Recognition (CVPR), 2025

289

31 Mar 2025

Evolution-based Region Adversarial Prompt Learning for Robustness Enhancement in Vision-Language Models

582

17 Mar 2025

A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models

Seyed Mohamad Ali Tousi

1.0K

03 Mar 2025

ProAPO: Progressively Automatic Prompt Optimization for Visual ClassificationComputer Vision and Pattern Recognition (CVPR), 2025

815

27 Feb 2025

DesCLIP: Robust Continual Learning via General Attribute Descriptions for VLM-Based Visual Recognition

632

02 Feb 2025

LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation

485

30 Nov 2024

CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections

Mohamed Fazli Mohamed Imam

1.3K

28 Nov 2024

A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem

334

15 Oct 2024

Designing Interfaces for Multimodal Vector Search Applications

Owen Pendrigh Elliott

Tom Hamer

Jesse Clark

237

18 Sep 2024

Enabling Small Models for Zero-Shot Selection and Reuse through Model Label Learning

476

21 Aug 2024

Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary ConceptsEuropean Conference on Computer Vision (ECCV), 2024

Andong Tan

Fengtao Zhou

Hao Chen

VLM

257

05 Aug 2024

YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals

Sandeep Mishra

Oindrila Saha

A. Bovik

246

24 Jun 2024

Few-Shot Recognition via Stage-Wise Retrieval-Augmented Finetuning

423

17 Jun 2024

Test-Time Multimodal Backdoor Detection by Contrastive Prompting

498

24 May 2024

CLIP-Adapter: Better Vision-Language Models with Feature AdaptersInternational Journal of Computer Vision (IJCV), 2021

Yu Qiao

1.4K

1,614

09 Oct 2021