Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions

17 March 2021

Papers citing "Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions"

MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification

462

10 Mar 2025

Coreset Selection via LLM-based Concept Bottlenecks

Akshay Mehra

Trisha Mittal

Subhadra Gopalakrishnan

Joshua Kimball

388

23 Feb 2025

M-Tuning: Prompt Tuning with Mitigated Label Bias in Open-Set Scenarios

729

31 Dec 2024

Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP

Neural Information Processing Systems (NeurIPS), 2024

334

31 Oct 2024

Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning

Qi Wu

388

22 Jul 2024

LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification

Renyi Qu

Mark Yatskar

329

29 May 2024

Why are Visually-Grounded Language Models Bad at Image Classification?

441

101

28 May 2024

Multimodal Foundation Models for Zero-shot Animal Species Recognition in Camera Trap Images

Ziwei Liu

...

292

02 Nov 2023

Open-Set Image Tagging with Multi-Grained Text Supervision

Xinyu Huang

Yi-Jie Huang

Youcai Zhang

Weiwei Tian

Rui Feng

Lei Zhang

297

23 Oct 2023

Waffling around for Performance: Visual Classification with Random Words and Broad Concepts

IEEE International Conference on Computer Vision (ICCV), 2023

A. Sophia Koepke

290

122

12 Jun 2023

Describe me an Aucklet: Generating Grounded Perceptual Category Descriptions

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Bill Noble

N. Ilinykh

333

07 Mar 2023

CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets

International Conference on Machine Learning (ICML), 2023

463

122

06 Feb 2023

I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification

Computer Vision and Pattern Recognition (CVPR), 2022

Muhammad Ferjad Naeem

Muhammad Gul Zain Ali Khan

Yongqin Xian

Muhammad Zeshan Afzal

D. Stricker

Luc Van Gool

F. Tombari

VLM

228

05 Dec 2022

Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification

Computer Vision and Pattern Recognition (CVPR), 2022

487

348

21 Nov 2022

Text2Model: Text-based Model Induction for Zero-shot Image Classification

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

211

27 Oct 2022

Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

361

155

13 Oct 2022

I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification

Neural Information Processing Systems (NeurIPS), 2022

Muhammad Ferjad Naeem

Yongqin Xian

Luc Van Gool

F. Tombari

VLM

235

21 Sep 2022

What does a platypus look like? Generating customized prompts for zero-shot image classification

IEEE International Conference on Computer Vision (ICCV), 2022

608

339

07 Sep 2022

SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization

405

26 Feb 2022

Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings

International Journal of Computer Vision (IJCV), 2022

Chunhua Shen

419

04 Feb 2022