ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.09669
  4. Cited By
Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual
  Descriptions

Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions

17 March 2021
Sebastian Bujwid
Josephine Sullivan
    VLM
ArXiv (abs)PDFHTML

Papers citing "Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions"

20 / 20 papers shown
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification
Xiangyan Qu
Jing Yu
Jiamin Zhuang
Gaopeng Gou
Gang Xiong
Qi Wu
VLM
462
2
0
10 Mar 2025
Coreset Selection via LLM-based Concept Bottlenecks
Coreset Selection via LLM-based Concept Bottlenecks
Akshay Mehra
Trisha Mittal
Subhadra Gopalakrishnan
Joshua Kimball
388
0
0
23 Feb 2025
M-Tuning: Prompt Tuning with Mitigated Label Bias in Open-Set Scenarios
M-Tuning: Prompt Tuning with Mitigated Label Bias in Open-Set Scenarios
Ning Liao
Xiaopeng Zhang
Minglu Cao
Junchi Yan
VPVLMVLM
729
2
0
31 Dec 2024
Aggregate-and-Adapt Natural Language Prompts for Downstream
  Generalization of CLIP
Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIPNeural Information Processing Systems (NeurIPS), 2024
Chen Huang
Skyler Seto
Samira Abnar
David Grangier
Navdeep Jaitly
J. Susskind
VLM
334
6
0
31 Oct 2024
Visual-Semantic Decomposition and Partial Alignment for Document-based
  Zero-Shot Learning
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning
Xiangyang Qu
Jing Yu
Keke Gai
Jiamin Zhuang
Yuanmin Tang
Gang Xiong
Gaopeng Gou
Qi Wu
388
5
0
22 Jul 2024
LLM-based Hierarchical Concept Decomposition for Interpretable
  Fine-Grained Image Classification
LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification
Renyi Qu
Mark Yatskar
329
2
0
29 May 2024
Why are Visually-Grounded Language Models Bad at Image Classification?
Why are Visually-Grounded Language Models Bad at Image Classification?
Yuhui Zhang
Alyssa Unell
Xiaohan Wang
Dhruba Ghosh
Yuchang Su
Ludwig Schmidt
Serena Yeung-Levy
VLM
441
101
0
28 May 2024
Multimodal Foundation Models for Zero-shot Animal Species Recognition in
  Camera Trap Images
Multimodal Foundation Models for Zero-shot Animal Species Recognition in Camera Trap Images
Zalan Fabian
Zhongqi Miao
Chunyuan Li
Yuanhan Zhang
Ziwei Liu
...
Laura Siabatto
Andrés Link
Pablo Arbelaez
Rahul Dodhia
J. L. Ferres
292
18
0
02 Nov 2023
Open-Set Image Tagging with Multi-Grained Text Supervision
Open-Set Image Tagging with Multi-Grained Text Supervision
Xinyu Huang
Yi-Jie Huang
Youcai Zhang
Weiwei Tian
Rui Feng
Yuejie Zhang
Yanchun Xie
Yaqian Li
Lei Zhang
VLM
297
72
0
23 Oct 2023
Waffling around for Performance: Visual Classification with Random Words
  and Broad Concepts
Waffling around for Performance: Visual Classification with Random Words and Broad ConceptsIEEE International Conference on Computer Vision (ICCV), 2023
Karsten Roth
Jae Myung Kim
A. Sophia Koepke
Oriol Vinyals
Cordelia Schmid
Zeynep Akata
VLM
290
122
0
12 Jun 2023
Describe me an Aucklet: Generating Grounded Perceptual Category
  Descriptions
Describe me an Aucklet: Generating Grounded Perceptual Category DescriptionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Bill Noble
N. Ilinykh
333
0
0
07 Mar 2023
CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets
CHiLS: Zero-Shot Image Classification with Hierarchical Label SetsInternational Conference on Machine Learning (ICML), 2023
Cheng-i Wang
Julian McAuley
Zachary Chase Lipton
Saurabh Garg
VLM
463
122
0
06 Feb 2023
I2MVFormer: Large Language Model Generated Multi-View Document
  Supervision for Zero-Shot Image Classification
I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image ClassificationComputer Vision and Pattern Recognition (CVPR), 2022
Muhammad Ferjad Naeem
Muhammad Gul Zain Ali Khan
Yongqin Xian
Muhammad Zeshan Afzal
D. Stricker
Luc Van Gool
F. Tombari
VLM
228
93
0
05 Dec 2022
Language in a Bottle: Language Model Guided Concept Bottlenecks for
  Interpretable Image Classification
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image ClassificationComputer Vision and Pattern Recognition (CVPR), 2022
Yue Yang
Artemis Panagopoulou
Shenghao Zhou
Daniel Jin
Chris Callison-Burch
Mark Yatskar
487
348
0
21 Nov 2022
Text2Model: Text-based Model Induction for Zero-shot Image
  Classification
Text2Model: Text-based Model Induction for Zero-shot Image ClassificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ohad Amosy
Tomer Volk
Eilam Shapira
Eyal Ben-David
Roi Reichart
Gal Chechik
VLM
211
2
0
27 Oct 2022
Decoding Visual Neural Representations by Multimodal Learning of
  Brain-Visual-Linguistic Features
Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic FeaturesIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Changde Du
Kaicheng Fu
Jinpeng Li
Huiguang He
VLM
361
155
0
13 Oct 2022
I2DFormer: Learning Image to Document Attention for Zero-Shot Image
  Classification
I2DFormer: Learning Image to Document Attention for Zero-Shot Image ClassificationNeural Information Processing Systems (NeurIPS), 2022
Muhammad Ferjad Naeem
Yongqin Xian
Luc Van Gool
F. Tombari
VLM
235
59
0
21 Sep 2022
What does a platypus look like? Generating customized prompts for
  zero-shot image classification
What does a platypus look like? Generating customized prompts for zero-shot image classificationIEEE International Conference on Computer Vision (ICCV), 2022
Sarah M Pratt
Ian Covert
Rosanne Liu
Ali Farhadi
VLM
608
339
0
07 Sep 2022
SemSup: Semantic Supervision for Simple and Scalable Zero-shot
  Generalization
SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization
Austin W. Hanjie
Ameet Deshpande
Karthik Narasimhan
VLM
405
2
0
26 Feb 2022
Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings
Scaling up Multi-domain Semantic Segmentation with Sentence EmbeddingsInternational Journal of Computer Vision (IJCV), 2022
Wei Yin
Yifan Liu
Chunhua Shen
Baichuan Sun
Anton Van Den Hengel
VLM
419
11
0
04 Feb 2022
1
Page 1 of 1