1506.00511
Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions
IEEE International Conference on Computer Vision (ICCV), 2015
1 June 2015
Jimmy Ba
Kevin Swersky
Sanja Fidler
Ruslan Salakhutdinov
VLM
Papers citing
"Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions"
50 / 196 papers shown
Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Huajie Jiang
Hao Sun
Xiaohan Yu
Yongli Hu
Baocai Yin
Jian Yang
Yuankai Qi
VLM
161
1
0
29 Mar 2025
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification
Xiangyan Qu
Jing Yu
Jiamin Zhuang
Gaopeng Gou
Gang Xiong
Qi Wu
VLM
324
2
0
10 Mar 2025
Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Yaoyun Zhang
Xuenan Xu
Mengyue Wu
VGen
212
2
0
24 Dec 2024
An Individual Identity-Driven Framework for Animal Re-Identification
Yihao Wu
Di Zhao
Jingfeng Zhang
Yun Sing Koh
145
1
0
30 Oct 2024
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning
Xiangyan Qu
Jing Yu
Keke Gai
Jiamin Zhuang
Yuanmin Tang
Gang Xiong
Gaopeng Gou
Qi Wu
287
5
0
22 Jul 2024
NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning
Yi Zhang
Chun-Wun Cheng
Ke Yu
Zhihai He
Carola-Bibiane Schonlieb
Angelica I Aviles-Rivero
VLM
191
3
0
11 Jul 2024
Conceptual Codebook Learning for Vision-Language Models
Yi Zhang
Ke Yu
Siqi Wu
Zhihai He
VLM
406
5
0
02 Jul 2024
A separability-based approach to quantifying generalization: which layer is best?
Luciano Dyballa
Evan Gerritz
Steven W. Zucker
OOD
317
4
0
02 May 2024
Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection
Pengfei Zhou
Weiqing Min
Jiajun Song
Yang Zhang
Shuqiang Jiang
221
14
0
14 Feb 2024
A Closer Look at AUROC and AUPRC under Class Imbalance
Neural Information Processing Systems (NeurIPS), 2024
Matthew B. A. McDermott
Lasse Hyldig Hansen
Haoran Zhang
Giovanni Angelotti
Jack Gallifant
624
82
0
11 Jan 2024
CLIP in Medical Imaging: A Comprehensive Survey
Medical Image Analysis (MIA), 2023
Zihao Zhao
Yuxiao Liu
Han Wu
Yonghao Li
Sheng Wang
L. Teng
Disheng Liu
Zhiming Cui
Qian Wang
Hongtu Zhu
CLIP
MedIm
LM&MA
VLM
529
43
0
12 Dec 2023
Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models
International Conference on Machine Learning (ICML), 2023
Zhihe Lu
Jiawang Bai
Xin Li
Zeyu Xiao
Xinchao Wang
VLM
135
16
0
28 Nov 2023
Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Xue-mei Hu
Ce Zhang
Yi Zhang
Bowen Hai
Ke Yu
Zhihai He
MDE
VLM
241
21
0
02 Nov 2023
SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection
ACM Multimedia (ACM MM), 2023
Pengfei Zhou
Weiqing Min
Yang Zhang
Jiajun Song
Ying Jin
Shuqiang Jiang
DiffM
261
14
0
07 Oct 2023
Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP
International Conference on Learning Representations (ICLR), 2023
Zixiang Chen
Yihe Deng
Yuanzhi Li
Quanquan Gu
VLM
337
17
0
02 Oct 2023
Domain-Controlled Prompt Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Qinglong Cao
Zhengqin Xu
Yuantian Chen
Chao Ma
Xiaokang Yang
VLM
204
29
0
30 Sep 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
AAAI Conference on Artificial Intelligence (AAAI), 2023
Shengxiang Zhang
Muzammal Naseer
Guangyi Chen
Zhiqiang Shen
Salman Khan
Kun Zhang
Fahad Shahbaz Khan
VLM
195
7
0
24 Aug 2023
Seeing in Flowing: Adapting CLIP for Action Recognition with Motion Prompts Learning
ACM Multimedia (ACM MM), 2023
Qianqian Wang
Junlong Du
Ke Yan
Shouhong Ding
VLM
149
30
0
09 Aug 2023
Cross-Modal Concept Learning and Inference for Vision-Language Models
Neurocomputing, 2023
Yi Zhang
Ce Zhang
Yushun Tang
Z. He
VLM
MLLM
CLIP
166
20
0
28 Jul 2023
Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts
Mayug Maniparambil
Chris Vorster
D. Molloy
N. Murphy
Kevin McGuinness
Noel E. O'Connor
CLIP
VLM
MLLM
237
73
0
21 Jul 2023
CoPL: Contextual Prompt Learning for Vision-Language Understanding
AAAI Conference on Artificial Intelligence (AAAI), 2023
Koustava Goswami
Srikrishna Karanam
Prateksha Udhayanan
K J Joseph
Balaji Vasan Srinivasan
VLM
228
17
0
03 Jul 2023
Multimodal Zero-Shot Learning for Tactile Texture Recognition
G. Cao
Jiaqi Jiang
Danushka Bollegala
Min Li
Shan Luo
138
18
0
22 Jun 2023
MuDPT: Multi-modal Deep-symphysis Prompt Tuning for Large Pre-trained Vision-Language Models
IEEE International Conference on Multimedia and Expo (ICME), 2023
Yongzhu Miao
Shasha Li
Jintao Tang
Ting Wang
VLM
MLLM
VPVLM
151
4
0
20 Jun 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
292
1
0
02 Jun 2023
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Man Liu
Feng Li
Chunjie Zhang
Yunchao Wei
Yuxi Liu
Yao-Min Zhao
163
56
0
27 Mar 2023
Multi-modal Machine Learning in Engineering Design: A Review and Future Directions
Journal of Computing and Information Science in Engineering (JCISE), 2023
Binyang Song
Ruilin Zhou
Faez Ahmed
AI4CE
344
63
0
14 Feb 2023
Navigating Alignment for Non-identical Client Class Sets: A Label Name-Anchored Federated Learning Framework
Knowledge Discovery and Data Mining (KDD), 2023
Jiayun Zhang
Xiyuan Zhang
Xinyang Zhang
Dezhi Hong
Rajesh K. Gupta
Jingbo Shang
FedML
206
8
0
01 Jan 2023
Unleashing the Power of Shared Label Structures for Human Activity Recognition
International Conference on Information and Knowledge Management (CIKM), 2023
Xiyuan Zhang
Ranak Roy Chowdhury
Jiayun Zhang
Dezhi Hong
Rajesh K. Gupta
Jingbo Shang
VLM
226
10
0
01 Jan 2023
Localized Latent Updates for Fine-Tuning Vision-Language Models
Moritz Ibing
I. Lim
Leif Kobbelt
VLM
154
1
0
13 Dec 2022
EPCL: Frozen CLIP Transformer is An Efficient Point Cloud Encoder
AAAI Conference on Artificial Intelligence (AAAI), 2022
Xiaoshui Huang
Zhou Huang
Shengjia Li
Wentao Qu
Tong He
Yuenan Hou
Yifan Zuo
Wanli Ouyang
302
26
0
08 Dec 2022
Multitask Vision-Language Prompt Tuning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Sheng Shen
Shijia Yang
Tianjun Zhang
Bohan Zhai
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
VLM
VPVLM
272
75
0
21 Nov 2022
Task Residual for Tuning Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2022
Tao Yu
Zhihe Lu
Xin Jin
Zhibo Chen
Xinchao Wang
VLM
CLIP
244
130
0
18 Nov 2022
Text2Model: Text-based Model Induction for Zero-shot Image Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ohad Amosy
Tomer Volk
Eilam Shapira
Eyal Ben-David
Roi Reichart
Gal Chechik
VLM
122
2
0
27 Oct 2022
Learning by Asking Questions for Knowledge-based Novel Object Recognition
International Journal of Computer Vision (IJCV), 2022
Kohei Uehara
Tatsuya Harada
160
2
0
12 Oct 2022
Learning to embed semantic similarity for joint image-text retrieval
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Noam Malali
Y. Keller
181
12
0
07 Oct 2022
I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification
Neural Information Processing Systems (NeurIPS), 2022
Muhammad Ferjad Naeem
Yongqin Xian
Luc Van Gool
F. Tombari
VLM
173
54
0
21 Sep 2022
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Rui Qian
Yeqing Li
Zheng Xu
Ming-Hsuan Yang
Serge Belongie
Huayu Chen
VLM
164
25
0
15 Jul 2022
Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with Attributes
Neural Information Processing Systems (NeurIPS), 2022
Alessio Mazzetto
Cristina Menghini
A. Yuan
E. Upfal
Stephen H. Bach
VLM
152
2
0
25 May 2022
Generating Representative Samples for Few-Shot Classification
Computer Vision and Pattern Recognition (CVPR), 2022
Aoxiang Fan
Hieu M. Le
VLM
216
89
0
05 May 2022
Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention
Computer Vision and Pattern Recognition (CVPR), 2022
Yu Yang
Seung Wook Kim
Jungseock Joo
FAtt
175
19
0
10 Apr 2022
Mixed Differential Privacy in Computer Vision
Computer Vision and Pattern Recognition (CVPR), 2022
Aditya Golatkar
Alessandro Achille
Yu Wang
Aaron Roth
Michael Kearns
Stefano Soatto
PICV
VLM
240
55
0
22 Mar 2022
Conditional Prompt Learning for Vision-Language Models
Computer Vision and Pattern Recognition (CVPR), 2022
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VLM
CLIP
VPVLM
501
1,832
0
10 Mar 2022
SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization
Austin W. Hanjie
Ameet Deshpande
Karthik Narasimhan
VLM
314
2
0
26 Feb 2022
On Guiding Visual Attention with Language Specification
Computer Vision and Pattern Recognition (CVPR), 2022
Suzanne Petryk
Lisa Dunlap
Keyan Nasseri
Joseph E. Gonzalez
Trevor Darrell
Anna Rohrbach
VLM
405
38
1
17 Feb 2022
A Survey on Visual Transfer Learning using Knowledge Graphs
Sebastian Monka
Lavdim Halilaj
Achim Rettinger
231
26
0
27 Jan 2022
Towards Zero-shot Sign Language Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Yunus Can Bilge
R. G. Cinbis
Nazli Ikizler-Cinbis
SLR
177
42
0
15 Jan 2022
CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision
A. Shrivastava
Ramprasaath R. Selvaraju
Nikhil Naik
Vicente Ordonez
VLM
CLIP
153
7
0
14 Dec 2021
Dual Progressive Prototype Network for Generalized Zero-Shot Learning
Neural Information Processing Systems (NeurIPS), 2021
Chaoqun Wang
Shaobo Min
Xuejin Chen
Xiaoyan Sun
Houqiang Li
184
61
0
03 Nov 2021
Fine-Grained Zero-Shot Learning with DNA as Side Information
Sarkhan Badirli
Zeynep Akata
G. Mohler
Christel Picard
M. M. Dundar
SyDa
BDL
319
40
0
29 Sep 2021
Semantics-Guided Contrastive Network for Zero-Shot Object detection
Caixia Yan
Xiao Chang
Minnan Luo
Huan Liu
Xiaoqin Zhang
Qinghua Zheng
ObjD
VLM
240
94
0
04 Sep 2021