2005.13402
AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2020
27 May 2020
Pratik Mazumder
Pravendra Singh
Kranti K. Parida
Vinay P. Namboodiri
Papers citing
"AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings"
23 / 23 papers shown
MPJudge: Towards Perceptual Assessment of Music-Induced Paintings
Shiqi Jiang
Tianyi Liang
Changbo Wang
Chenhui Li
76
0
0
10 Nov 2025
REIS: A High-Performance and Energy-Efficient Retrieval System with In-Storage Processing
International Symposium on Computer Architecture (ISCA), 2025
Kangqi Chen
Andreas Kosmas Kakolyris
Rakesh Nadig
Manos Frouzakis
Nika Mansouri-Ghiasi
Yu Liang
Haiyu Mao
Mohammad Sadrosadati
Onur Mutlu
RALM
323
5
0
19 Jun 2025
Optimizing Genetic Algorithms with Multilayer Perceptron Networks for Enhancing TinyFace Recognition
Mohammad Subhi Al-Batah
Mowafaq Salem Alzboon
Muhyeeddin Alqaraleh
CVBM
215
0
0
11 Jun 2025
Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning
Wenrui Li
Penghong Wang
Xingtao Wang
Wangmeng Zuo
Xiaopeng Fan
Yonghong Tian
217
3
0
26 May 2025
Extremely Simple Out-of-distribution Detection for Audio-visual Generalized Zero-shot Learning
Yang Liu
Xinming Zhang
Jiale Du
Xinbo Gao
Jungong Han
OODD
304
0
0
28 Mar 2025
Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic Thresholds
Computer Vision and Pattern Recognition (CVPR), 2025
E. Shaar
Ariel Shaulov
Gal Chechik
Lior Wolf
VLM
363
2
0
17 Mar 2025
Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning
RunLin Yu
Yipu Gong
Wenrui Li
Aiwen Sun
Mengren Zheng
VLM
282
0
0
16 Dec 2024
Towards Open-Vocabulary Audio-Visual Event Localization
Computer Vision and Pattern Recognition (CVPR), 2024
Jinxing Zhou
Dan Guo
Ruohao Guo
Yuxin Mao
Jingjing Hu
Yiran Zhong
Xiaojun Chang
Ming Wang
VLM
552
25
0
18 Nov 2024
Audio-visual Generalized Zero-shot Learning the Easy Way
Shentong Mo
Pedro Morgado
263
8
0
18 Jul 2024
Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning
Wenrui Li
Penghong Wang
Ruiqin Xiong
Xiaopeng Fan
277
17
0
11 Jul 2024
Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models
David Kurzendörfer
Otniel-Bogdan Mercea
A. Sophia Koepke
Zeynep Akata
VLM
CLIP
214
3
0
09 Apr 2024
Boosting Audio-visual Zero-shot Learning with Large Language Models
Haoxing Chen
Yaohui Li
Yan Hong
Zizheng Huang
Zhuoer Xu
Zhangxuan Gu
Jun Lan
Huijia Zhu
Weiqiang Wang
VLM
268
3
0
21 Nov 2023
Hyperbolic Audio-visual Zero-shot Learning
IEEE International Conference on Computer Vision (ICCV), 2023
Jie Hong
Zeeshan Hayder
Junlin Han
Pengfei Fang
Mehrtash Harandi
L. Petersson
252
24
0
24 Aug 2023
Audio-Visual Class-Incremental Learning
IEEE International Conference on Computer Vision (ICCV), 2023
Weiguo Pian
Shentong Mo
Yunhui Guo
Yapeng Tian
CLL
VLM
240
35
0
21 Aug 2023
Robust Sound-Guided Image Manipulation
Neural Networks (NN), 2022
Seung Hyun Lee
Gyeongrok Oh
Wonmin Byeon
Sang Ho Yoon
Jinkyu Kim
Sangpil Kim
DiffM
344
8
0
30 Aug 2022
Temporal and cross-modal attention for audio-visual zero-shot learning
European Conference on Computer Vision (ECCV), 2022
Otniel-Bogdan Mercea
Thomas Hummel
A. Sophia Koepke
Zeynep Akata
208
32
0
20 Jul 2022
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Rui Qian
Yeqing Li
Zheng Xu
Ming-Hsuan Yang
Serge Belongie
Huayu Chen
VLM
189
26
0
15 Jul 2022
Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language
Computer Vision and Pattern Recognition (CVPR), 2022
Otniel-Bogdan Mercea
Lukas Riesch
A. Sophia Koepke
Zeynep Akata
193
56
0
07 Mar 2022
Sound-Guided Semantic Image Manipulation
Seung Hyun Lee
Wonseok Roh
Wonmin Byeon
Sang Ho Yoon
Chanyoung Kim
Jinkyu Kim
Sangpil Kim
DiffM
340
53
0
30 Nov 2021
Beyond Mono to Binaural: Generating Binaural Audio from Mono Audio with Depth and Cross Modal Attention
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Kranti K. Parida
Siddharth Srivastava
Gaurav Sharma
MDE
204
29
0
15 Nov 2021
Discriminative Semantic Transitive Consistency for Cross-Modal Learning
Computer Vision and Image Understanding (CVIU), 2021
Kranti K. Parida
Gaurav Sharma
212
1
0
25 Mar 2021
Beyond Image to Depth: Improving Depth Prediction using Echoes
Computer Vision and Pattern Recognition (CVPR), 2021
Kranti K. Parida
Siddharth Srivastava
Gaurav Sharma
MDE
311
42
0
15 Mar 2021
A Review of Generalized Zero-Shot Learning Methods
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Farhad Pourpanah
Moloud Abdar
Yuxuan Luo
Xinlei Zhou
Ran Wang
C. P. Lim
Xizhao Wang
Q. M. Jonathan Wu
VLM
649
513
0
17 Nov 2020