Zero-shot Learning for Audio-based Music Classification and Tagging

5 July 2019

Papers citing "Zero-shot Learning for Audio-based Music Classification and Tagging"

Learning disentangled representations for instrument-based music similarity Yuka Hashizume Li Li Atsushi Miyashita T. Toda 49 0 0 21 Mar 2025
Investigation of perceptual music similarity focusing on each instrumental part Yuka Hashizume T. Toda 44 1 0 04 Feb 2025
PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text Hayeon Bang Eunjin Choi Megan Finch Seungheon Doh Seolhee Lee G. Lee Juhan Nam 23 0 0 04 Nov 2024
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval Seungheon Doh Minhee Lee Dasaem Jeong Juhan Nam 57 8 0 04 Oct 2024
Multi-label Zero-Shot Audio Classification with Temporal Attention Duygu Dogan Huang Xie Toni Heittola Tuomas Virtanen VLM 27 0 0 31 Aug 2024
I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition Yannis Vasilakis Rachel M. Bittner Johan Pauwels 40 0 0 25 Jul 2024
Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models Xuenan Xu Pingyue Zhang Ming Yan Ji Zhang Mengyue Wu VLM 21 0 0 19 Jul 2024
Musical Word Embedding for Music Tagging and Retrieval Seungheon Doh Jongpil Lee Dasaem Jeong Juhan Nam 21 2 0 21 Apr 2024
MuseChat: A Conversational Music Recommendation System for Videos Zhikang Dong Bin Chen Xiulong Liu Paweł Polak Peng Zhang LRM 37 26 0 10 Oct 2023
MDSC: Towards Evaluating the Style Consistency Between Music and Dance Zixiang Zhou Weiyuan Li Baoyuan Wang 27 1 0 04 Sep 2023
Language-Guided Music Recommendation for Video via Prompt Analogies Daniel McKee Justin Salamon Josef Sivic Bryan C. Russell VGen 28 26 0 15 Jun 2023
Toward Universal Text-to-Music Retrieval Seungheon Doh Minz Won Keunwoo Choi Juhan Nam VLM 11 25 0 26 Nov 2022
Music Similarity Calculation of Individual Instrumental Sounds Using Metric Learning Yuka Hashizume Li Li T. Toda 20 5 0 15 Nov 2022
MATT: A Multiple-instance Attention Mechanism for Long-tail Music Genre Classification Xiaokai Liu Meng Zhang 6 0 0 09 Sep 2022
Contrastive Audio-Language Learning for Music Ilaria Manco Emmanouil Benetos Elio Quinton Gyorgy Fazekas 25 44 0 25 Aug 2022
Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers Paul Primus Gerhard Widmer VLM 17 5 0 24 Aug 2022
Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language Otniel-Bogdan Mercea Lukas Riesch A. Sophia Koepke Zeynep Akata 22 48 0 07 Mar 2022
Exploring modality-agnostic representations for music classification Ho-Hsiang Wu Magdalena Fuentes J. P. Bello 11 4 0 02 Jun 2021
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation Sunwoo Kim Minje Kim 17 18 0 08 May 2021
Enriched Music Representations with Multiple Cross-modal Contrastive Learning Andrés Ferraro Xavier Favory K. Drossos Yuntae Kim Dmitry Bogdanov 11 25 0 01 Apr 2021
Multimodal Metric Learning for Tag-based Music Retrieval Minz Won Sergio Oramas Oriol Nieto F. Gouyon Xavier Serra 11 44 0 30 Oct 2020
Mood Classification Using Listening Data Filip Korzeniowski Oriol Nieto Matthew C. McCallum Minz Won Sergio Oramas Erik M. Schmidt 10 12 0 22 Oct 2020
Disentangled Multidimensional Metric Learning for Music Similarity Jongpil Lee Nicholas J. Bryan Justin Salamon Zeyu Jin Juhan Nam 24 40 0 09 Aug 2020
Musical Word Embedding: Bridging the Gap between Listening Contexts and Music Seungheon Doh Jongpil Lee T. Park Juhan Nam 12 4 0 23 Jul 2020
Visual Attention for Musical Instrument Recognition Karn N. Watcharasupat Siddharth Gururani Alexander Lerch 17 3 0 17 Jun 2020
nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks K. Cheuk Hans Anderson Kat R. Agres Dorien Herremans 11 5 0 27 Dec 2019
Zero-shot Learning and Knowledge Transfer in Music Classification and Tagging Jeong-Eun Choi Jongpil Lee Jiyoung Park Juhan Nam VLM 11 6 0 20 Jun 2019
Learning Deep Representations of Fine-grained Visual Descriptions Scott E. Reed Zeynep Akata Bernt Schiele Honglak Lee OCL VLM 170 840 0 17 May 2016