ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.02670
  4. Cited By
Zero-shot Learning for Audio-based Music Classification and Tagging

Zero-shot Learning for Audio-based Music Classification and Tagging

5 July 2019
Jeong-Eun Choi
Jongpil Lee
Jiyoung Park
Juhan Nam
    VLM
ArXivPDFHTML

Papers citing "Zero-shot Learning for Audio-based Music Classification and Tagging"

28 / 28 papers shown
Title
Learning disentangled representations for instrument-based music similarity
Learning disentangled representations for instrument-based music similarity
Yuka Hashizume
Li Li
Atsushi Miyashita
T. Toda
49
0
0
21 Mar 2025
Investigation of perceptual music similarity focusing on each instrumental part
Investigation of perceptual music similarity focusing on each instrumental part
Yuka Hashizume
T. Toda
44
1
0
04 Feb 2025
PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text
PIAST: A Multimodal Piano Dataset with Audio, Symbolic and Text
Hayeon Bang
Eunjin Choi
Megan Finch
Seungheon Doh
Seolhee Lee
G. Lee
Juhan Nam
23
0
0
04 Nov 2024
Enriching Music Descriptions with a Finetuned-LLM and Metadata for
  Text-to-Music Retrieval
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval
Seungheon Doh
Minhee Lee
Dasaem Jeong
Juhan Nam
57
8
0
04 Oct 2024
Multi-label Zero-Shot Audio Classification with Temporal Attention
Multi-label Zero-Shot Audio Classification with Temporal Attention
Duygu Dogan
Huang Xie
Toni Heittola
Tuomas Virtanen
VLM
27
0
0
31 Aug 2024
I can listen but cannot read: An evaluation of two-tower multimodal
  systems for instrument recognition
I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition
Yannis Vasilakis
Rachel M. Bittner
Johan Pauwels
40
0
0
25 Jul 2024
Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge
  from Large Language Models
Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models
Xuenan Xu
Pingyue Zhang
Ming Yan
Ji Zhang
Mengyue Wu
VLM
21
0
0
19 Jul 2024
Musical Word Embedding for Music Tagging and Retrieval
Musical Word Embedding for Music Tagging and Retrieval
Seungheon Doh
Jongpil Lee
Dasaem Jeong
Juhan Nam
21
2
0
21 Apr 2024
MuseChat: A Conversational Music Recommendation System for Videos
MuseChat: A Conversational Music Recommendation System for Videos
Zhikang Dong
Bin Chen
Xiulong Liu
Paweł Polak
Peng Zhang
LRM
37
26
0
10 Oct 2023
MDSC: Towards Evaluating the Style Consistency Between Music and Dance
MDSC: Towards Evaluating the Style Consistency Between Music and Dance
Zixiang Zhou
Weiyuan Li
Baoyuan Wang
27
1
0
04 Sep 2023
Language-Guided Music Recommendation for Video via Prompt Analogies
Language-Guided Music Recommendation for Video via Prompt Analogies
Daniel McKee
Justin Salamon
Josef Sivic
Bryan C. Russell
VGen
28
26
0
15 Jun 2023
Toward Universal Text-to-Music Retrieval
Toward Universal Text-to-Music Retrieval
Seungheon Doh
Minz Won
Keunwoo Choi
Juhan Nam
VLM
11
25
0
26 Nov 2022
Music Similarity Calculation of Individual Instrumental Sounds Using
  Metric Learning
Music Similarity Calculation of Individual Instrumental Sounds Using Metric Learning
Yuka Hashizume
Li Li
T. Toda
20
5
0
15 Nov 2022
MATT: A Multiple-instance Attention Mechanism for Long-tail Music Genre
  Classification
MATT: A Multiple-instance Attention Mechanism for Long-tail Music Genre Classification
Xiaokai Liu
Meng Zhang
6
0
0
09 Sep 2022
Contrastive Audio-Language Learning for Music
Contrastive Audio-Language Learning for Music
Ilaria Manco
Emmanouil Benetos
Elio Quinton
Gyorgy Fazekas
25
44
0
25 Aug 2022
Improved Zero-Shot Audio Tagging & Classification with Patchout
  Spectrogram Transformers
Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers
Paul Primus
Gerhard Widmer
VLM
17
5
0
24 Aug 2022
Audio-visual Generalised Zero-shot Learning with Cross-modal Attention
  and Language
Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and Language
Otniel-Bogdan Mercea
Lukas Riesch
A. Sophia Koepke
Zeynep Akata
22
48
0
07 Mar 2022
Exploring modality-agnostic representations for music classification
Exploring modality-agnostic representations for music classification
Ho-Hsiang Wu
Magdalena Fuentes
J. P. Bello
11
4
0
02 Jun 2021
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot
  Learning with Knowledge Distillation
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation
Sunwoo Kim
Minje Kim
17
18
0
08 May 2021
Enriched Music Representations with Multiple Cross-modal Contrastive
  Learning
Enriched Music Representations with Multiple Cross-modal Contrastive Learning
Andrés Ferraro
Xavier Favory
K. Drossos
Yuntae Kim
Dmitry Bogdanov
11
25
0
01 Apr 2021
Multimodal Metric Learning for Tag-based Music Retrieval
Multimodal Metric Learning for Tag-based Music Retrieval
Minz Won
Sergio Oramas
Oriol Nieto
F. Gouyon
Xavier Serra
11
44
0
30 Oct 2020
Mood Classification Using Listening Data
Mood Classification Using Listening Data
Filip Korzeniowski
Oriol Nieto
Matthew C. McCallum
Minz Won
Sergio Oramas
Erik M. Schmidt
10
12
0
22 Oct 2020
Disentangled Multidimensional Metric Learning for Music Similarity
Disentangled Multidimensional Metric Learning for Music Similarity
Jongpil Lee
Nicholas J. Bryan
Justin Salamon
Zeyu Jin
Juhan Nam
24
40
0
09 Aug 2020
Musical Word Embedding: Bridging the Gap between Listening Contexts and
  Music
Musical Word Embedding: Bridging the Gap between Listening Contexts and Music
Seungheon Doh
Jongpil Lee
T. Park
Juhan Nam
12
4
0
23 Jul 2020
Visual Attention for Musical Instrument Recognition
Visual Attention for Musical Instrument Recognition
Karn N. Watcharasupat
Siddharth Gururani
Alexander Lerch
17
3
0
17 Jun 2020
nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using
  1D Convolution Neural Networks
nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolution Neural Networks
K. Cheuk
Hans Anderson
Kat R. Agres
Dorien Herremans
11
5
0
27 Dec 2019
Zero-shot Learning and Knowledge Transfer in Music Classification and
  Tagging
Zero-shot Learning and Knowledge Transfer in Music Classification and Tagging
Jeong-Eun Choi
Jongpil Lee
Jiyoung Park
Juhan Nam
VLM
11
6
0
20 Jun 2019
Learning Deep Representations of Fine-grained Visual Descriptions
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCL
VLM
170
840
0
17 May 2016
1