Unsupervised Learning of Semantic Audio Representations

6 November 2017
A. Jansen
Manoj Plakal
R. Pandya
D. Ellis
Shawn Hershey
Jiayang Liu
R. C. Moore
Rif A. Saurous
    SSL

Papers citing "Unsupervised Learning of Semantic Audio Representations"

49 / 49 papers shown
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
Ruben Ciranni
Emilian Postolache
Giorgio Mariani
Michele Mancusi
Giorgio Fabbro
Emanuele Rodolà
Luca Cosmo
79
7
0
10 Jan 2025
MATE: Meet At The Embedding -- Connecting Images with Long Texts
Young Kyun Jang
Junmo Kang
Yong Jae Lee
Donghyun Kim
VLM
46
5
0
26 Jun 2024
AudioRepInceptionNeXt: A lightweight single-stream architecture for efficient audio recognition
Kin Wai Lau
Yasar Abbas Ur Rehman
L. Po
51
1
0
21 Apr 2024
Rank Supervised Contrastive Learning for Time Series Classification
Qianying Ren
Dongsheng Luo
Dongjin Song
AI4TS
29
2
0
31 Jan 2024
Self-Supervised Learning for Few-Shot Bird Sound Classification
Ilyass Moummad
Romain Serizel
Nicolas Farrugia
SSL
28
9
0
25 Dec 2023
Private Matrix Factorization with Public Item Features
Mihaela Curmei
Walid Krichene
Li Zhang
Mukund Sundararajan
39
3
0
17 Sep 2023
Enhancing Unsupervised Audio Representation Learning via Adversarial Sample Generation
Yulin Pan
Xiangteng He
Biao Gong
Yuxin Peng
Yiliang Lv
SSL
29
0
0
15 Mar 2023
Improving Self-Supervised Learning for Audio Representations by Feature Diversity and Decorrelation
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
SSL
47
3
0
07 Mar 2023
Supervised and Unsupervised Learning of Audio Representations for Music Understanding
Matthew C. McCallum
Filip Korzeniowski
Sergio Oramas
F. Gouyon
Andreas F. Ehmann
SSL
80
37
0
07 Oct 2022
Representing Spatial Trajectories as Distributions
Dídac Surís
Carl Vondrick
41
5
0
04 Oct 2022
Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings
Badr M. Abdullah
Bernd Möbius
Dietrich Klakow
15
3
0
14 Sep 2022
MuLan: A Joint Embedding of Music Audio and Natural Language
Qingqing Huang
A. Jansen
Joonseok Lee
Ravi Ganti
Judith Yue Li
D. Ellis
30
131
0
26 Aug 2022
Towards Proper Contrastive Self-supervised Learning Strategies For Music Audio Representation
Jeong-Eun Choi
Seongwon Jang
Hyunsouk Cho
Sehee Chung
SSL
24
6
0
10 Jul 2022
Urban Rhapsody: Large-scale exploration of urban soundscapes
Joao Rulff
Fabio Miranda
Maryam Hosseini
Marcos Lage
M. Cartwright
Graham Dove
J. P. Bello
Claudio T. Silva
27
7
0
25 May 2022
DeLoRes: Decorrelating Latent Spaces for Low-Resource Audio Representation Learning
Sreyan Ghosh
Ashish Seth
Deepak Mittal
Maneesh Singh
S. Umesh
SSL
27
6
0
25 Mar 2022
Federated Self-Supervised Learning for Acoustic Event Classification
Meng Feng
Chieh-Chi Kao
Qingming Tang
Ming Sun
Viktor Rozgic
Spyros Matsoukas
Chao Wang
44
11
0
22 Mar 2022
A Brief Overview of Unsupervised Neural Speech Representation Learning
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
Lars Maaløe
Christian Igel
BDL
AI4TS
SSL
21
11
0
01 Mar 2022
Self-Supervised Beat Tracking in Musical Signals with Polyphonic Contrastive Learning
Dorian Desblancs
SSL
30
2
0
05 Jan 2022
Discrete and continuous representations and processing in deep learning: Looking forward
Ruben Cartuyvels
Graham Spinks
Marie-Francine Moens
OCL
38
20
0
04 Jan 2022
Towards Learning Universal Audio Representations
Luyu Wang
Pauline Luc
Yan Wu
Adrià Recasens
Lucas Smaira
...
Andrew Jaegle
Jean-Baptiste Alayrac
Sander Dieleman
João Carreira
Aaron van den Oord
SSL
39
68
0
23 Nov 2021
DECAR: Deep Clustering for learning general-purpose Audio Representations
Sreyan Ghosh
Sandesh V Katta
Ashish Seth
S. Umesh
SSL
36
12
0
17 Oct 2021
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Sangeeta Srivastava
Yun Wang
Andros Tjandra
Anurag Kumar
Chunxi Liu
Kritika Singh
Yatharth Saraf
SSL
38
24
0
14 Oct 2021
Universal Paralinguistic Speech Representations Using Self-Supervised Conformers
Joel Shor
A. Jansen
Wei Han
Daniel S. Park
Yu Zhang
SSL
AI4TS
48
54
0
09 Oct 2021
Cross-domain Semi-Supervised Audio Event Classification Using Contrastive Regularization
Donmoon Lee
Kyogu Lee
25
3
0
29 Sep 2021
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
Yu Zhang
Daniel S. Park
Wei Han
James Qin
Anmol Gulati
...
Zhifeng Chen
Quoc V. Le
Chung-Cheng Chiu
Ruoming Pang
Yonghui Wu
SSL
34
175
0
27 Sep 2021
Unsupervised Learning of Deep Features for Music Segmentation
Matthew C. McCallum
SSL
23
39
0
30 Aug 2021
Learning De-identified Representations of Prosody from Raw Audio
J. Weston
R. Lenain
U. Meepegama
E. Fristed
SSL
34
15
0
17 Jul 2021
Self-Supervised Learning from Automatically Separated Sound Scenes
Eduardo Fonseca
A. Jansen
D. Ellis
Scott Wisdom
Marco Tagliasacchi
J. Hershey
Manoj Plakal
Shawn Hershey
R. C. Moore
Xavier Serra
SSL
44
13
0
05 May 2021
Comparison and Analysis of Deep Audio Embeddings for Music Emotion Recognition
E. Koh
Shlomo Dubnov
32
38
0
13 Apr 2021
Broaden Your Views for Self-Supervised Video Learning
Adrià Recasens
Pauline Luc
Jean-Baptiste Alayrac
Luyu Wang
Ross Hemsley
...
Florent Altché
M. Valko
Jean-Bastien Grill
Aaron van den Oord
Andrew Zisserman
SSL
AI4TS
35
127
0
30 Mar 2021
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
38
175
0
11 Mar 2021
Enhancing Audio Augmentation Methods with Consistency Learning
Turab Iqbal
Karim Helwani
A. Krishnaswamy
Wenwu Wang
29
5
0
09 Feb 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
104
144
0
02 Feb 2021
Unsupervised Contrastive Learning of Sound Event Representations
Eduardo Fonseca
Diego Ortego
Kevin McGuinness
Noel E. O'Connor
Xavier Serra
SSL
27
65
0
15 Nov 2020
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags
Xavier Favory
Konstantinos Drossos
Tuomas Virtanen
Xavier Serra
32
15
0
27 Oct 2020
Contrastive Learning of General-Purpose Audio Representations
Aaqib Saeed
David Grangier
Neil Zeghidour
VLM
SSL
24
262
0
21 Oct 2020
FSD50K: An Open Dataset of Human-Labeled Sound Events
Eduardo Fonseca
Xavier Favory
Jordi Pons
F. Font
Xavier Serra
26
438
0
01 Oct 2020
Disentangled Multidimensional Metric Learning for Music Similarity
Jongpil Lee
Nicholas J. Bryan
Justin Salamon
Zeyu Jin
Juhan Nam
32
40
0
09 Aug 2020
Time-Frequency Scattering Accurately Models Auditory Similarities Between Instrumental Playing Techniques
Vincent Lostanlen
Christian El-Hajj
Mathias Rossignol
G. Lafay
Joakim Andén
Mathieu Lagrange
15
12
0
21 Jul 2020
Towards Learning a Universal Non-Semantic Representation of Speech
Joel Shor
A. Jansen
Ronnie Maor
Oran Lang
Omry Tuval
Félix de Chaumont Quitry
Marco Tagliasacchi
Ira Shavitt
Dotan Emanuel
Yinnon A. Haviv
SSL
51
155
0
25 Feb 2020
Multi-task self-supervised learning for Robust Speech Recognition
Mirco Ravanelli
Jianyuan Zhong
Santiago Pascual
P. Swietojanski
João Monteiro
J. Trmal
Yoshua Bengio
SSL
189
288
0
25 Jan 2020
Metric Learning with Background Noise Class for Few-shot Detection of Rare Sound Events
Kazuki Shimada
Yuichiro Koyama
A. Inoue
30
23
0
30 Oct 2019
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Wei-Ning Hsu
David Harwath
James R. Glass
SSL
26
32
0
09 Jul 2019
Self-supervised audio representation learning for mobile devices
Marco Tagliasacchi
Beat Gfeller
Félix de Chaumont Quitry
Dominik Roblek
SSL
AI4TS
6
46
0
24 May 2019
Training neural audio classifiers with few data
Jordi Pons
Joan Serrà
Xavier Serra
24
57
0
24 Oct 2018
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen
Sung-Feng Huang
Chia-Hao Shen
Hung-yi Lee
Lin-Shan Lee
51
37
0
21 Jul 2018
Unspeech: Unsupervised Speech Context Embeddings
Benjamin Milde
Chris Biemann
SSL
27
28
0
18 Apr 2018
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input
David Harwath
Adrià Recasens
Dídac Surís
Galen Chuang
Antonio Torralba
James R. Glass
32
200
0
04 Apr 2018
Learning audio sequence representations for acoustic event classification
Zixing Zhang
Ding Liu
Jing Han
Kun Qian
Björn Schuller
SSL
AI4TS
46
14
0
27 Jul 2017