ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.07616
  4. Cited By
Unsupervised Contrastive Learning of Sound Event Representations

Unsupervised Contrastive Learning of Sound Event Representations

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
15 November 2020
Eduardo Fonseca
Diego Ortego
Kevin McGuinness
Noel E. O'Connor
Xavier Serra
    SSL
ArXiv (abs)PDFHTML

Papers citing "Unsupervised Contrastive Learning of Sound Event Representations"

46 / 46 papers shown
Latent Multi-view Learning for Robust Environmental Sound Representations
Latent Multi-view Learning for Robust Environmental Sound Representations
Sivan Ding
J. Wilkins
Magdalena Fuentes
J. P. Bello
310
0
0
02 Oct 2025
Balancing Information Preservation and Disentanglement in Self-Supervised Music Representation Learning
Balancing Information Preservation and Disentanglement in Self-Supervised Music Representation Learning
J. Wilkins
Sivan Ding
Magdalena Fuentes
J. P. Bello
422
1
0
30 Jul 2025
Prototype based Masked Audio Model for Self-Supervised Learning of Sound
  Event Detection
Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Pengfei Cai
Yan Song
Nan Jiang
Qing Gu
Ian Mcloughlin
275
5
0
26 Sep 2024
Domain-Invariant Representation Learning of Bird Sounds
Domain-Invariant Representation Learning of Bird Sounds
Ilyass Moummad
Romain Serizel
Emmanouil Benetos
Nicolas Farrugia
SSL
473
8
0
13 Sep 2024
Contrastive Learning from Synthetic Audio Doppelgängers
Contrastive Learning from Synthetic Audio DoppelgängersInternational Conference on Learning Representations (ICLR), 2024
Manuel Cherep
Nikhil Singh
431
1
0
09 Jun 2024
On the Effect of Data-Augmentation on Local Embedding Properties in the
  Contrastive Learning of Music Audio Representations
On the Effect of Data-Augmentation on Local Embedding Properties in the Contrastive Learning of Music Audio RepresentationsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Matthew C. McCallum
Matthew E. P. Davies
Florian Henkel
Jaehun Kim
Samuel E. Sandberg
236
10
0
17 Jan 2024
Self-Supervised Learning for Few-Shot Bird Sound Classification
Self-Supervised Learning for Few-Shot Bird Sound Classification
Ilyass Moummad
Romain Serizel
Nicolas Farrugia
SSL
458
16
0
25 Dec 2023
Holistic Representation Learning for Multitask Trajectory Anomaly
  Detection
Holistic Representation Learning for Multitask Trajectory Anomaly DetectionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Alexandros Stergiou
B. D. Weerdt
Nikos Deligiannis
311
22
0
03 Nov 2023
Exploring Self-Supervised Contrastive Learning of Spatial Sound Event
  Representation
Exploring Self-Supervised Contrastive Learning of Spatial Sound Event RepresentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xilin Jiang
Cong Han
Yinghao Aaron Li
N. Mesgarani
SSL
238
8
0
27 Sep 2023
Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound
  Detection
Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ilyass Moummad
Romain Serizel
Nicolas Farrugia
250
10
0
16 Sep 2023
Optimizing Audio Augmentations for Contrastive Learning of
  Health-Related Acoustic Signals
Optimizing Audio Augmentations for Contrastive Learning of Health-Related Acoustic Signals
Louis Blankemeier
Sebastien Baur
Wei-Hung Weng
Jake Garrison
Yossi Matias
Shruthi Prabhakara
Diego Ardila
Zaid Nabulsi
247
0
0
11 Sep 2023
Pretraining Representations for Bioacoustic Few-shot Detection using
  Supervised Contrastive Learning
Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning
Ilyass Moummad
Romain Serizel
Nicolas Farrugia
278
2
0
02 Sep 2023
Self-supervised Audio Teacher-Student Transformer for Both Clip-level
  and Frame-level Tasks
Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level TasksIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Xian Li
Nian Shao
Xiaofei Li
ViTCLIP
439
52
0
07 Jun 2023
Discovering COVID-19 Coughing and Breathing Patterns from Unlabeled Data
  Using Contrastive Learning with Varying Pre-Training Domains
Discovering COVID-19 Coughing and Breathing Patterns from Unlabeled Data Using Contrastive Learning with Varying Pre-Training DomainsInterspeech (Interspeech), 2023
Jinjin Cai
Sudip Vhaduri
Xiao Luo
SSL
197
6
0
02 Jun 2023
Masked Autoencoders with Multi-Window Local-Global Attention Are Better
  Audio Learners
Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio LearnersInternational Conference on Learning Representations (ICLR), 2023
Sarthak Yadav
Sergios Theodoridis
Lars Kai Hansen
Zheng-Hua Tan
310
15
0
01 Jun 2023
MT-SLVR: Multi-Task Self-Supervised Learning for Transformation
  In(Variant) Representations
MT-SLVR: Multi-Task Self-Supervised Learning for Transformation In(Variant) RepresentationsInterspeech (Interspeech), 2023
Calum Heggan
Timothy M. Hospedales
S. Budgett
Mehrdad Yaghoobi
SSL
398
7
0
29 May 2023
Pengi: An Audio Language Model for Audio Tasks
Pengi: An Audio Language Model for Audio TasksNeural Information Processing Systems (NeurIPS), 2023
Soham Deshmukh
Benjamin Elizalde
Rita Singh
Huaming Wang
MLLMAuLLM
520
260
0
19 May 2023
Looking Similar, Sounding Different: Leveraging Counterfactual
  Cross-Modal Pairs for Audiovisual Representation Learning
Looking Similar, Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation LearningComputer Vision and Pattern Recognition (CVPR), 2023
Nikhil Singh
Chih-Wei Wu
Iroro Orife
Mahdi M. Kalayeh
452
3
0
12 Apr 2023
Leveraging Neural Representations for Audio Manipulation
Leveraging Neural Representations for Audio Manipulation
Scott H. Hawley
C. Steinmetz
140
3
0
10 Apr 2023
Enhancing Unsupervised Audio Representation Learning via Adversarial
  Sample Generation
Enhancing Unsupervised Audio Representation Learning via Adversarial Sample Generation
Yulin Pan
Xiangteng He
Biao Gong
Yuxin Peng
Yiliang Lv
SSL
147
0
0
15 Mar 2023
Improving Self-Supervised Learning for Audio Representations by Feature
  Diversity and Decorrelation
Improving Self-Supervised Learning for Audio Representations by Feature Diversity and DecorrelationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Bac Nguyen
Stefan Uhlich
Fabien Cardinaux
SSL
243
4
0
07 Mar 2023
Multi-Source Contrastive Learning from Musical Audio
Multi-Source Contrastive Learning from Musical Audio
C. Garoufis
Athanasia Zlatintsi
Petros Maragos
308
11
0
14 Feb 2023
BEATs: Audio Pre-Training with Acoustic Tokenizers
BEATs: Audio Pre-Training with Acoustic TokenizersInternational Conference on Machine Learning (ICML), 2022
Sanyuan Chen
Yu-Huan Wu
Chengyi Wang
Shujie Liu
Daniel C. Tompkins
Zhuo Chen
Furu Wei
538
542
0
18 Dec 2022
Self-supervised learning of audio representations using angular
  contrastive loss
Self-supervised learning of audio representations using angular contrastive lossIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Shanshan Wang
S. Tripathy
A. Mesaros
SSL
363
5
0
10 Nov 2022
Pretraining Respiratory Sound Representations using Metadata and
  Contrastive Learning
Pretraining Respiratory Sound Representations using Metadata and Contrastive LearningIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2022
Ilyass Moummad
Nicolas Farrugia
343
35
0
27 Oct 2022
Representing Spatial Trajectories as Distributions
Representing Spatial Trajectories as DistributionsNeural Information Processing Systems (NeurIPS), 2022
Dídac Surís
Carl Vondrick
304
6
0
04 Oct 2022
Audio Barlow Twins: Self-Supervised Audio Representation Learning
Audio Barlow Twins: Self-Supervised Audio Representation LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jonah Anton
H. Coppock
Pancham Shukla
Bjorn W. Schuller
BDLSSL
321
14
0
28 Sep 2022
Contrastive Audio-Language Learning for Music
Contrastive Audio-Language Learning for MusicInternational Society for Music Information Retrieval Conference (ISMIR), 2022
Ilaria Manco
Emmanouil Benetos
Elio Quinton
Gyorgy Fazekas
404
64
0
25 Aug 2022
Representation Learning for the Automatic Indexing of Sound Effects
  Libraries
Representation Learning for the Automatic Indexing of Sound Effects LibrariesInternational Society for Music Information Retrieval Conference (ISMIR), 2022
Alison B. Ma
Alexander Lerch
173
0
0
18 Aug 2022
Self-supervised Learning of Audio Representations from Audio-Visual Data
  using Spatial Alignment
Self-supervised Learning of Audio Representations from Audio-Visual Data using Spatial AlignmentIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Shanshan Wang
Archontis Politis
A. Mesaros
Maria Sandsten
SSL
160
12
0
02 Jun 2022
Masked Spectrogram Modeling using Masked Autoencoders for Learning
  General-purpose Audio Representation
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
295
90
0
26 Apr 2022
ATST: Audio Representation Learning with Teacher-Student Transformer
ATST: Audio Representation Learning with Teacher-Student TransformerInterspeech (Interspeech), 2022
Xian Li
Xiaofei Li
ViT
153
28
0
26 Apr 2022
BYOL for Audio: Exploring Pre-trained General-purpose Audio
  Representations
BYOL for Audio: Exploring Pre-trained General-purpose Audio RepresentationsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
331
84
0
15 Apr 2022
Learning neural audio features without supervision
Learning neural audio features without supervisionInterspeech (Interspeech), 2022
Sarthak Yadav
Neil Zeghidour
SSL
150
5
0
29 Mar 2022
HEAR: Holistic Evaluation of Audio Representations
HEAR: Holistic Evaluation of Audio RepresentationsNeural Information Processing Systems (NeurIPS), 2022
Joseph P. Turian
Jordie Shier
H. Khan
Bhiksha Raj
Björn W. Schuller
...
P. Esling
Pranay Manocha
Shinji Watanabe
Zeyu Jin
Yonatan Bisk
454
141
0
06 Mar 2022
Audio Self-supervised Learning: A Survey
Audio Self-supervised Learning: A SurveyPatterns (Patterns), 2022
Shuo Liu
Adria Mallol-Ragolta
Emilia Parada-Cabeleiro
Kun Qian
Xingshuo Jing
Alexander Kathan
Bin Hu
Bjoern W. Schuller
SSL
339
136
0
02 Mar 2022
Augmented Contrastive Self-Supervised Learning for Audio Invariant
  Representations
Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations
M Motavali Emami
Dung T. Tran
K. Koishida
SSL
187
2
0
21 Dec 2021
SpliceOut: A Simple and Efficient Audio Augmentation Method
SpliceOut: A Simple and Efficient Audio Augmentation Method
Arjit Jain
Pranay Reddy Samala
Deepak Mittal
Preethi Jyothi
M. Singh
553
14
0
30 Sep 2021
Using system context information to complement weakly labeled data
Using system context information to complement weakly labeled data
Matthias Meyer
M. Wenner
C. Hibert
Fabian Walter
Lothar Thiele
109
1
0
19 Jul 2021
Proceedings of the First Workshop on Weakly Supervised Learning (WeaSuL)
Proceedings of the First Workshop on Weakly Supervised Learning (WeaSuL)
Michael A. Hedderich
Benjamin Roth
Katharina Kann
Barbara Plank
Alex Ratner
Dietrich Klakow
207
0
0
08 Jul 2021
Improving Sound Event Classification by Increasing Shift Invariance in
  Convolutional Neural Networks
Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks
Eduardo Fonseca
Andrés Ferraro
Xavier Serra
AI4TS
377
11
0
01 Jul 2021
Self-Supervised Learning from Automatically Separated Sound Scenes
Self-Supervised Learning from Automatically Separated Sound ScenesIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
Eduardo Fonseca
A. Jansen
D. Ellis
Scott Wisdom
Marco Tagliasacchi
J. Hershey
Manoj Plakal
Shawn Hershey
R. C. Moore
Xavier Serra
SSL
287
13
0
05 May 2021
Multimodal Self-Supervised Learning of General Audio Representations
Multimodal Self-Supervised Learning of General Audio Representations
Luyu Wang
Pauline Luc
Adrià Recasens
Jean-Baptiste Alayrac
Aaron van den Oord
SSL
314
45
0
26 Apr 2021
Enriched Music Representations with Multiple Cross-modal Contrastive
  Learning
Enriched Music Representations with Multiple Cross-modal Contrastive LearningIEEE Signal Processing Letters (IEEE SPL), 2021
Andrés Ferraro
Xavier Favory
Konstantinos Drossos
Yuntae Kim
Dmitry Bogdanov
382
30
0
01 Apr 2021
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio
  Representation
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio RepresentationIEEE International Joint Conference on Neural Network (IJCNN), 2021
Daisuke Niizumi
Daiki Takeuchi
Yasunori Ohishi
Noboru Harada
K. Kashino
SSL
412
204
0
11 Mar 2021
FSD50K: An Open Dataset of Human-Labeled Sound Events
FSD50K: An Open Dataset of Human-Labeled Sound EventsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Eduardo Fonseca
Xavier Favory
Jordi Pons
F. Font
Xavier Serra
778
646
0
01 Oct 2020
1
Page 1 of 1