ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.08484
  4. Cited By
MUSAN: A Music, Speech, and Noise Corpus

MUSAN: A Music, Speech, and Noise Corpus

28 October 2015
David Snyder
Guoguo Chen
Daniel Povey
ArXiv (abs)PDFHTML

Papers citing "MUSAN: A Music, Speech, and Noise Corpus"

50 / 664 papers shown
Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Aswin Sivaraman
Minje Kim
SSL
189
11
0
06 Nov 2020
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Desh Raj
Zili Huang
Sanjeev Khudanpur
168
37
0
05 Nov 2020
BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a
  Variable Number of Speakers
BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers
Eunjung Han
Chul Lee
A. Stolcke
182
46
0
05 Nov 2020
Small footprint Text-Independent Speaker Verification for Embedded
  Systems
Small footprint Text-Independent Speaker Verification for Embedded Systems
Julien Balian
Raffaele Tavarone
Mathieu Poumeyrol
A. Coucke
112
14
0
03 Nov 2020
Adapting Pretrained Transformer to Lattices for Spoken Language
  Understanding
Adapting Pretrained Transformer to Lattices for Spoken Language UnderstandingAutomatic Speech Recognition & Understanding (ASRU), 2019
Chao-Wei Huang
Yun-Nung Chen
108
38
0
02 Nov 2020
The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020
The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020
Xu Xiang
124
15
0
31 Oct 2020
Deep Speaker Vector Normalization with Maximum Gaussianality Training
Deep Speaker Vector Normalization with Maximum Gaussianality Training
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
191
6
0
30 Oct 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yoohwan Kwon
Hee-Soo Heo
Bong-Jin Lee
Joon Son Chung
187
67
0
29 Oct 2020
CopyPaste: An Augmentation Method for Speech Emotion Recognition
CopyPaste: An Augmentation Method for Speech Emotion RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
R. Pappagari
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
213
49
0
27 Oct 2020
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for
  Self-supervised Speech Representation Learning
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation LearningInterspeech (Interspeech), 2020
Dongwei Jiang
Wubo Li
Miao Cao
Wei Zou
Xiangang Li
SSL
295
72
0
27 Oct 2020
Integrating end-to-end neural and clustering-based diarization: Getting
  the best of both worlds
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worldsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
K. Kinoshita
Marc Delcroix
Naohiro Tawara
236
102
0
26 Oct 2020
An iterative framework for self-supervised deep speaker representation
  learning
An iterative framework for self-supervised deep speaker representation learningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Danwei Cai
Weiqing Wang
Ming Li
SSL
120
43
0
25 Oct 2020
The IDLAB VoxCeleb Speaker Recognition Challenge 2020 System Description
The IDLAB VoxCeleb Speaker Recognition Challenge 2020 System Description
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
141
52
0
23 Oct 2020
Compositional embedding models for speaker identification and
  diarization with simultaneous speech from 2+ speakers
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers
Zeqian Li
Jacob Whitehill
231
13
0
22 Oct 2020
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker
  Diarisation Challenge
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge
Renyu Wang
Ruilin Tong
Y. Yeung
Xiao Chen
97
1
0
22 Oct 2020
Momentum Contrast Speaker Representation Learning
Momentum Contrast Speaker Representation Learning
Jangho Lee
Jaihyun Koh
Sungroh Yoon
SSL
109
3
0
22 Oct 2020
Unsupervised Representation Learning for Speaker Recognition via
  Contrastive Equilibrium Learning
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning
Sung Hwan Mun
Woohyun Kang
Min Hyun Han
N. Kim
SSL
150
23
0
22 Oct 2020
The IDLAB VoxSRC-20 Submission: Large Margin Fine-Tuning and
  Quality-Aware Score Calibration in DNN Based Speaker Verification
The IDLAB VoxSRC-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
181
104
0
21 Oct 2020
Joint Blind Room Acoustic Characterization From Speech And Music Signals
  Using Convolutional Recurrent Neural Networks
Joint Blind Room Acoustic Characterization From Speech And Music Signals Using Convolutional Recurrent Neural Networks
Paul Callens
Milos Cernak
124
11
0
21 Oct 2020
Contrastive Learning of General-Purpose Audio Representations
Contrastive Learning of General-Purpose Audio Representations
Aaqib Saeed
David Grangier
Neil Zeghidour
VLMSSL
237
308
0
21 Oct 2020
Tongji University Undergraduate Team for the VoxCeleb Speaker
  Recognition Challenge2020
Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020
Shufan Shen
Ran Miao
Yi Wang
Zhihua Wei
99
0
0
20 Oct 2020
Tongji University Team for the VoxCeleb Speaker Recognition Challenge
  2020
Tongji University Team for the VoxCeleb Speaker Recognition Challenge 2020
Rui Wang
Zhihua Wei
Yibin Zhan
Zhuoxiao Chen
73
0
0
16 Oct 2020
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition
  Evaluation
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation
Rohan Kumar Das
Ruijie Tao
Jichen Yang
Wei Rao
Cheng Yu
Haizhou Li
120
11
0
08 Oct 2020
A Unified Deep Learning Framework for Short-Duration Speaker
  Verification in Adverse Environments
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse EnvironmentsIEEE Access (IEEE Access), 2020
Youngmoon Jung
Yeunju Choi
Hyungjun Lim
Hoirin Kim
143
13
0
06 Oct 2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge
  2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020
Hee-Soo Heo
Bong-Jin Lee
Jaesung Huh
Joon Son Chung
115
147
0
29 Sep 2020
Residual acoustic echo suppression based on efficient multi-task
  convolutional neural network
Residual acoustic echo suppression based on efficient multi-task convolutional neural network
Xinquan Zhou
Yanhong Leng
85
9
0
29 Sep 2020
Howl: A Deployed, Open-Source Wake Word Detection System
Howl: A Deployed, Open-Source Wake Word Detection System
Raphael Tang
Jaejun Lee
Afsaneh Razi
Julia Cambre
Ian Bicking
Jofish Kaye
Jimmy J. Lin
VLM
128
17
0
21 Aug 2020
S-vectors and TESA: Speaker Embeddings and a Speaker Authenticator Based
  on Transformer Encoder
S-vectors and TESA: Speaker Embeddings and a Speaker Authenticator Based on Transformer EncoderIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Narla John Metilda Sagaya Mary
S. Umesh
Sandesh V Katta
145
33
0
11 Aug 2020
Variable frame rate-based data augmentation to handle speaking-style
  variability for automatic speaker verification
Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verificationInterspeech (Interspeech), 2020
Amber Afshan
Jinxi Guo
S. Park
Vijay Ravi
A. McCree
Abeer Alwan
111
6
0
08 Aug 2020
Exploring the Use of an Unsupervised Autoregressive Model as a Shared
  Encoder for Text-Dependent Speaker Verification
Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker VerificationInterspeech (Interspeech), 2020
Vijay Ravi
Ruchao Fan
Amber Afshan
Huanhua Lu
Abeer Alwan
112
9
0
08 Aug 2020
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker
  Verification Challenge
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification ChallengeInterspeech (Interspeech), 2020
Li Zhang
Jian Wu
Lei Xie
246
12
0
08 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with
  Adversarial Learning
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
169
6
0
05 Aug 2020
Unacceptable, where is my privacy? Exploring Accidental Triggers of
  Smart Speakers
Unacceptable, where is my privacy? Exploring Accidental Triggers of Smart Speakers
Lea Schonherr
Maximilian Golla
Thorsten Eisenhofer
Jan Wiele
D. Kolossa
Thorsten Holz
115
42
0
02 Aug 2020
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech
  Enhancement
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech EnhancementInterspeech (Interspeech), 2020
Yanxin Hu
Yun Liu
Shubo Lv
Mengtao Xing
Shimin Zhang
Yihui Fu
Jian Wu
Bihong Zhang
Lei Xie
485
717
0
01 Aug 2020
Designing Neural Speaker Embeddings with Meta Learning
Designing Neural Speaker Embeddings with Meta Learning
Manoj Kumar
Tae Jin Park
Somer Bishop
Shrikanth Narayanan
207
10
0
31 Jul 2020
A Comparative Re-Assessment of Feature Extractors for Deep Speaker
  Embeddings
A Comparative Re-Assessment of Feature Extractors for Deep Speaker EmbeddingsInterspeech (Interspeech), 2020
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
91
10
0
30 Jul 2020
Multimodal Integration for Large-Vocabulary Audio-Visual Speech
  Recognition
Multimodal Integration for Large-Vocabulary Audio-Visual Speech RecognitionEuropean Signal Processing Conference (EUSIPCO), 2020
Wentao Yu
Steffen Zeiler
D. Kolossa
171
11
0
28 Jul 2020
Augmentation adversarial training for self-supervised speaker
  recognition
Augmentation adversarial training for self-supervised speaker recognition
Jaesung Huh
Hee-Soo Heo
Jingu Kang
Shinji Watanabe
Joon Son Chung
SSL
211
78
0
23 Jul 2020
Cross-Lingual Speaker Verification with Domain-Balanced Hard Prototype
  Mining and Language-Dependent Score Normalization
Cross-Lingual Speaker Verification with Domain-Balanced Hard Prototype Mining and Language-Dependent Score NormalizationInterspeech (Interspeech), 2020
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
121
26
0
15 Jul 2020
Data Augmenting Contrastive Learning of Speech Representations in the
  Time Domain
Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
Eugene Kharitonov
M. Rivière
Gabriel Synnaeve
Lior Wolf
Pierre-Emmanuel Mazaré
Matthijs Douze
Emmanuel Dupoux
211
123
0
02 Jul 2020
Data augmentation versus noise compensation for x- vector speaker
  recognition systems in noisy environments
Data augmentation versus noise compensation for x- vector speaker recognition systems in noisy environments
Mohammad MohammadAmini
D. Matrouf
113
15
0
29 Jun 2020
A study on more realistic room simulation for far-field keyword spotting
A study on more realistic room simulation for far-field keyword spotting
Eric Bezzam
Robin Scheibler
C. Cadoux
Thibault Gisselbrecht
170
12
0
04 Jun 2020
Online End-to-End Neural Diarization with Speaker-Tracing Buffer
Online End-to-End Neural Diarization with Speaker-Tracing BufferSpoken Language Technology Workshop (SLT), 2020
Yawen Xue
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Kenji Nagamatsu
196
52
0
04 Jun 2020
Graph2Speak: Improving Speaker Identification using Network Knowledge in
  Criminal Conversational Data
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data
Mael Fabien
Seyyed Saeed Sarfjoo
P. Motlícek
S. Madikeri
172
3
0
03 Jun 2020
Inaudible Adversarial Perturbations for Targeted Attack in Speaker
  Recognition
Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition
Qing Wang
Pengcheng Guo
Lei Xie
AAML
192
62
0
21 May 2020
SADDEL: Joint Speech Separation and Denoising Model based on Multitask
  Learning
SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning
Yuan-Kuei Wu
Chao-I Tuan
Hung-yi Lee
Yu Tsao
115
4
0
20 May 2020
End-to-End Speaker Diarization for an Unknown Number of Speakers with
  Encoder-Decoder Based Attractors
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Kenji Nagamatsu
312
218
0
20 May 2020
Wake Word Detection with Alignment-Free Lattice-Free MMI
Wake Word Detection with Alignment-Free Lattice-Free MMI
Yiming Wang
Hang Lv
Daniel Povey
Lei Xie
Sanjeev Khudanpur
ObjD
215
17
0
17 May 2020
Single Channel Far Field Feature Enhancement For Speaker Verification In
  The Wild
Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild
P. S. Nidadavolu
Saurabh Kataria
Leibny Paola García-Perera
Jesús Villalba
Najim Dehak
115
3
0
17 May 2020
Sparse Mixture of Local Experts for Efficient Speech Enhancement
Sparse Mixture of Local Experts for Efficient Speech Enhancement
Aswin Sivaraman
Minje Kim
MoE
118
14
0
16 May 2020
Previous
123...11121314
Next