ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.08484
  4. Cited By
MUSAN: A Music, Speech, and Noise Corpus

MUSAN: A Music, Speech, and Noise Corpus

28 October 2015
David Snyder
Guoguo Chen
Daniel Povey
ArXiv (abs)PDFHTML

Papers citing "MUSAN: A Music, Speech, and Noise Corpus"

50 / 664 papers shown
NIST SRE CTS Superset: A large-scale dataset for telephony speaker
  recognition
NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition
S. O. Sadjadi
AI4TS
60
27
0
16 Aug 2021
Xi-Vector Embedding for Speaker Recognition
Xi-Vector Embedding for Speaker RecognitionIEEE Signal Processing Letters (IEEE SPL), 2021
Kong Aik Lee
Qiongqiong Wang
Takafumi Koshinaka
BDL
69
39
0
12 Aug 2021
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency
  Domain Features and a Pre-trained Acoustic Model
Multi-channel Speech Enhancement with 2-D Convolutional Time-frequency Domain Features and a Pre-trained Acoustic ModelAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Quandong Wang
Junnan Wu
Zhao Yan
Sichong Qian
Liyong Guo
Lichun Fan
Weiji Zhuang
Peng Gao
Yujun Wang
234
0
0
23 Jul 2021
Is Someone Speaking? Exploring Long-term Temporal Features for
  Audio-visual Active Speaker Detection
Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker DetectionACM Multimedia (ACM MM), 2021
Ruijie Tao
Zexu Pan
Rohan Kumar Das
Xinyuan Qian
Mike Zheng Shou
Haizhou Li
205
218
0
14 Jul 2021
DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech
  Enhancement
DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement
Xiaohuai Le
Hongsheng Chen
Kai-Jyun Chen
Jing Lu
212
101
0
12 Jul 2021
MACCIF-TDNN: Multi aspect aggregation of channel and context
  interdependence features in TDNN-based speaker verification
MACCIF-TDNN: Multi aspect aggregation of channel and context interdependence features in TDNN-based speaker verification
Fangyuan Wang
Z. Song
Hongchen Jiang
Bo Xu
102
8
0
07 Jul 2021
The HCCL Speaker Verification System for Far-Field Speaker Verification
  Challenge
The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge
Zhuo Li
Ce Fang
Runqiu Xiao
Zhigao Chen
Wenchao Wang
Yonghong Yan
125
2
0
03 Jul 2021
An Integrated Framework for Two-pass Personalized Voice Trigger
An Integrated Framework for Two-pass Personalized Voice TriggerInterspeech (Interspeech), 2021
Dexin Liao
Jing Li
Yiming Zhi
Song Li
Q. Hong
Lin Li
180
1
0
30 Jun 2021
A Simultaneous Denoising and Dereverberation Framework with Target
  Decoupling
A Simultaneous Denoising and Dereverberation Framework with Target Decoupling
Andong Li
Wenzhe Liu
Xiaoxue Luo
Guochen Yu
C. Zheng
Xiaodong Li
175
64
0
24 Jun 2021
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker
  Verification
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification
Li Zhang
Qing Wang
Kong Aik Lee
Lei Xie
Haizhou Li
165
14
0
17 Jun 2021
End-to-end Neural Diarization: From Transformer to Conformer
End-to-end Neural Diarization: From Transformer to ConformerInterspeech (Interspeech), 2021
Yi Y. Liu
Eunjung Han
Chul Lee
A. Stolcke
208
47
0
14 Jun 2021
Noise Classification Aided Attention-Based Neural Network for Monaural
  Speech Enhancement
Noise Classification Aided Attention-Based Neural Network for Monaural Speech Enhancement
Lu Ma
Song Yang
Y. Gong
Zhongqin Wu
119
0
0
31 May 2021
DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding
DIVE: End-to-end Speech Diarization via Iterative Speaker EmbeddingAutomatic Speech Recognition & Understanding (ASRU), 2021
Neil Zeghidour
O. Teboul
David Grangier
125
13
0
28 May 2021
Cross-Referencing Self-Training Network for Sound Event Detection in
  Audio Mixtures
Cross-Referencing Self-Training Network for Sound Event Detection in Audio MixturesIEEE transactions on multimedia (IEEE Trans. Multimedia), 2021
Sangwook Park
D. Han
Mounya Elhilali
180
14
0
27 May 2021
Advances in integration of end-to-end neural and clustering-based
  diarization for real conversational speech
Advances in integration of end-to-end neural and clustering-based diarization for real conversational speechInterspeech (Interspeech), 2021
K. Kinoshita
Marc Delcroix
Naohiro Tawara
252
79
0
19 May 2021
X-Vectors with Multi-Scale Aggregation for Speaker Diarization
X-Vectors with Multi-Scale Aggregation for Speaker Diarization
Myung-Jae Kim
V. Apsingekar
Divya Neelagiri
119
0
0
16 May 2021
Study on the temporal pooling used in deep neural networks for speaker
  verification
Study on the temporal pooling used in deep neural networks for speaker verificationEuropean Signal Processing Conference (EUSIPCO), 2021
Mickael Rouvier
Pierre-Michel Bousquet
J. Duret
136
8
0
10 May 2021
Voice activity detection in the wild: A data-driven approach using
  teacher-student training
Voice activity detection in the wild: A data-driven approach using teacher-student trainingIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Heinrich Dinkel
Shuai Wang
Xuenan Xu
Mengyue Wu
K. Yu
VLM
117
40
0
10 May 2021
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot
  Learning with Knowledge Distillation
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge DistillationIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
Sunwoo Kim
Minje Kim
174
28
0
08 May 2021
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model
  Selection
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model SelectionIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021
Aswin Sivaraman
Minje Kim
157
11
0
08 May 2021
Multimodal Self-Supervised Learning of General Audio Representations
Multimodal Self-Supervised Learning of General Audio Representations
Luyu Wang
Pauline Luc
Adrià Recasens
Jean-Baptiste Alayrac
Aaron van den Oord
SSL
252
44
0
26 Apr 2021
Fusing information streams in end-to-end audio-visual speech recognition
Fusing information streams in end-to-end audio-visual speech recognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Wentao Yu
Steffen Zeiler
D. Kolossa
242
15
0
19 Apr 2021
Learning Metrics from Mean Teacher: A Supervised Learning Method for
  Improving the Generalization of Speaker Verification System
Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System
Ju-ho Kim
Hye-jin Shim
Jee-weon Jung
Ha-Jin Yu
182
1
0
14 Apr 2021
End-to-end speaker segmentation for overlap-aware resegmentation
End-to-end speaker segmentation for overlap-aware resegmentationInterspeech (Interspeech), 2021
H. Bredin
Antoine Laurent
VLM
619
197
0
08 Apr 2021
Utilizing Self-supervised Representations for MOS Prediction
Utilizing Self-supervised Representations for MOS PredictionInterspeech (Interspeech), 2021
Wei-Cheng Tseng
Chien-yu Huang
Wei-Tsung Kao
Yist Y. Lin
Hung-yi Lee
SSL
404
71
0
07 Apr 2021
Three-class Overlapped Speech Detection using a Convolutional Recurrent
  Neural Network
Three-class Overlapped Speech Detection using a Convolutional Recurrent Neural NetworkInterspeech (Interspeech), 2021
Jee-weon Jung
Hee-Soo Heo
Youngki Kwon
Joon Son Chung
Bong-Jin Lee
273
22
0
07 Apr 2021
Personalized Speech Enhancement through Self-Supervised Data
  Augmentation and Purification
Personalized Speech Enhancement through Self-Supervised Data Augmentation and PurificationInterspeech (Interspeech), 2021
Aswin Sivaraman
Sunwoo Kim
Minje Kim
232
25
0
05 Apr 2021
Efficient Personalized Speech Enhancement through Self-Supervised
  Learning
Efficient Personalized Speech Enhancement through Self-Supervised LearningIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2021
Aswin Sivaraman
Minje Kim
226
23
0
05 Apr 2021
Attention Back-end for Automatic Speaker Verification with Multiple
  Enrollment Utterances
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment UtterancesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Chang Zeng
Xin Wang
Erica Cooper
Xiaoxiao Miao
Junichi Yamagishi
168
27
0
04 Apr 2021
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field
  Multi-Channel Speech Enhancement for Video Conferencing
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Wei Rao
Yihui Fu
Yanxin Hu
Xin Xu
Yvkai Jv
...
Shinji Watanabe
Zheng-Hua Tan
Hui Bu
Tao Yu
Shidong Shang
149
12
0
02 Apr 2021
Multilingual and code-switching ASR challenges for low resource Indian
  languages
Multilingual and code-switching ASR challenges for low resource Indian languagesInterspeech (Interspeech), 2021
Anuj Diwan
Rakesh Vaideeswaran
Sanket Shah
Ankita Singh
Srinivasa Raghavan
...
Jai Nanavati
Raoul Nanavati
Karthik Sankaranarayanan
Tejaswi Seeram
Basil Abraham
138
109
0
01 Apr 2021
Auto-KWS 2021 Challenge: Task, Datasets, and Baselines
Auto-KWS 2021 Challenge: Task, Datasets, and BaselinesInterspeech (Interspeech), 2021
Jingsong Wang
Yuxuan He
Chunyu Zhao
Qijie Shao
Wei-Wei Tu
Tom Ko
Hung-yi Lee
Lei Xie
118
5
0
31 Mar 2021
Quantifying Bias in Automatic Speech Recognition
Quantifying Bias in Automatic Speech Recognition
Siyuan Feng
O. Kudina
B. Halpern
O. Scharenborg
185
99
0
28 Mar 2021
EfficientTDNN: Efficient Architecture Search for Speaker Recognition
EfficientTDNN: Efficient Architecture Search for Speaker RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Rui Wang
Zhihua Wei
Haoran Duan
S. Ji
Yang Long
Zhenhou Hong
263
20
0
25 Mar 2021
USTC-NELSLIP System Description for DIHARD-III Challenge
USTC-NELSLIP System Description for DIHARD-III Challenge
Yuxuan Wang
Maokui He
Shutong Niu
Lei Sun
Tian Gao
Xin Fang
Jia Pan
Jun Du
Chin-Hui Lee
143
32
0
19 Mar 2021
Learning spectro-temporal representations of complex sounds with
  parameterized neural networks
Learning spectro-temporal representations of complex sounds with parameterized neural networksJournal of the Acoustical Society of America (JASA), 2021
Rachid Riad
Julien Karadayi
Anne-Catherine Bachoud-Lévi
Emmanuel Dupoux
141
8
0
12 Mar 2021
An Ultra-low Power RNN Classifier for Always-On Voice Wake-Up Detection
  Robust to Real-World Scenarios
An Ultra-low Power RNN Classifier for Always-On Voice Wake-Up Detection Robust to Real-World Scenarios
E. Hardy
F. Badets
96
4
0
08 Mar 2021
The NPU System for the 2020 Personalized Voice Trigger Challenge
The NPU System for the 2020 Personalized Voice Trigger Challenge
Jingyong Hou
Li Zhang
Yihui Fu
Qing Wang
Zhanheng Yang
Qijie Shao
Lei Xie
123
8
0
26 Feb 2021
Artificially Synthesising Data for Audio Classification and Segmentation
  to Improve Speech and Music Detection in Radio Broadcast
Artificially Synthesising Data for Audio Classification and Segmentation to Improve Speech and Music Detection in Radio BroadcastIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
S. Venkatesh
D. Moffat
Alexis Kirke
Gözel Shakeri
S. Brewster
...
Helen Odell-Miller
Alexander J. Street
Nicolas Farina
Sube Banerjee
E. Miranda
95
12
0
19 Feb 2021
AISPEECH-SJTU accent identification system for the Accented English
  Speech Recognition Challenge
AISPEECH-SJTU accent identification system for the Accented English Speech Recognition ChallengeIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Houjun Huang
Xu Xiang
Yexin Yang
Rao Ma
Y. Qian
164
29
0
19 Feb 2021
An Investigation of End-to-End Models for Robust Speech Recognition
An Investigation of End-to-End Models for Robust Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Archiki Prasad
Preethi Jyothi
R. Velmurugan
150
23
0
11 Feb 2021
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech
  Diarization Challenge
The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge
Weiqing Wang
Qingjian Lin
Danwei Cai
Lin Yang
Ming Li
170
8
0
06 Feb 2021
The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural
  Diarization and X-Vector Clustering Systems Combined by DOVER-Lap
The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap
Shota Horiguchi
Nelson Yalta
Leibny Paola García-Perera
Yuki Takashima
Yawen Xue
Desh Raj
Zili Huang
Yusuke Fujita
Shinji Watanabe
Sanjeev Khudanpur
BDL
117
40
0
02 Feb 2021
Online Streaming End-to-End Neural Diarization Handling Overlapping
  Speech and Flexible Numbers of Speakers
Online Streaming End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
Yawen Xue
Shota Horiguchi
Yusuke Fujita
Yuki Takashima
Shinji Watanabe
Leibny Paola García-Perera
Kenji Nagamatsu
190
6
0
21 Jan 2021
A Principle Solution for Enroll-Test Mismatch in Speaker Recognition
A Principle Solution for Enroll-Test Mismatch in Speaker RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Lantian Li
Dong Wang
Jiawen Kang
Renyu Wang
Jingqian Wu
Zhendong Gao
Xiao Chen
143
8
0
23 Dec 2020
CN-Celeb: multi-genre speaker recognition
CN-Celeb: multi-genre speaker recognitionSpeech Communication (Speech Commun.), 2020
Lantian Li
Ruiqi Liu
Jiawen Kang
Yue Fan
Hao Cui
Yunqi Cai
Ravichander Vipperla
Tianshi Zheng
Dong Wang
220
143
0
23 Dec 2020
End-to-End Speaker Diarization as Post-Processing
End-to-End Speaker Diarization as Post-ProcessingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Shota Horiguchi
Leibny Paola García-Perera
Yusuke Fujita
Shinji Watanabe
Kenji Nagamatsu
231
45
0
18 Dec 2020
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge
Arsha Nagrani
Joon Son Chung
Jaesung Huh
Andrew Brown
Ernesto Coto
Weidi Xie
Mitchell McLaren
D. Reynolds
Andrew Zisserman
162
76
0
12 Dec 2020
One Shot Learning for Speech Separation
One Shot Learning for Speech SeparationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yuan-Kuei Wu
Kuan-Po Huang
Yu Tsao
Hung-yi Lee
VLM
163
8
0
20 Nov 2020
Towards Semi-Supervised Semantics Understanding from Speech
Towards Semi-Supervised Semantics Understanding from Speech
Cheng-I Jeff Lai
Jin Cao
S. Bodapati
Shang-Wen Li
SSL
185
7
0
11 Nov 2020
Previous
123...1011121314
Next