1510.08484
Cited By
MUSAN: A Music, Speech, and Noise Corpus
28 October 2015
David Snyder
Guoguo Chen
Daniel Povey
Papers citing
"MUSAN: A Music, Speech, and Noise Corpus"
50 / 664 papers shown
Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement
Aswin Sivaraman
Minje Kim
SSL
189
11
0
06 Nov 2020
Multi-class Spectral Clustering with Overlaps for Speaker Diarization
Desh Raj
Zili Huang
Sanjeev Khudanpur
168
37
0
05 Nov 2020
BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers
Eunjung Han
Chul Lee
A. Stolcke
182
46
0
05 Nov 2020
Small footprint Text-Independent Speaker Verification for Embedded Systems
Julien Balian
Raffaele Tavarone
Mathieu Poumeyrol
A. Coucke
112
14
0
03 Nov 2020
Adapting Pretrained Transformer to Lattices for Spoken Language Understanding
Automatic Speech Recognition & Understanding (ASRU), 2019
Chao-Wei Huang
Yun-Nung Chen
108
38
0
02 Nov 2020
The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020
Xu Xiang
124
15
0
31 Oct 2020
Deep Speaker Vector Normalization with Maximum Gaussianality Training
Yunqi Cai
Lantian Li
Dong Wang
Andrew Abel
191
6
0
30 Oct 2020
The ins and outs of speaker recognition: lessons from VoxSRC 2020
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yoohwan Kwon
Hee-Soo Heo
Bong-Jin Lee
Joon Son Chung
187
67
0
29 Oct 2020
CopyPaste: An Augmentation Method for Speech Emotion Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
R. Pappagari
Jesús Villalba
Piotr Żelasko
Laureano Moro-Velazquez
Najim Dehak
213
49
0
27 Oct 2020
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning
Interspeech (Interspeech), 2020
Dongwei Jiang
Wubo Li
Miao Cao
Wei Zou
Xiangang Li
SSL
295
72
0
27 Oct 2020
Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
K. Kinoshita
Marc Delcroix
Naohiro Tawara
236
102
0
26 Oct 2020
An iterative framework for self-supervised deep speaker representation learning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Danwei Cai
Weiqing Wang
Ming Li
SSL
120
43
0
25 Oct 2020
The IDLAB VoxCeleb Speaker Recognition Challenge 2020 System Description
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
141
52
0
23 Oct 2020
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers
Zeqian Li
Jacob Whitehill
231
13
0
22 Oct 2020
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge
Renyu Wang
Ruilin Tong
Y. Yeung
Xiao Chen
97
1
0
22 Oct 2020
Momentum Contrast Speaker Representation Learning
Jangho Lee
Jaihyun Koh
Sungroh Yoon
SSL
109
3
0
22 Oct 2020
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning
Sung Hwan Mun
Woohyun Kang
Min Hyun Han
N. Kim
SSL
150
23
0
22 Oct 2020
The IDLAB VoxSRC-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
181
104
0
21 Oct 2020
Joint Blind Room Acoustic Characterization From Speech And Music Signals Using Convolutional Recurrent Neural Networks
Paul Callens
Milos Cernak
124
11
0
21 Oct 2020
Contrastive Learning of General-Purpose Audio Representations
Aaqib Saeed
David Grangier
Neil Zeghidour
VLM
SSL
237
308
0
21 Oct 2020
Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020
Shufan Shen
Ran Miao
Yi Wang
Zhihua Wei
99
0
0
20 Oct 2020
Tongji University Team for the VoxCeleb Speaker Recognition Challenge 2020
Rui Wang
Zhihua Wei
Yibin Zhan
Zhuoxiao Chen
73
0
0
16 Oct 2020
HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation
Rohan Kumar Das
Ruijie Tao
Jichen Yang
Wei Rao
Cheng Yu
Haizhou Li
120
11
0
08 Oct 2020
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
IEEE Access (IEEE Access), 2020
Youngmoon Jung
Yeunju Choi
Hyungjun Lim
Hoirin Kim
143
13
0
06 Oct 2020
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020
Hee-Soo Heo
Bong-Jin Lee
Jaesung Huh
Joon Son Chung
115
147
0
29 Sep 2020
Residual acoustic echo suppression based on efficient multi-task convolutional neural network
Xinquan Zhou
Yanhong Leng
85
9
0
29 Sep 2020
Howl: A Deployed, Open-Source Wake Word Detection System
Raphael Tang
Jaejun Lee
Afsaneh Razi
Julia Cambre
Ian Bicking
Jofish Kaye
Jimmy J. Lin
VLM
128
17
0
21 Aug 2020
S-vectors and TESA: Speaker Embeddings and a Speaker Authenticator Based on Transformer Encoder
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020
Narla John Metilda Sagaya Mary
S. Umesh
Sandesh V Katta
145
33
0
11 Aug 2020
Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification
Interspeech (Interspeech), 2020
Amber Afshan
Jinxi Guo
S. Park
Vijay Ravi
A. McCree
Abeer Alwan
111
6
0
08 Aug 2020
Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification
Interspeech (Interspeech), 2020
Vijay Ravi
Ruchao Fan
Amber Afshan
Huanhua Lu
Abeer Alwan
112
9
0
08 Aug 2020
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge
Interspeech (Interspeech), 2020
Li Zhang
Jian Wu
Lei Xie
246
12
0
08 Aug 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
Jing-Xuan Zhang
Zhenhua Ling
Lirong Dai
169
6
0
05 Aug 2020
Unacceptable, where is my privacy? Exploring Accidental Triggers of Smart Speakers
Lea Schonherr
Maximilian Golla
Thorsten Eisenhofer
Jan Wiele
D. Kolossa
Thorsten Holz
115
42
0
02 Aug 2020
DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement
Interspeech (Interspeech), 2020
Yanxin Hu
Yun Liu
Shubo Lv
Mengtao Xing
Shimin Zhang
Yihui Fu
Jian Wu
Bihong Zhang
Lei Xie
485
717
0
01 Aug 2020
Designing Neural Speaker Embeddings with Meta Learning
Manoj Kumar
Tae Jin Park
Somer Bishop
Shrikanth Narayanan
207
10
0
31 Jul 2020
A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings
Interspeech (Interspeech), 2020
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
91
10
0
30 Jul 2020
Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition
European Signal Processing Conference (EUSIPCO), 2020
Wentao Yu
Steffen Zeiler
D. Kolossa
171
11
0
28 Jul 2020
Augmentation adversarial training for self-supervised speaker recognition
Jaesung Huh
Hee-Soo Heo
Jingu Kang
Shinji Watanabe
Joon Son Chung
SSL
211
78
0
23 Jul 2020
Cross-Lingual Speaker Verification with Domain-Balanced Hard Prototype Mining and Language-Dependent Score Normalization
Interspeech (Interspeech), 2020
Jenthe Thienpondt
Brecht Desplanques
Kris Demuynck
121
26
0
15 Jul 2020
Data Augmenting Contrastive Learning of Speech Representations in the Time Domain
Eugene Kharitonov
M. Rivière
Gabriel Synnaeve
Lior Wolf
Pierre-Emmanuel Mazaré
Matthijs Douze
Emmanuel Dupoux
211
123
0
02 Jul 2020
Data augmentation versus noise compensation for x-vector speaker recognition systems in noisy environments
Mohammad MohammadAmini
D. Matrouf
113
15
0
29 Jun 2020
A study on more realistic room simulation for far-field keyword spotting
Eric Bezzam
Robin Scheibler
C. Cadoux
Thibault Gisselbrecht
170
12
0
04 Jun 2020
Online End-to-End Neural Diarization with Speaker-Tracing Buffer
Spoken Language Technology Workshop (SLT), 2020
Yawen Xue
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Kenji Nagamatsu
196
52
0
04 Jun 2020
Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data
Mael Fabien
Seyyed Saeed Sarfjoo
P. Motlícek
S. Madikeri
172
3
0
03 Jun 2020
Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition
Qing Wang
Pengcheng Guo
Lei Xie
AAML
192
62
0
21 May 2020
SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning
Yuan-Kuei Wu
Chao-I Tuan
Hung-yi Lee
Yu Tsao
115
4
0
20 May 2020
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
Shota Horiguchi
Yusuke Fujita
Shinji Watanabe
Yawen Xue
Kenji Nagamatsu
312
218
0
20 May 2020
Wake Word Detection with Alignment-Free Lattice-Free MMI
Yiming Wang
Hang Lv
Daniel Povey
Lei Xie
Sanjeev Khudanpur
ObjD
215
17
0
17 May 2020
Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild
P. S. Nidadavolu
Saurabh Kataria
Leibny Paola García-Perera
Jesús Villalba
Najim Dehak
115
3
0
17 May 2020
Sparse Mixture of Local Experts for Efficient Speech Enhancement
Aswin Sivaraman
Minje Kim
MoE
118
14
0
16 May 2020