MUSAN: A Music, Speech, and Noise Corpus

28 October 2015

Papers citing "MUSAN: A Music, Speech, and Noise Corpus"

50 / 664 papers shown

Self-Supervised Learning from Contrastive Mixtures for Personalized Speech Enhancement

Aswin Sivaraman

Minje Kim

SSL

189

06 Nov 2020

Multi-class Spectral Clustering with Overlaps for Speaker Diarization

Desh Raj

Zili Huang

Sanjeev Khudanpur

168

05 Nov 2020

BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers

Eunjung Han

Chul Lee

A. Stolcke

182

05 Nov 2020

Small footprint Text-Independent Speaker Verification for Embedded Systems

112

03 Nov 2020

Adapting Pretrained Transformer to Lattices for Spoken Language UnderstandingAutomatic Speech Recognition & Understanding (ASRU), 2019

Chao-Wei Huang

Yun-Nung Chen

108

02 Nov 2020

The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020

Xu Xiang

124

31 Oct 2020

Deep Speaker Vector Normalization with Maximum Gaussianality Training

Yunqi Cai

Lantian Li

Dong Wang

Andrew Abel

191

30 Oct 2020

The ins and outs of speaker recognition: lessons from VoxSRC 2020IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Yoohwan Kwon

Hee-Soo Heo

Bong-Jin Lee

Joon Son Chung

187

29 Oct 2020

CopyPaste: An Augmentation Method for Speech Emotion RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

R. Pappagari

Jesús Villalba

Piotr Żelasko

Laureano Moro-Velazquez

Najim Dehak

213

27 Oct 2020

Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation LearningInterspeech (Interspeech), 2020

Dongwei Jiang

Wubo Li

Miao Cao

Wei Zou

Xiangang Li

SSL

295

27 Oct 2020

Integrating end-to-end neural and clustering-based diarization: Getting the best of both worldsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

K. Kinoshita

Marc Delcroix

Naohiro Tawara

236

102

26 Oct 2020

An iterative framework for self-supervised deep speaker representation learningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

120

25 Oct 2020

The IDLAB VoxCeleb Speaker Recognition Challenge 2020 System Description

Jenthe Thienpondt

Brecht Desplanques

Kris Demuynck

141

23 Oct 2020

Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers

Zeqian Li

Jacob Whitehill

231

22 Oct 2020

The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge

22 Oct 2020

Momentum Contrast Speaker Representation Learning

109

22 Oct 2020

Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning

150

22 Oct 2020

The IDLAB VoxSRC-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification

Jenthe Thienpondt

Brecht Desplanques

Kris Demuynck

181

104

21 Oct 2020

Joint Blind Room Acoustic Characterization From Speech And Music Signals Using Convolutional Recurrent Neural Networks

Paul Callens

Milos Cernak

124

21 Oct 2020

Contrastive Learning of General-Purpose Audio Representations

237

308

21 Oct 2020

Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020

20 Oct 2020

Tongji University Team for the VoxCeleb Speaker Recognition Challenge 2020

Rui Wang

Zhihua Wei

Yibin Zhan

Zhuoxiao Chen

16 Oct 2020

HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation

Rohan Kumar Das

Haizhou Li

120

08 Oct 2020

A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse EnvironmentsIEEE Access (IEEE Access), 2020

143

06 Oct 2020

Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020

Hee-Soo Heo

Bong-Jin Lee

Jaesung Huh

Joon Son Chung

115

147

29 Sep 2020

Residual acoustic echo suppression based on efficient multi-task convolutional neural network

Xinquan Zhou

Yanhong Leng

29 Sep 2020

Howl: A Deployed, Open-Source Wake Word Detection System

128

21 Aug 2020

S-vectors and TESA: Speaker Embeddings and a Speaker Authenticator Based on Transformer EncoderIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020

Narla John Metilda Sagaya Mary

S. Umesh

Sandesh V Katta

145

11 Aug 2020

Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verificationInterspeech (Interspeech), 2020

111

08 Aug 2020

Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker VerificationInterspeech (Interspeech), 2020

112

08 Aug 2020

NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification ChallengeInterspeech (Interspeech), 2020

Li Zhang

Jian Wu

Lei Xie

246

08 Aug 2020

Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning

Jing-Xuan Zhang

Zhenhua Ling

Lirong Dai

169

05 Aug 2020

Unacceptable, where is my privacy? Exploring Accidental Triggers of Smart Speakers

115

02 Aug 2020

DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech EnhancementInterspeech (Interspeech), 2020

Jian Wu

Lei Xie

485

717

01 Aug 2020

Designing Neural Speaker Embeddings with Meta Learning

Manoj Kumar

Tae Jin Park

Somer Bishop

Shrikanth Narayanan

207

31 Jul 2020

A Comparative Re-Assessment of Feature Extractors for Deep Speaker EmbeddingsInterspeech (Interspeech), 2020

Xuechen Liu

Md. Sahidullah

Tomi Kinnunen

30 Jul 2020

Multimodal Integration for Large-Vocabulary Audio-Visual Speech RecognitionEuropean Signal Processing Conference (EUSIPCO), 2020

Wentao Yu

Steffen Zeiler

D. Kolossa

171

28 Jul 2020

Augmentation adversarial training for self-supervised speaker recognition

Joon Son Chung

211

23 Jul 2020

Cross-Lingual Speaker Verification with Domain-Balanced Hard Prototype Mining and Language-Dependent Score NormalizationInterspeech (Interspeech), 2020

Jenthe Thienpondt

Brecht Desplanques

Kris Demuynck

121

15 Jul 2020

Data Augmenting Contrastive Learning of Speech Representations in the Time Domain

Pierre-Emmanuel Mazaré

Matthijs Douze

Emmanuel Dupoux

211

123

02 Jul 2020

Data augmentation versus noise compensation for x- vector speaker recognition systems in noisy environments

Mohammad MohammadAmini

D. Matrouf

113

29 Jun 2020

A study on more realistic room simulation for far-field keyword spotting

Eric Bezzam

Robin Scheibler

C. Cadoux

Thibault Gisselbrecht

170

04 Jun 2020

Online End-to-End Neural Diarization with Speaker-Tracing BufferSpoken Language Technology Workshop (SLT), 2020

196

04 Jun 2020

Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data

172

03 Jun 2020

Inaudible Adversarial Perturbations for Targeted Attack in Speaker Recognition

Qing Wang

Pengcheng Guo

Lei Xie

AAML

192

21 May 2020

SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning

115

20 May 2020

End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors

312

218

20 May 2020

Wake Word Detection with Alignment-Free Lattice-Free MMI

Yiming Wang

Hang Lv

Daniel Povey

Lei Xie

Sanjeev Khudanpur

ObjD

215

17 May 2020

Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild

P. S. Nidadavolu

Saurabh Kataria

Leibny Paola García-Perera

Jesús Villalba

Najim Dehak

115

17 May 2020

Sparse Mixture of Local Experts for Efficient Speech Enhancement

Aswin Sivaraman

Minje Kim

MoE

118

16 May 2020