MUSAN: A Music, Speech, and Noise Corpus

28 October 2015

Papers citing "MUSAN: A Music, Speech, and Noise Corpus"

50 / 664 papers shown

LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition ChallengeInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2022

183

14 Oct 2022

Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score Fusion

Zhuo Li

Pengyuan Zhang

165

13 Oct 2022

THUEE system description for NIST 2020 SRE CTS challenge

Xinyue Ma

Minqiang Xu

125

12 Oct 2022

Cross-dataset COVID-19 Transfer Learning with Cough Detection, Cough Segmentation, and Data Augmentation

164

12 Oct 2022

The DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022

Na Li

144

11 Oct 2022

Mutual Learning of Single- and Multi-Channel End-to-End Neural DiarizationSpoken Language Technology Workshop (SLT), 2022

Shota Horiguchi

Yuki Takashima

Shinji Watanabe

Leibny Paola García-Perera

240

07 Oct 2022

WakeUpNet: A Mobile-Transformer based Framework for End-to-End Streaming Voice Trigger

182

06 Oct 2022

CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representationsSpoken Language Technology Workshop (SLT), 2022

328

05 Oct 2022

Deepfake audio detection by speaker verificationInternational Workshop on Information Forensics and Security (WIFS), 2022

241

28 Sep 2022

Joint Speech Activity and Overlap Detection with Multi-Exit ArchitectureAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022

250

24 Sep 2022

The SpeakIn Speaker Verification System for Far-Field Speaker Verification Challenge 2022

Yihao Chen

Minqiang Xu

171

23 Sep 2022

The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022

Haizhou Li

243

23 Sep 2022

UniKW-AT: Unified Keyword Spotting and Audio TaggingInterspeech (Interspeech), 2022

Yujun Wang

237

23 Sep 2022

The SpeakIn System Description for CNSRC2022

Yihao Chen

Minqiang Xu

132

22 Sep 2022

The ReturnZero System for VoxCeleb Speaker Recognition Challenge 2022

Sangwon Suh

Sunjong Park

146

21 Sep 2022

The BUCEA Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2022

R. Zhou

Yu Du

Che-Ming Hu

117

20 Sep 2022

SJTU-AISPEECH System for VoxCeleb Speaker Recognition Challenge 2022

177

19 Sep 2022

The Royalflush System for VoxCeleb Speaker Recognition Challenge 2022

Jingguang Tian

Xinhui Hu

Xinkang Xu

237

19 Sep 2022

Learning Audio-Visual embedding for Person Verification in the Wild

185

09 Sep 2022

Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment UtterancesComputer Speech and Language (CSL), 2022

Chang Zeng

Xiaoxiao Miao

Xin Wang

Erica Cooper

Junichi Yamagishi

108

01 Sep 2022

Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural DiarizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

376

27 Aug 2022

Disentangled Speaker Representation Learning via Mutual Information MinimizationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022

319

17 Aug 2022

C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker VerificationIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

Chunlei Zhang

Dong Yu

196

15 Aug 2022

LCSM: A Lightweight Complex Spectral Mapping Framework for Stereophonic Acoustic Echo CancellationInterspeech (Interspeech), 2022

Chen Zhang

Jinjiang Liu

Xueliang Zhang

15 Aug 2022

FRA-RIR: Fast Random Approximation of the Image-source MethodInterspeech (Interspeech), 2022

Yi Luo

Jianwei Yu

117

08 Aug 2022

Robust Acoustic Domain Identification with its Application to Speaker DiarizationInternational Journal of Speech Technology (IJST), 2022

185

05 Aug 2022

Attention and DCT based Global Context Modeling for Text-independent Speaker RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Wei Xia

John H. L. Hansen

164

04 Aug 2022

The SJTU System for Short-duration Speaker Verification Challenge 2021Interspeech (Interspeech), 2021

03 Aug 2022

Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label CorrectionInterspeech (Interspeech), 2022

Bing Han

Zhengyang Chen

Y. Qian

113

03 Aug 2022

Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge

A. I. S. Ferreira

Gustavo dos Reis Oliveira

171

29 Jul 2022

Utterance-by-utterance overlap-aware neural diarization with Graph-PITInterspeech (Interspeech), 2022

156

28 Jul 2022

Deep Learning-Based Acoustic Mosquito Detection in Noisy Conditions Using Trainable Kernels and AugmentationsACM Multimedia (ACM MM), 2022

28 Jul 2022

Inference skipping for more efficient real-time speech enhancement with parallel RNNsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

310

22 Jul 2022

The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification ChallengeInterspeech (Interspeech), 2022

226

15 Jul 2022

u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled ModalityNeural Information Processing Systems (NeurIPS), 2022

Wei-Ning Hsu

Bowen Shi

SSL VLM

319

14 Jul 2022

Cross-Age Speaker Verification: Learning Age-Invariant Speaker EmbeddingsInterspeech (Interspeech), 2022

Na Li

179

13 Jul 2022

Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource DevicesPattern Recognition Letters (PRL), 2022

Harlin Lee

Aaqib Saeed

271

12 Jul 2022

Label-Efficient Self-Supervised Speaker Verification With Information Maximization and Contrastive LearningInterspeech (Interspeech), 2022

Théo Lepage

Réda Dehak

SSL

207

12 Jul 2022

pMCT: Patched Multi-Condition Training for Robust Speech RecognitionInterspeech (Interspeech), 2022

Pablo Peso Parada

A. Dobrowolska

Karthikeyan P. Saravanan

Mete Ozay

240

11 Jul 2022

Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation LearningInterspeech (Interspeech), 2022

Mufan Sang

John H. L. Hansen

152

10 Jul 2022

Low-resource Low-footprint Wake-word Detection using Knowledge DistillationInterspeech (Interspeech), 2022

106

06 Jul 2022

The THUEE System Description for the IARPA OpenASR21 ChallengeInterspeech (Interspeech), 2022

110

29 Jun 2022

Speaker Verification in Multi-Speaker Environments Using Temporal Feature FusionEuropean Signal Processing Conference (EUSIPCO), 2022

121

28 Jun 2022

Wav2Vec-Aug: Improved self-supervised training with limited dataInterspeech (Interspeech), 2022

175

27 Jun 2022

Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fireIEEE Signal Processing Letters (SPL), 2022

Linhao Dong

114

27 Jun 2022

Extended U-Net for Speaker Verification in Noisy EnvironmentsInterspeech (Interspeech), 2022

Ju-ho Kim

Ju-Sung Heo

Hye-jin Shim

Ha-Jin Yu

112

27 Jun 2022

The SJTU X-LANCE Lab System for CNSRC 2022

275

23 Jun 2022

Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification SystemsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Danwei Cai

Zexin Cai

Ming Li

228

18 Jun 2022

The Influence of Dataset Partitioning on Dysfluency Detection SystemsInternational Conference on Text, Speech and Dialogue (TSD), 2022

182

07 Jun 2022

AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition SystemsIEEE Transactions on Dependable and Secure Computing (TDSC), 2022

Lingling Fan

178

07 Jun 2022