ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.08484
  4. Cited By
MUSAN: A Music, Speech, and Noise Corpus

MUSAN: A Music, Speech, and Noise Corpus

28 October 2015
David Snyder
Guoguo Chen
Daniel Povey
ArXiv (abs)PDFHTML

Papers citing "MUSAN: A Music, Speech, and Noise Corpus"

50 / 664 papers shown
LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition ChallengeInternational Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Yan Jia
Mihee Hong
Jingyu Hou
Kailong Ren
Sifan Ma
Jin Wang
Fangzhen Peng
Yinglin Ji
Lin Yang
Junjie Wang
183
1
0
14 Oct 2022
Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score
  Fusion
Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score Fusion
Yuxiang Zhang
Jingze Lu
Xingming Wang
Zhuo Li
Runqiu Xiao
Wenchao Wang
Ming Li
Pengyuan Zhang
165
6
0
13 Oct 2022
THUEE system description for NIST 2020 SRE CTS challenge
THUEE system description for NIST 2020 SRE CTS challenge
Yu Zheng
Jinghan Peng
Miao Zhao
Yufeng Ma
Min Liu
Xinyue Ma
Tianyu Liang
Tianlong Kong
Liang He
Minqiang Xu
125
1
0
12 Oct 2022
Cross-dataset COVID-19 Transfer Learning with Cough Detection, Cough
  Segmentation, and Data Augmentation
Cross-dataset COVID-19 Transfer Learning with Cough Detection, Cough Segmentation, and Data Augmentation
Bagus Tris Atmaja
Zanjabila
Suyanto
A. Sasou
164
2
0
12 Oct 2022
The DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge
  2022
The DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022
Xiaoyi Qin
Na Li
Yuke Lin
Yiwei Ding
Chao Weng
Jane Polak Scowcroft
Ming Li
144
12
0
11 Oct 2022
Mutual Learning of Single- and Multi-Channel End-to-End Neural
  Diarization
Mutual Learning of Single- and Multi-Channel End-to-End Neural DiarizationSpoken Language Technology Workshop (SLT), 2022
Shota Horiguchi
Yuki Takashima
Shinji Watanabe
Leibny Paola García-Perera
240
2
0
07 Oct 2022
WakeUpNet: A Mobile-Transformer based Framework for End-to-End Streaming
  Voice Trigger
WakeUpNet: A Mobile-Transformer based Framework for End-to-End Streaming Voice Trigger
Zixing Zhang
Thorin Farnsworth
Senling Lin
S. Karout
182
2
0
06 Oct 2022
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised
  learning of speech representations
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representationsSpoken Language Technology Workshop (SLT), 2022
Vasista Sai Lodagala
Sreyan Ghosh
S. Umesh
SSL
328
24
0
05 Oct 2022
Deepfake audio detection by speaker verification
Deepfake audio detection by speaker verificationInternational Workshop on Information Forensics and Security (WIFS), 2022
Alessandro Pianese
D. Cozzolino
Giovanni Poggi
L. Verdoliva
241
52
0
28 Sep 2022
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture
Joint Speech Activity and Overlap Detection with Multi-Exit ArchitectureAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022
Ziqing Du
Kai Liu
Xucheng Wan
Huan Zhou
250
1
0
24 Sep 2022
The SpeakIn Speaker Verification System for Far-Field Speaker
  Verification Challenge 2022
The SpeakIn Speaker Verification System for Far-Field Speaker Verification Challenge 2022
Yu Zheng
Jinghan Peng
Yihao Chen
Yajun Zhang
Jialong Wang
Min Liu
Minqiang Xu
171
9
0
23 Sep 2022
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge
  2022
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022
Qutang Cai
Guoqiang Hong
Zhijian Ye
Ximin Li
Haizhou Li
243
8
0
23 Sep 2022
UniKW-AT: Unified Keyword Spotting and Audio Tagging
UniKW-AT: Unified Keyword Spotting and Audio TaggingInterspeech (Interspeech), 2022
Heinrich Dinkel
Yongqing Wang
Zhiyong Yan
Junbo Zhang
Yujun Wang
237
3
0
23 Sep 2022
The SpeakIn System Description for CNSRC2022
The SpeakIn System Description for CNSRC2022
Yu Zheng
Yihao Chen
Jinghan Peng
Yajun Zhang
Min Liu
Minqiang Xu
132
3
0
22 Sep 2022
The ReturnZero System for VoxCeleb Speaker Recognition Challenge 2022
The ReturnZero System for VoxCeleb Speaker Recognition Challenge 2022
Sangwon Suh
Sunjong Park
146
2
0
21 Sep 2022
The BUCEA Speaker Diarization System for the VoxCeleb Speaker
  Recognition Challenge 2022
The BUCEA Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2022
R. Zhou
Yu Du
Che-Ming Hu
117
0
0
20 Sep 2022
SJTU-AISPEECH System for VoxCeleb Speaker Recognition Challenge 2022
SJTU-AISPEECH System for VoxCeleb Speaker Recognition Challenge 2022
Zhengyang Chen
Bing Han
Xu Xiang
Houjun Huang
Bei Liu
Y. Qian
177
12
0
19 Sep 2022
The Royalflush System for VoxCeleb Speaker Recognition Challenge 2022
The Royalflush System for VoxCeleb Speaker Recognition Challenge 2022
Jingguang Tian
Xinhui Hu
Xinkang Xu
237
1
0
19 Sep 2022
Learning Audio-Visual embedding for Person Verification in the Wild
Learning Audio-Visual embedding for Person Verification in the Wild
Peiwen Sun
Shanshan Zhang
Zishan Liu
Yougen Yuan
Tao Zhang
Honggang Zhang
Pengfei Hu
185
4
0
09 Sep 2022
Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End
  Automatic Speaker Verification with Multiple Enrollment Utterances
Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment UtterancesComputer Speech and Language (CSL), 2022
Chang Zeng
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
108
7
0
01 Sep 2022
Target Speaker Voice Activity Detection with Transformers and Its
  Integration with End-to-End Neural Diarization
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural DiarizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Dongmei Wang
Xiong Xiao
Naoyuki Kanda
Takuya Yoshioka
Jian Wu
376
33
0
27 Aug 2022
Disentangled Speaker Representation Learning via Mutual Information
  Minimization
Disentangled Speaker Representation Learning via Mutual Information MinimizationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022
Sung Hwan Mun
Mingrui Han
Minchan Kim
Dongjune Lee
N. Kim
DRL
319
13
0
17 Aug 2022
C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning
  for Speaker Verification
C3-DINO: Joint Contrastive and Non-contrastive Self-Supervised Learning for Speaker VerificationIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Chunlei Zhang
Dong Yu
196
22
0
15 Aug 2022
LCSM: A Lightweight Complex Spectral Mapping Framework for Stereophonic
  Acoustic Echo Cancellation
LCSM: A Lightweight Complex Spectral Mapping Framework for Stereophonic Acoustic Echo CancellationInterspeech (Interspeech), 2022
Chen Zhang
Jinjiang Liu
Xueliang Zhang
93
10
0
15 Aug 2022
FRA-RIR: Fast Random Approximation of the Image-source Method
FRA-RIR: Fast Random Approximation of the Image-source MethodInterspeech (Interspeech), 2022
Yi Luo
Jianwei Yu
117
9
0
08 Aug 2022
Robust Acoustic Domain Identification with its Application to Speaker
  Diarization
Robust Acoustic Domain Identification with its Application to Speaker DiarizationInternational Journal of Speech Technology (IJST), 2022
Kishore Kumar A
Shefali Waldekar
Md. Sahidullah
G. Saha
185
0
0
05 Aug 2022
Attention and DCT based Global Context Modeling for Text-independent
  Speaker Recognition
Attention and DCT based Global Context Modeling for Text-independent Speaker RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Wei Xia
John H. L. Hansen
164
7
0
04 Aug 2022
The SJTU System for Short-duration Speaker Verification Challenge 2021
The SJTU System for Short-duration Speaker Verification Challenge 2021Interspeech (Interspeech), 2021
Bing Han
Zhengyang Chen
Zhikai Zhou
Y. Qian
75
9
0
03 Aug 2022
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label
  Correction
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label CorrectionInterspeech (Interspeech), 2022
Bing Han
Zhengyang Chen
Y. Qian
113
43
0
03 Aug 2022
Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
A. I. S. Ferreira
Gustavo dos Reis Oliveira
171
3
0
29 Jul 2022
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT
Utterance-by-utterance overlap-aware neural diarization with Graph-PITInterspeech (Interspeech), 2022
K. Kinoshita
Thilo von Neumann
Marc Delcroix
Christoph Boeddeker
Reinhold Haeb-Umbach
156
4
0
28 Jul 2022
Deep Learning-Based Acoustic Mosquito Detection in Noisy Conditions
  Using Trainable Kernels and Augmentations
Deep Learning-Based Acoustic Mosquito Detection in Noisy Conditions Using Trainable Kernels and AugmentationsACM Multimedia (ACM MM), 2022
Devesh Khandelwal
Sean Campos
Shwetha Nagaraj
F. Nugen
Alberto Todeschini
93
0
0
28 Jul 2022
Inference skipping for more efficient real-time speech enhancement with
  parallel RNNs
Inference skipping for more efficient real-time speech enhancement with parallel RNNsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Xiaohuai Le
Tong Lei
Kai-Jyun Chen
Jing Lu
310
26
0
22 Jul 2022
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification
  Challenge
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification ChallengeInterspeech (Interspeech), 2022
Xingming Wang
Xiaoyi Qin
Yikang Wang
Yunfei Xu
Ming Li
226
17
0
15 Jul 2022
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer
  to Unlabeled Modality
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled ModalityNeural Information Processing Systems (NeurIPS), 2022
Wei-Ning Hsu
Bowen Shi
SSLVLM
319
52
0
14 Jul 2022
Cross-Age Speaker Verification: Learning Age-Invariant Speaker
  Embeddings
Cross-Age Speaker Verification: Learning Age-Invariant Speaker EmbeddingsInterspeech (Interspeech), 2022
Xiaoyi Qin
Na Li
Chao Weng
Jane Polak Scowcroft
Ming Li
179
22
0
13 Jul 2022
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for
  Low-Resource Devices
Distilled Non-Semantic Speech Embeddings with Binary Neural Networks for Low-Resource DevicesPattern Recognition Letters (PRL), 2022
Harlin Lee
Aaqib Saeed
271
2
0
12 Jul 2022
Label-Efficient Self-Supervised Speaker Verification With Information
  Maximization and Contrastive Learning
Label-Efficient Self-Supervised Speaker Verification With Information Maximization and Contrastive LearningInterspeech (Interspeech), 2022
Théo Lepage
Réda Dehak
SSL
207
15
0
12 Jul 2022
pMCT: Patched Multi-Condition Training for Robust Speech Recognition
pMCT: Patched Multi-Condition Training for Robust Speech RecognitionInterspeech (Interspeech), 2022
Pablo Peso Parada
A. Dobrowolska
Karthikeyan P. Saravanan
Mete Ozay
240
11
0
11 Jul 2022
Multi-Frequency Information Enhanced Channel Attention Module for
  Speaker Representation Learning
Multi-Frequency Information Enhanced Channel Attention Module for Speaker Representation LearningInterspeech (Interspeech), 2022
Mufan Sang
John H. L. Hansen
152
17
0
10 Jul 2022
Low-resource Low-footprint Wake-word Detection using Knowledge
  Distillation
Low-resource Low-footprint Wake-word Detection using Knowledge DistillationInterspeech (Interspeech), 2022
Arindam Ghosh
Mark C. Fuhs
Deblin Bagchi
Bahman Farahani
Monika Woszczyna
VLM
106
5
0
06 Jul 2022
The THUEE System Description for the IARPA OpenASR21 Challenge
The THUEE System Description for the IARPA OpenASR21 ChallengeInterspeech (Interspeech), 2022
Jing Zhao
Haoyu Wang
Jinpeng Li
Shuzhou Chai
Guan-Bo Wang
Guoguo Chen
Weiqiang Zhang
VLM
110
1
0
29 Jun 2022
Speaker Verification in Multi-Speaker Environments Using Temporal
  Feature Fusion
Speaker Verification in Multi-Speaker Environments Using Temporal Feature FusionEuropean Signal Processing Conference (EUSIPCO), 2022
Ahmad Aloradi
Wolfgang Mack
Mohamed Elminshawi
Emanuel Habets
121
6
0
28 Jun 2022
Wav2Vec-Aug: Improved self-supervised training with limited data
Wav2Vec-Aug: Improved self-supervised training with limited dataInterspeech (Interspeech), 2022
Anuroop Sriram
Michael Auli
Alexei Baevski
SSLVLM
175
16
0
27 Jun 2022
Sequence-level Speaker Change Detection with Difference-based Continuous
  Integrate-and-fire
Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fireIEEE Signal Processing Letters (SPL), 2022
Zhiyun Fan
Linhao Dong
Meng Cai
Zejun Ma
Bo Xu
114
5
0
27 Jun 2022
Extended U-Net for Speaker Verification in Noisy Environments
Extended U-Net for Speaker Verification in Noisy EnvironmentsInterspeech (Interspeech), 2022
Ju-ho Kim
Ju-Sung Heo
Hye-jin Shim
Ha-Jin Yu
112
23
0
27 Jun 2022
The SJTU X-LANCE Lab System for CNSRC 2022
The SJTU X-LANCE Lab System for CNSRC 2022
Zhengyang Chen
Bei Liu
Bing Han
Leying Zhang
Y. Qian
275
20
0
23 Jun 2022
Identifying Source Speakers for Voice Conversion based Spoofing Attacks
  on Speaker Verification Systems
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification SystemsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Danwei Cai
Zexin Cai
Ming Li
228
14
0
18 Jun 2022
The Influence of Dataset Partitioning on Dysfluency Detection Systems
The Influence of Dataset Partitioning on Dysfluency Detection SystemsInternational Conference on Text, Speech and Dialogue (TSD), 2022
Sebastian P. Bayerl
Dominik Wagner
Elmar Nöth
Tobias Bocklet
Korbinian Riedhammer
182
25
0
07 Jun 2022
AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker
  Recognition Systems
AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition SystemsIEEE Transactions on Dependable and Secure Computing (TDSC), 2022
Guangke Chen
Zhe Zhao
Fu Song
Sen Chen
Lingling Fan
Yang Liu
AAML
178
22
0
07 Jun 2022
Previous
123...789...121314
Next
Page 8 of 14
Pageof 14