MUSAN: A Music, Speech, and Noise Corpus

28 October 2015
David Snyder
Guoguo Chen
Daniel Povey

Papers citing "MUSAN: A Music, Speech, and Noise Corpus"

50 / 664 papers shown
Yet Another Model for Arabic Dialect Identification
Ajinkya Kulkarni
Hanan Aldarmaki
151
6
0
20 Oct 2023
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System
T. Park
He Huang
Ante Jukić
Kunal Dhawan
Krishna C. Puvvada
Nithin Rao Koluguri
Nikolay Karpov
A. Laptev
Jagadeesh Balam
Boris Ginsburg
202
11
0
18 Oct 2023
End-to-end Online Speaker Diarization with Target Speaker Tracking
Weiqing Wang
Ming Li
315
7
0
12 Oct 2023
LRPD: Large Replay Parallel Dataset
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Ivan Yakovlev
Mikhail P. Melnikov
Nikita Bukhal
Rostislav Makarov
Alexander Alenin
Juncheng Billy Li
A. Okhotnikov
265
2
0
29 Sep 2023
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS
Xin Wang
Taein Kwon
Wei-Ning Hsu
Yossi Adi
Tu Nguyen
D. Bohus
Emmanuel Dupoux
Neel Joshi
Abdelrahman Mohamed
173
5
0
29 Sep 2023
Audio-Visual Speaker Verification via Joint Cross-Attention
International Conference on Speech and Computer (SPECOM), 2023
R Gnana Praveen
Jahangir Alam
277
10
0
28 Sep 2023
Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Thilo von Neumann
Christoph Boeddeker
Tobias Cord-Landwehr
Marc Delcroix
Reinhold Haeb-Umbach
285
13
0
28 Sep 2023
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ziqian Ning
Yuepeng Jiang
Pengcheng Zhu
Shuai Wang
Jixun Yao
Linfu Xie
Mengxiao Bi
294
8
0
27 Sep 2023
Collaborative Watermarking for Adversarial Speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Lauri Juvela
Xin Wang
230
19
0
26 Sep 2023
Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Duc-Tuan Truong
Ruijie Tao
J. Yip
Kong Aik Lee
Chng Eng Siong
187
12
0
26 Sep 2023
Rethinking Session Variability: Leveraging Session Embeddings for Session Robustness in Speaker Verification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Hee-Soo Heo
Ki-hyun Nam
Bong-Jin Lee
Youngki Kwon
Min-Ji Lee
You Jin Kim
Joon Son Chung
263
3
0
26 Sep 2023
Multi-Domain Adaptation by Self-Supervised Learning for Speaker Verification
Wan Lin
Lantian Li
D. Wang
116
2
0
25 Sep 2023
Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Di Liang
Nian Shao
Xiaofei Li
194
7
0
25 Sep 2023
Contrastive Speaker Embedding With Sequential Disentanglement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Youzhi Tu
Man-Wai Mak
Jen-Tzung Chien
CoGe
160
8
0
23 Sep 2023
Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models
Interspeech, 2023
Asad Ullah
Alessandro Ragano
Andrew Hines
428
4
0
22 Sep 2023
NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Naohiro Tawara
Marc Delcroix
Atsushi Ando
A. Ogawa
208
14
0
22 Sep 2023
A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Bengt J. Borgström
M. Brandstein
173
4
0
21 Sep 2023
The Impact of Silence on Speech Anti-Spoofing
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Yuxiang Zhang
Zhuo Li
Jingze Lu
Hua Hua
Wenchao Wang
Pengyuan Zhang
193
30
0
21 Sep 2023
Refining DNN-based Mask Estimation using CGMM-based EM Algorithm for Multi-channel Noise Reduction
Interspeech, 2022
Julitta Bartolewska
Stanisław Kacprzak
K. Kowalczyk
122
0
0
18 Sep 2023
Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ju-ho Kim
Ju-Sung Heo
Hyun-Seo Shin
Chanmann Lim
Ha-Jin Yu
DiffM
160
7
0
14 Sep 2023
PromptASR for contextualized ASR with controllable style
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xiaoyu Yang
Wei Kang
Zengwei Yao
Yifan Yang
Liyong Guo
Fangjun Kuang
Long Lin
Daniel Povey
344
24
0
14 Sep 2023
SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xiaoxiao Miao
Xin Eric Wang
Erica Cooper
Junichi Yamagishi
Nicholas W. D. Evans
Massimiliano Todisco
J. Bonastre
Mickael Rouvier
243
8
0
12 Sep 2023
LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech
Computer Speech and Language (CSL), 2023
Titouan Parcollet
H. Nguyen
Solène Evain
Marcely Zanon Boito
Adrien Pupier
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
262
26
0
11 Sep 2023
Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
ACM Multimedia (ACM MM), 2023
Haotian Wang
Yuxuan Xi
Hang Chen
Jun Du
Yan Song
...
Pengfei Hu
Ya Jiang
Shi Cheng
Jie Zhang
Yuzhe Weng
213
5
0
11 Sep 2023
ReZero: Region-customizable Sound Extraction
Rongzhi Gu
Yi Luo
147
34
0
31 Aug 2023
The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge
Ruoyu Wang
Maokui He
Jun Du
Hengshun Zhou
Shutong Niu
...
Mengzhi Wang
Genshun Wan
Jia Pan
Jianqing Gao
Chin-Hui Lee
235
16
0
28 Aug 2023
UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023
Yu Zheng
Yajun Zhang
Chuanying Niu
Yibin Zhan
Yanhua Long
Dongxing Xu
140
6
0
24 Aug 2023
AdVerb: Visually Guided Audio Dereverberation
IEEE International Conference on Computer Vision (ICCV), 2023
Sanjoy Chowdhury
Sreyan Ghosh
Subhrajyoti Dasgupta
Anton Ratnarajah
Utkarsh Tyagi
Tianyi Zhou
212
18
0
23 Aug 2023
Convoifilter: A case study of doing cocktail party speech recognition
Thai-Binh Nguyen
A. Waibel
243
2
0
22 Aug 2023
The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023
Zexin Cai
Weiqing Wang
Yikang Wang
Ming Li
144
10
0
20 Aug 2023
Graph Neural Network Backend for Speaker Recognition
Liang He
Rui Li
Mengqi Niu
168
0
0
17 Aug 2023
The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023
Ze Li
Yuke Lin
Xiaoyi Qin
Ning Jiang
Guoqing Zhao
Ming Li
172
7
0
17 Aug 2023
ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 2023
Mengjie Du
Xiang Fang
Jie Li
167
0
0
16 Aug 2023
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Xiaofei Wang
Manthan Thakker
Zhuo Chen
Naoyuki Kanda
Sefik Emre Eskimez
Sanyuan Chen
M. Tang
Shujie Liu
Jinyu Li
Takuya Yoshioka
315
112
0
14 Aug 2023
Large-Scale Learning on Overlapped Speech Detection: New Benchmark and New General System
Zhao-Yu Yin
Jingguang Tian
Xinhui Hu
Xinkang Xu
Yang Xiang
210
2
0
11 Aug 2023
Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains
Martin Lebourdais
Théo Mariotte
Marie Tahon
Anthony Larcher
Antoine Laurent
Silvio Montrésor
S. Meignier
Jean-Hugh Thomas
VLM
102
6
0
24 Jul 2023
Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training
Interspeech, 2023
Gege Qi
YueFeng Chen
Xiaofeng Mao
Yang Liu
Ranjie Duan
Rong Zhang
Hui Xue
VLM
AAML
219
1
0
24 Jul 2023
PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker Verification
Wonbin Kim
Hyun-Seo Shin
Ju-ho Kim
Ju-Sung Heo
Chanmann Lim
Ha-Jin Yu
178
2
0
20 Jul 2023
Exploring Binary Classification Loss For Speaker Verification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Bing Han
Zhengyang Chen
Y. Qian
CVBM
168
16
0
17 Jul 2023
Representation Learning With Hidden Unit Clustering For Low Resource Speech Applications
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Varun Krishna
T. Sai
Sriram Ganapathy
SSL
161
3
0
14 Jul 2023
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Interspeech, 2023
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
236
3
0
05 Jul 2023
Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Yikang Wang
Hiromitsu Nishizaki
Ming Li
215
1
0
04 Jul 2023
An End-to-End Multi-Module Audio Deepfake Generation System for ADD Challenge 2023
Sheng Zhao
Qi-ping Yuan
Yibo Duan
Zhuo Chen
148
2
0
03 Jul 2023
VoxWatch: An open-set speaker recognition benchmark on VoxCeleb
Raghuveer Peri
S. O. Sadjadi
D. Garcia-Romero
159
6
0
30 Jun 2023
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Samuele Cornell
Matthew Wiesner
Shinji Watanabe
Desh Raj
Xuankai Chang
...
Matthew Maciejewski
Yoshiki Masuyama
Zhong-Qiu Wang
S. Squartini
Sanjeev Khudanpur
239
76
0
23 Jun 2023
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yuchen Hu
Chen Chen
Ruizhe Li
Heqing Zou
Chng Eng Siong
GAN
208
11
0
18 Jun 2023
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yuchen Hu
Ruizhe Li
Cheng Chen
Chengwei Qin
Qiu-shi Zhu
Eng Siong Chng
223
14
0
18 Jun 2023
SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Desh Raj
Daniel Povey
Sanjeev Khudanpur
VLM
337
16
0
18 Jun 2023
CoverHunter: Cover Song Identification with Refined Attention and Alignments
IEEE International Conference on Multimedia and Expo (ICME), 2023
Yifan Zhang
Deyi Tuo
Yinan Xu
Xintong Han
167
9
0
15 Jun 2023
Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech
Interspeech, 2023
Vishwanath Pratap Singh
Md. Sahidullah
Tomi Kinnunen
197
5
0
13 Jun 2023