ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.08484
  4. Cited By
MUSAN: A Music, Speech, and Noise Corpus

MUSAN: A Music, Speech, and Noise Corpus

28 October 2015
David Snyder
Guoguo Chen
Daniel Povey
ArXiv (abs)PDFHTML

Papers citing "MUSAN: A Music, Speech, and Noise Corpus"

50 / 664 papers shown
Online Neural Diarization of Unlimited Numbers of Speakers Using Global
  and Local Attractors
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local AttractorsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yuki Takashima
Yohei Kawaguchi
277
29
0
06 Jun 2022
Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue
  Systems
Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue SystemsKnowledge Discovery and Data Mining (KDD), 2022
Ting-En Lin
Yuchuan Wu
Feiling Huang
Luo Si
Jian Sun
Yongbin Li
342
32
0
30 May 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A ReviewIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSLAI4TS
679
445
0
21 May 2022
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERTInterspeech (Interspeech), 2022
Bowen Shi
Abdel-rahman Mohamed
Wei-Ning Hsu
SSL
235
22
0
15 May 2022
Collar-aware Training for Streaming Speaker Change Detection in
  Broadcast Speech
Collar-aware Training for Streaming Speaker Change Detection in Broadcast SpeechThe Speaker and Language Recognition Workshop (Odyssey), 2022
Joonas Kalda
Tanel Alumäe
143
5
0
14 May 2022
Pretraining Approaches for Spoken Language Recognition: TalTech
  Submission to the OLR 2021 Challenge
Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 ChallengeThe Speaker and Language Recognition Workshop (Odyssey), 2022
Tanel Alumäe
Kunnar Kukk
107
8
0
14 May 2022
Task splitting for DNN-based acoustic echo and noise removal
Task splitting for DNN-based acoustic echo and noise removalInternational Workshop on Acoustic Signal Enhancement (IWAENC), 2022
Sebastian Braun
Maria Luis Valero
176
20
0
13 May 2022
Graph Convolutional Network Based Semi-Supervised Learning on
  Multi-Speaker Meeting Data
Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting DataIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Fuchuan Tong
Siqi Zheng
Min Zhang
Yafeng Chen
Hongbin Suo
Q. Hong
Lin Li
SSL
151
11
0
25 Apr 2022
Improving the Naturalness of Simulated Conversations for End-to-End
  Neural Diarization
Improving the Naturalness of Simulated Conversations for End-to-End Neural DiarizationThe Speaker and Language Recognition Workshop (Odyssey), 2022
Natsuo Yamashita
Shota Horiguchi
Takeshi Homma
217
22
0
24 Apr 2022
The 2021 NIST Speaker Recognition Evaluation
The 2021 NIST Speaker Recognition EvaluationThe Speaker and Language Recognition Workshop (Odyssey), 2022
S. O. Sadjadi
Craig S. Greenberg
E. Singer
Lisa P. Mason
D. A. Reynolds
175
77
0
21 Apr 2022
The NIST CTS Speaker Recognition Challenge
The NIST CTS Speaker Recognition ChallengeThe Speaker and Language Recognition Workshop (Odyssey), 2022
S. O. Sadjadi
Craig S. Greenberg
E. Singer
Lisa P. Mason
D. Reynolds
ELM
271
0
0
21 Apr 2022
Baseline Systems for the First Spoofing-Aware Speaker Verification
  Challenge: Score and Embedding Fusion
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding FusionThe Speaker and Language Recognition Workshop (Odyssey), 2022
Hye-jin Shim
Hemlata Tak
Xuechen Liu
Hee-Soo Heo
Jee-weon Jung
...
Héctor Delgado
Kong Aik Lee
Md. Sahidullah
Tomi Kinnunen
Nicholas W. D. Evans
AAML
158
17
0
21 Apr 2022
Layer-wise Fast Adaptation for End-to-End Multi-Accent Speech
  Recognition
Layer-wise Fast Adaptation for End-to-End Multi-Accent Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Xun Gong
Y. Qian
Houjun Huang
Yanmin Qian
183
61
0
21 Apr 2022
Audio Deep Fake Detection System with Neural Stitching for ADD 2022
Audio Deep Fake Detection System with Neural Stitching for ADD 2022IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rui Yan
Cheng Wen
Shuran Zhou
Tingwei Guo
Wei Zou
Xiangang Li
147
28
0
19 Apr 2022
Detecting Vocal Fatigue with Neural Embeddings
Detecting Vocal Fatigue with Neural EmbeddingsJournal of Voice (J Voice), 2022
Sebastian P. Bayerl
Dominik Wagner
Ilja Baumann
Korbinian Riedhammer
Tobias Bocklet
160
11
0
07 Apr 2022
Frequency and Multi-Scale Selective Kernel Attention for Speaker
  Verification
Frequency and Multi-Scale Selective Kernel Attention for Speaker VerificationSpoken Language Technology Workshop (SLT), 2022
Sung Hwan Mun
Jee-weon Jung
Min Hyun Han
N. Kim
291
28
0
03 Apr 2022
From Simulated Mixtures to Simulated Conversations as Training Data for
  End-to-End Neural Diarization
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural DiarizationInterspeech (Interspeech), 2022
Federico Landini
Alicia Lozano-Diez
Mireia Díez
Lukávs Burget
209
46
0
02 Apr 2022
Improved Relation Networks for End-to-End Speaker Verification and
  Identification
Improved Relation Networks for End-to-End Speaker Verification and IdentificationInterspeech (Interspeech), 2022
Ashutosh Chaubey
Sparsh Sinha
Susmita Ghose
144
4
0
31 Mar 2022
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech
  Separation for Flexible Number of Speakers
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of SpeakersSpoken Language Technology Workshop (SLT), 2022
Soumi Maiti
Yushi Ueda
Shinji Watanabe
Chunlei Zhang
Meng Yu
Shi-Xiong Zhang
Yong-mei Xu
289
42
0
31 Mar 2022
Adversarial Speaker Distillation for Countermeasure Model on Automatic
  Speaker Verification
Adversarial Speaker Distillation for Countermeasure Model on Automatic Speaker Verification
Yen-Lun Liao
Xuan-Bo Chen
Chung-Che Wang
J. Jang
AAML
470
9
0
31 Mar 2022
Robust Disentangled Variational Speech Representation Learning for
  Zero-shot Voice Conversion
Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice ConversionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jiachen Lian
Chunlei Zhang
Dong Yu
DRL
143
55
0
30 Mar 2022
Joint domain adaptation and speech bandwidth extension using time-domain
  GANs for speaker verification
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verificationInterspeech (Interspeech), 2022
Saurabh Kataria
Jesús Villalba
Laureano Moro-Velazquez
Najim Dehak
120
4
0
30 Mar 2022
Improving Distortion Robustness of Self-supervised Speech Processing
  Tasks with Domain Adaptation
Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain AdaptationInterspeech (Interspeech), 2022
Kuan Po Huang
Yuanbin Fu
Yu Zhang
Hung-yi Lee
276
28
0
30 Mar 2022
Speaker Embedding-aware Neural Diarization: an Efficient Framework for
  Overlapping Speech Diarization in Meeting Scenarios
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios
Zhihao Du
Shiliang Zhang
Siqi Zheng
Zhijie Yan
162
2
0
18 Mar 2022
TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel
  Speech Enhancement from Taylor's Approximation Theory
TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation TheoryInterspeech (Interspeech), 2022
Andong Li
Guochen Yu
C. Zheng
Xiaodong Li
147
15
0
14 Mar 2022
Improving the transferability of speech separation by meta-learning
Improving the transferability of speech separation by meta-learning
Kuan-Po Huang
Yuan-Kuei Wu
Hung-yi Lee
136
2
0
11 Mar 2022
An Environmental Feature Representation in I-vector Space for Room
  Verification and Metadata Estimation
An Environmental Feature Representation in I-vector Space for Room Verification and Metadata Estimation
Desmond Caulley
49
1
0
09 Mar 2022
Look\&Listen: Multi-Modal Correlation Learning for Active Speaker
  Detection and Speech Enhancement
Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech EnhancementIEEE transactions on multimedia (IEEE TMM), 2022
Jun Xiong
Can Ma
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
199
37
0
04 Mar 2022
Magnitude-aware Probabilistic Speaker Embeddings
Magnitude-aware Probabilistic Speaker EmbeddingsThe Speaker and Language Recognition Workshop (Odyssey), 2022
Nikita Kuzmin
Igor Fedorov
A. Sholokhov
244
7
0
28 Feb 2022
Contrastive-mixup learning for improved speaker verification
Contrastive-mixup learning for improved speaker verificationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Xin Zhang
Minho Jin
R. Cheng
Ruirui Li
Eunjung Han
A. Stolcke
AAMLSSL
123
11
0
22 Feb 2022
Multi-style Training for South African Call Centre Audio
Multi-style Training for South African Call Centre Audio
Walter Heymans
Marelie Hattingh Davel
C. van Heerden
72
3
0
15 Feb 2022
Spiking Cochlea with System-level Local Automatic Gain Control
Spiking Cochlea with System-level Local Automatic Gain ControlIEEE Transactions on Circuits and Systems Part 1: Regular Papers (TCAS I), 2022
Ilya Kiselev
Chang Gao
Shih-Chii Liu
159
14
0
14 Feb 2022
Partially Fake Audio Detection by Self-attention-based Fake Span
  Discovery
Partially Fake Audio Detection by Self-attention-based Fake Span DiscoveryIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Haibin Wu
Heng-Cheng Kuo
Naijun Zheng
Kuo-Hsuan Hung
Hung-yi Lee
Yu Tsao
Hsin-Min Wang
Helen Meng
213
46
0
14 Feb 2022
Tight integration of neural- and clustering-based diarization through
  deep unfolding of infinite Gaussian mixture model
Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture modelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
K. Kinoshita
Marc Delcroix
Tomoharu Iwata
BDL
167
21
0
14 Feb 2022
The xmuspeech system for multi-channel multi-party meeting transcription
  challenge
The xmuspeech system for multi-channel multi-party meeting transcription challenge
Jie Wang
Yuji Liu
Binling Wang
Yiming Zhi
Song Li
Shipeng Xia
Jiayang Zhang
Lin Li
Q. Hong
Feng Tong
157
0
0
11 Feb 2022
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party
  meeting transcription (M2MeT) challenge
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challengeIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Maokui He
Xiang Lv
Weilin Zhou
Jingjing Yin
Xiaoqi Zhang
...
Shutong Niu
Yuhang Cao
Heng Lu
Jun Du
Chin-Hui Lee
171
8
0
10 Feb 2022
Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel
  Multi-party Meeting Transcription Challenge
Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge
Jingguang Tian
Xinhui Hu
Xinkang Xu
190
9
0
10 Feb 2022
The Volcspeech system for the ICASSP 2022 multi-channel multi-party
  meeting transcription challenge
The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challengeIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chen Shen
Yi Y. Liu
Wenzhi Fan
Bin Wang
Shi-Xue Wen
Yao Tian
Jun Zhang
Jingsheng Yang
Zejun Ma
158
5
0
09 Feb 2022
Cross-Channel Attention-Based Target Speaker Voice Activity Detection:
  Experimental Results for M2MeT Challenge
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT ChallengeIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Weiqing Wang
Xiaoyi Qin
Ming Li
173
32
0
06 Feb 2022
A deep complex multi-frame filtering network for stereophonic acoustic
  echo cancellation
A deep complex multi-frame filtering network for stereophonic acoustic echo cancellationInterspeech (Interspeech), 2022
Linjuan Cheng
C. Zheng
Andong Li
Yuquan Wu
Renhua Peng
Xiaodong Li
130
1
0
03 Feb 2022
The RoyalFlush System of Speech Recognition for M2MeT Challenge
The RoyalFlush System of Speech Recognition for M2MeT ChallengeIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Shuaishuai Ye
Peiyao Wang
Shunfei Chen
Xinhui Hu
Xinkang Xu
192
6
0
03 Feb 2022
The CORAL++ Algorithm for Unsupervised Domain Adaptation of Speaker
  Recogntion
The CORAL++ Algorithm for Unsupervised Domain Adaptation of Speaker RecogntionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rongjin Li
Weibin Zhang
Dongpeng Chen
271
26
0
02 Feb 2022
Impact of Naturalistic Field Acoustic Environments on Forensic
  Text-independent Speaker Verification System
Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System
Zhenyu Wang
John H. L. Hansen
80
0
0
28 Jan 2022
SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge
  Evaluation Plan
SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan
Jee-weon Jung
Hemlata Tak
Hye-jin Shim
Hee-Soo Heo
Bong-Jin Lee
Soo-Whan Chung
Hong-Goo Kang
Ha-Jin Yu
Nicholas W. D. Evans
Tomi Kinnunen
217
34
0
25 Jan 2022
Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
Optimizing Tandem Speaker Verification and Anti-Spoofing SystemsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Anssi Kanervisto
Ville Hautamaki
Tomi Kinnunen
Junichi Yamagishi
164
18
0
24 Jan 2022
PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays
PickNet: Real-Time Channel Selection for Ad Hoc Microphone ArraysIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Takuya Yoshioka
Xiaofei Wang
Dongmei Wang
138
6
0
24 Jan 2022
ConvMixer: Feature Interactive Convolution with Curriculum Learning for
  Small Footprint and Noisy Far-field Keyword Spotting
ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword SpottingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Dianwen Ng
Yunqi Chen
Biao Tian
Qiang Fu
Chng Eng Siong
123
59
0
15 Jan 2022
Robust Self-Supervised Audio-Visual Speech Recognition
Robust Self-Supervised Audio-Visual Speech RecognitionInterspeech (Interspeech), 2022
Bowen Shi
Wei-Ning Hsu
Abdel-rahman Mohamed
359
117
0
05 Jan 2022
Multi-Variant Consistency based Self-supervised Learning for Robust
  Automatic Speech Recognition
Multi-Variant Consistency based Self-supervised Learning for Robust Automatic Speech Recognition
Changfeng Gao
Gaofeng Cheng
Pengyuan Zhang
262
4
0
23 Dec 2021
Towards Robust Real-time Audio-Visual Speech Enhancement
Towards Robust Real-time Audio-Visual Speech Enhancement
M. Gogate
K. Dashtipour
Amir Hussain
213
4
0
16 Dec 2021
Previous
123...8910...121314
Next
Page 9 of 14
Pageof 14