MUSAN: A Music, Speech, and Noise Corpus

28 October 2015

Papers citing "MUSAN: A Music, Speech, and Noise Corpus"

50 / 664 papers shown

Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Shota Horiguchi

Shinji Watanabe

Leibny Paola García-Perera

Yuki Takashima

Yohei Kawaguchi

277

06 Jun 2022

Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems

Knowledge Discovery and Data Mining (KDD), 2022

Ting-En Lin

342

30 May 2022

Self-Supervised Speech Representation Learning: A Review

IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

Abdel-rahman Mohamed

Hung-yi Lee

Lasse Borgholt

Jakob Drachmann Havtorn

...

679

445

21 May 2022

Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT

Interspeech (Interspeech), 2022

235

15 May 2022

Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech

The Speaker and Language Recognition Workshop (Odyssey), 2022

Joonas Kalda

Tanel Alumäe

143

14 May 2022

Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge

The Speaker and Language Recognition Workshop (Odyssey), 2022

Tanel Alumäe

Kunnar Kukk

107

14 May 2022

Task splitting for DNN-based acoustic echo and noise removal

International Workshop on Acoustic Signal Enhancement (IWAENC), 2022

Sebastian Braun

Maria Luis Valero

176

13 May 2022

Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Fuchuan Tong

Siqi Zheng

Min Zhang

151

25 Apr 2022

Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization

The Speaker and Language Recognition Workshop (Odyssey), 2022

Natsuo Yamashita

Shota Horiguchi

Takeshi Homma

217

24 Apr 2022

The 2021 NIST Speaker Recognition Evaluation

The Speaker and Language Recognition Workshop (Odyssey), 2022

175

21 Apr 2022

The NIST CTS Speaker Recognition Challenge

The Speaker and Language Recognition Workshop (Odyssey), 2022

271

21 Apr 2022

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion

The Speaker and Language Recognition Workshop (Odyssey), 2022

Hye-jin Shim

Xuechen Liu

...

Kong Aik Lee

158

21 Apr 2022

Layer-wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

183

21 Apr 2022

Audio Deep Fake Detection System with Neural Stitching for ADD 2022

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Rui Yan

Cheng Wen

Shuran Zhou

Tingwei Guo

Wei Zou

Xiangang Li

147

19 Apr 2022

Detecting Vocal Fatigue with Neural Embeddings

Journal of Voice (J Voice), 2022

160

07 Apr 2022

Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification

Spoken Language Technology Workshop (SLT), 2022

291

03 Apr 2022

From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization

Interspeech (Interspeech), 2022

209

02 Apr 2022

Improved Relation Networks for End-to-End Speaker Verification and Identification

Interspeech (Interspeech), 2022

Ashutosh Chaubey

Sparsh Sinha

Susmita Ghose

144

31 Mar 2022

EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers

Spoken Language Technology Workshop (SLT), 2022

289

31 Mar 2022

Adversarial Speaker Distillation for Countermeasure Model on Automatic Speaker Verification

470

31 Mar 2022

Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice Conversion

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

143

30 Mar 2022

Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification

Interspeech (Interspeech), 2022

Saurabh Kataria

Jesús Villalba

Laureano Moro-Velazquez

Najim Dehak

120

30 Mar 2022

Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation

Interspeech (Interspeech), 2022

Kuan Po Huang

Yuanbin Fu

Yu Zhang

Hung-yi Lee

276

30 Mar 2022

Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios

Zhihao Du

Shiliang Zhang

Siqi Zheng

Zhijie Yan

162

18 Mar 2022

TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory

Interspeech (Interspeech), 2022

147

14 Mar 2022

Improving the transferability of speech separation by meta-learning

Kuan-Po Huang

Yuan-Kuei Wu

Hung-yi Lee

136

11 Mar 2022

An Environmental Feature Representation in I-vector Space for Room Verification and Metadata Estimation

Desmond Caulley

09 Mar 2022

Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement

IEEE Transactions on Multimedia (IEEE TMM), 2022

Jun Xiong

Can Ma

Peng Zhang

Lei Xie

Wei Huang

Yufei Zha

199

04 Mar 2022

Magnitude-aware Probabilistic Speaker Embeddings

The Speaker and Language Recognition Workshop (Odyssey), 2022

Nikita Kuzmin

Igor Fedorov

A. Sholokhov

244

28 Feb 2022

Contrastive-mixup learning for improved speaker verification

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

123

22 Feb 2022

Multi-style Training for South African Call Centre Audio

Walter Heymans

Marelie Hattingh Davel

C. van Heerden

15 Feb 2022

Spiking Cochlea with System-level Local Automatic Gain Control

IEEE Transactions on Circuits and Systems Part 1: Regular Papers (TCAS I), 2022

Ilya Kiselev

Chang Gao

Shih-Chii Liu

159

14 Feb 2022

Partially Fake Audio Detection by Self-attention-based Fake Span Discovery

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Haibin Wu

Yu Tsao

213

14 Feb 2022

Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

167

14 Feb 2022

The xmuspeech system for multi-channel multi-party meeting transcription challenge

157

11 Feb 2022

The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

...

Shutong Niu

Yuhang Cao

Heng Lu

Jun Du

Chin-Hui Lee

171

10 Feb 2022

Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge

Jingguang Tian

Xinhui Hu

Xinkang Xu

190

10 Feb 2022

The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

158

09 Feb 2022

Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Weiqing Wang

Xiaoyi Qin

Ming Li

173

06 Feb 2022

A deep complex multi-frame filtering network for stereophonic acoustic echo cancellation

Interspeech (Interspeech), 2022

130

03 Feb 2022

The RoyalFlush System of Speech Recognition for M2MeT Challenge

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

192

03 Feb 2022

The CORAL++ Algorithm for Unsupervised Domain Adaptation of Speaker Recogntion

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Rongjin Li

Weibin Zhang

Dongpeng Chen

271

02 Feb 2022

Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System

Zhenyu Wang

John H. L. Hansen

28 Jan 2022

SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan

Hye-jin Shim

217

25 Jan 2022

Optimizing Tandem Speaker Verification and Anti-Spoofing Systems

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

164

24 Jan 2022

PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Takuya Yoshioka

Xiaofei Wang

Dongmei Wang

138

24 Jan 2022

ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

123

15 Jan 2022

Robust Self-Supervised Audio-Visual Speech Recognition

Interspeech (Interspeech), 2022

Bowen Shi

Wei-Ning Hsu

Abdel-rahman Mohamed

359

117

05 Jan 2022

Multi-Variant Consistency based Self-supervised Learning for Robust Automatic Speech Recognition

Changfeng Gao

Gaofeng Cheng

Pengyuan Zhang

262

23 Dec 2021

Towards Robust Real-time Audio-Visual Speech Enhancement

M. Gogate

K. Dashtipour

Amir Hussain

213

16 Dec 2021