1510.08484
Cited By
MUSAN: A Music, Speech, and Noise Corpus
28 October 2015
David Snyder
Guoguo Chen
Daniel Povey
Papers citing
"MUSAN: A Music, Speech, and Noise Corpus"
50 / 664 papers shown
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Shota Horiguchi
Shinji Watanabe
Leibny Paola García-Perera
Yuki Takashima
Yohei Kawaguchi
277
29
0
06 Jun 2022
Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems
Knowledge Discovery and Data Mining (KDD), 2022
Ting-En Lin
Yuchuan Wu
Feiling Huang
Luo Si
Jian Sun
Yongbin Li
342
32
0
30 May 2022
Self-Supervised Speech Representation Learning: A Review
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
679
445
0
21 May 2022
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT
Interspeech (Interspeech), 2022
Bowen Shi
Abdel-rahman Mohamed
Wei-Ning Hsu
SSL
235
22
0
15 May 2022
Collar-aware Training for Streaming Speaker Change Detection in Broadcast Speech
The Speaker and Language Recognition Workshop (Odyssey), 2022
Joonas Kalda
Tanel Alumäe
143
5
0
14 May 2022
Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge
The Speaker and Language Recognition Workshop (Odyssey), 2022
Tanel Alumäe
Kunnar Kukk
107
8
0
14 May 2022
Task splitting for DNN-based acoustic echo and noise removal
International Workshop on Acoustic Signal Enhancement (IWAENC), 2022
Sebastian Braun
Maria Luis Valero
176
20
0
13 May 2022
Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Fuchuan Tong
Siqi Zheng
Min Zhang
Yafeng Chen
Hongbin Suo
Q. Hong
Lin Li
SSL
151
11
0
25 Apr 2022
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization
The Speaker and Language Recognition Workshop (Odyssey), 2022
Natsuo Yamashita
Shota Horiguchi
Takeshi Homma
217
22
0
24 Apr 2022
The 2021 NIST Speaker Recognition Evaluation
The Speaker and Language Recognition Workshop (Odyssey), 2022
S. O. Sadjadi
Craig S. Greenberg
E. Singer
Lisa P. Mason
D. A. Reynolds
175
77
0
21 Apr 2022
The NIST CTS Speaker Recognition Challenge
The Speaker and Language Recognition Workshop (Odyssey), 2022
S. O. Sadjadi
Craig S. Greenberg
E. Singer
Lisa P. Mason
D. Reynolds
ELM
271
0
0
21 Apr 2022
Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion
The Speaker and Language Recognition Workshop (Odyssey), 2022
Hye-jin Shim
Hemlata Tak
Xuechen Liu
Hee-Soo Heo
Jee-weon Jung
...
Héctor Delgado
Kong Aik Lee
Md. Sahidullah
Tomi Kinnunen
Nicholas W. D. Evans
AAML
158
17
0
21 Apr 2022
Layer-wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Xun Gong
Y. Qian
Houjun Huang
Yanmin Qian
183
61
0
21 Apr 2022
Audio Deep Fake Detection System with Neural Stitching for ADD 2022
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rui Yan
Cheng Wen
Shuran Zhou
Tingwei Guo
Wei Zou
Xiangang Li
147
28
0
19 Apr 2022
Detecting Vocal Fatigue with Neural Embeddings
Journal of Voice (J Voice), 2022
Sebastian P. Bayerl
Dominik Wagner
Ilja Baumann
Korbinian Riedhammer
Tobias Bocklet
160
11
0
07 Apr 2022
Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification
Spoken Language Technology Workshop (SLT), 2022
Sung Hwan Mun
Jee-weon Jung
Min Hyun Han
N. Kim
291
28
0
03 Apr 2022
From Simulated Mixtures to Simulated Conversations as Training Data for End-to-End Neural Diarization
Interspeech (Interspeech), 2022
Federico Landini
Alicia Lozano-Diez
Mireia Díez
Lukáš Burget
209
46
0
02 Apr 2022
Improved Relation Networks for End-to-End Speaker Verification and Identification
Interspeech (Interspeech), 2022
Ashutosh Chaubey
Sparsh Sinha
Susmita Ghose
144
4
0
31 Mar 2022
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers
Spoken Language Technology Workshop (SLT), 2022
Soumi Maiti
Yushi Ueda
Shinji Watanabe
Chunlei Zhang
Meng Yu
Shi-Xiong Zhang
Yong-mei Xu
289
42
0
31 Mar 2022
Adversarial Speaker Distillation for Countermeasure Model on Automatic Speaker Verification
Yen-Lun Liao
Xuan-Bo Chen
Chung-Che Wang
J. Jang
AAML
470
9
0
31 Mar 2022
Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice Conversion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jiachen Lian
Chunlei Zhang
Dong Yu
DRL
143
55
0
30 Mar 2022
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification
Interspeech (Interspeech), 2022
Saurabh Kataria
Jesús Villalba
Laureano Moro-Velazquez
Najim Dehak
120
4
0
30 Mar 2022
Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation
Interspeech (Interspeech), 2022
Kuan Po Huang
Yuanbin Fu
Yu Zhang
Hung-yi Lee
276
28
0
30 Mar 2022
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios
Zhihao Du
Shiliang Zhang
Siqi Zheng
Zhijie Yan
162
2
0
18 Mar 2022
TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory
Interspeech (Interspeech), 2022
Andong Li
Guochen Yu
C. Zheng
Xiaodong Li
147
15
0
14 Mar 2022
Improving the transferability of speech separation by meta-learning
Kuan-Po Huang
Yuan-Kuei Wu
Hung-yi Lee
136
2
0
11 Mar 2022
An Environmental Feature Representation in I-vector Space for Room Verification and Metadata Estimation
Desmond Caulley
49
1
0
09 Mar 2022
Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement
IEEE transactions on multimedia (IEEE TMM), 2022
Jun Xiong
Can Ma
Peng Zhang
Lei Xie
Wei Huang
Yufei Zha
199
37
0
04 Mar 2022
Magnitude-aware Probabilistic Speaker Embeddings
The Speaker and Language Recognition Workshop (Odyssey), 2022
Nikita Kuzmin
Igor Fedorov
A. Sholokhov
244
7
0
28 Feb 2022
Contrastive-mixup learning for improved speaker verification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Xin Zhang
Minho Jin
R. Cheng
Ruirui Li
Eunjung Han
A. Stolcke
AAML
SSL
123
11
0
22 Feb 2022
Multi-style Training for South African Call Centre Audio
Walter Heymans
Marelie Hattingh Davel
C. van Heerden
72
3
0
15 Feb 2022
Spiking Cochlea with System-level Local Automatic Gain Control
IEEE Transactions on Circuits and Systems Part 1: Regular Papers (TCAS I), 2022
Ilya Kiselev
Chang Gao
Shih-Chii Liu
159
14
0
14 Feb 2022
Partially Fake Audio Detection by Self-attention-based Fake Span Discovery
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Haibin Wu
Heng-Cheng Kuo
Naijun Zheng
Kuo-Hsuan Hung
Hung-yi Lee
Yu Tsao
Hsin-Min Wang
Helen Meng
213
46
0
14 Feb 2022
Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
K. Kinoshita
Marc Delcroix
Tomoharu Iwata
BDL
167
21
0
14 Feb 2022
The xmuspeech system for multi-channel multi-party meeting transcription challenge
Jie Wang
Yuji Liu
Binling Wang
Yiming Zhi
Song Li
Shipeng Xia
Jiayang Zhang
Lin Li
Q. Hong
Feng Tong
157
0
0
11 Feb 2022
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Maokui He
Xiang Lv
Weilin Zhou
Jingjing Yin
Xiaoqi Zhang
...
Shutong Niu
Yuhang Cao
Heng Lu
Jun Du
Chin-Hui Lee
171
8
0
10 Feb 2022
Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge
Jingguang Tian
Xinhui Hu
Xinkang Xu
190
9
0
10 Feb 2022
The Volcspeech system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chen Shen
Yi Y. Liu
Wenzhi Fan
Bin Wang
Shi-Xue Wen
Yao Tian
Jun Zhang
Jingsheng Yang
Zejun Ma
158
5
0
09 Feb 2022
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Weiqing Wang
Xiaoyi Qin
Ming Li
173
32
0
06 Feb 2022
A deep complex multi-frame filtering network for stereophonic acoustic echo cancellation
Interspeech (Interspeech), 2022
Linjuan Cheng
C. Zheng
Andong Li
Yuquan Wu
Renhua Peng
Xiaodong Li
130
1
0
03 Feb 2022
The RoyalFlush System of Speech Recognition for M2MeT Challenge
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Shuaishuai Ye
Peiyao Wang
Shunfei Chen
Xinhui Hu
Xinkang Xu
192
6
0
03 Feb 2022
The CORAL++ Algorithm for Unsupervised Domain Adaptation of Speaker Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rongjin Li
Weibin Zhang
Dongpeng Chen
271
26
0
02 Feb 2022
Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System
Zhenyu Wang
John H. L. Hansen
80
0
0
28 Jan 2022
SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan
Jee-weon Jung
Hemlata Tak
Hye-jin Shim
Hee-Soo Heo
Bong-Jin Lee
Soo-Whan Chung
Hong-Goo Kang
Ha-Jin Yu
Nicholas W. D. Evans
Tomi Kinnunen
217
34
0
25 Jan 2022
Optimizing Tandem Speaker Verification and Anti-Spoofing Systems
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Anssi Kanervisto
Ville Hautamaki
Tomi Kinnunen
Junichi Yamagishi
164
18
0
24 Jan 2022
PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Takuya Yoshioka
Xiaofei Wang
Dongmei Wang
138
6
0
24 Jan 2022
ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Dianwen Ng
Yunqi Chen
Biao Tian
Qiang Fu
Chng Eng Siong
123
59
0
15 Jan 2022
Robust Self-Supervised Audio-Visual Speech Recognition
Interspeech (Interspeech), 2022
Bowen Shi
Wei-Ning Hsu
Abdel-rahman Mohamed
359
117
0
05 Jan 2022
Multi-Variant Consistency based Self-supervised Learning for Robust Automatic Speech Recognition
Changfeng Gao
Gaofeng Cheng
Pengyuan Zhang
262
4
0
23 Dec 2021
Towards Robust Real-time Audio-Visual Speech Enhancement
M. Gogate
K. Dashtipour
Amir Hussain
213
4
0
16 Dec 2021