Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1510.08484
Cited By
MUSAN: A Music, Speech, and Noise Corpus
28 October 2015
David Snyder
Guoguo Chen
Daniel Povey
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"MUSAN: A Music, Speech, and Noise Corpus"
50 / 664 papers shown
Yet Another Model for Arabic Dialect Identification
Ajinkya Kulkarni
Hanan Aldarmaki
151
6
0
20 Oct 2023
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System
T. Park
He Huang
Ante Jukić
Kunal Dhawan
Krishna C. Puvvada
Nithin Rao Koluguri
Nikolay Karpov
A. Laptev
Jagadeesh Balam
Boris Ginsburg
202
11
0
18 Oct 2023
End-to-end Online Speaker Diarization with Target Speaker Tracking
Weiqing Wang
Ming Li
315
7
0
12 Oct 2023
LRPD: Large Replay Parallel Dataset
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Ivan Yakovlev
Mikhail P. Melnikov
Nikita Bukhal
Rostislav Makarov
Alexander Alenin
Juncheng Billy Li
A. Okhotnikov
265
2
0
29 Sep 2023
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS
Xin Wang
Taein Kwon
Wei-Ning Hsu
Yossi Adi
Tu Nguyen
D. Bohus
Emmanuel Dupoux
Neel Joshi
Abdelrahman Mohamed
173
5
0
29 Sep 2023
Audio-Visual Speaker Verification via Joint Cross-Attention
International Conference on Speech and Computer (SPECOM), 2023
R Gnana Praveen
Jahangir Alam
277
10
0
28 Sep 2023
Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Thilo von Neumann
Christoph Boeddeker
Tobias Cord-Landwehr
Marc Delcroix
Reinhold Haeb-Umbach
285
13
0
28 Sep 2023
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ziqian Ning
Yuepeng Jiang
Pengcheng Zhu
Shuai Wang
Jixun Yao
Linfu Xie
Mengxiao Bi
294
8
0
27 Sep 2023
Collaborative Watermarking for Adversarial Speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Lauri Juvela
Xin Wang
230
19
0
26 Sep 2023
Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Duc-Tuan Truong
Ruijie Tao
J. Yip
Kong Aik Lee
Chng Eng Siong
187
12
0
26 Sep 2023
Rethinking Session Variability: Leveraging Session Embeddings for Session Robustness in Speaker Verification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Hee-Soo Heo
Ki-hyun Nam
Bong-Jin Lee
Youngki Kwon
Min-Ji Lee
You Jin Kim
Joon Son Chung
263
3
0
26 Sep 2023
Multi-Domain Adaptation by Self-Supervised Learning for Speaker Verification
Wan Lin
Lantian Li
D. Wang
116
2
0
25 Sep 2023
Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Di Liang
Nian Shao
Xiaofei Li
194
7
0
25 Sep 2023
Contrastive Speaker Embedding With Sequential Disentanglement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Youzhi Tu
Man-Wai Mak
Jen-Tzung Chien
CoGe
160
8
0
23 Sep 2023
Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models
Interspeech (Interspeech), 2023
Asad Ullah
Alessandro Ragano
Andrew Hines
428
4
0
22 Sep 2023
NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Naohiro Tawara
Marc Delcroix
Atsushi Ando
A. Ogawa
208
14
0
22 Sep 2023
A Multiscale Autoencoder (MSAE) Framework for End-to-End Neural Network Speech Enhancement
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Bengt J. Borgström
M. Brandstein
173
4
0
21 Sep 2023
The Impact of Silence on Speech Anti-Spoofing
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Yuxiang Zhang
Zhuo Li
Jingze Lu
Hua Hua
Wenchao Wang
Pengyuan Zhang
193
30
0
21 Sep 2023
Refining DNN-based Mask Estimation using CGMM-based EM Algorithm for Multi-channel Noise Reduction
Interspeech (Interspeech), 2022
Julitta Bartolewska
Stanisław Kacprzak
K. Kowalczyk
122
0
0
18 Sep 2023
Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ju-ho Kim
Ju-Sung Heo
Hyun-Seo Shin
Chanmann Lim
Ha-Jin Yu
DiffM
160
7
0
14 Sep 2023
PromptASR for contextualized ASR with controllable style
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xiaoyu Yang
Wei Kang
Zengwei Yao
Yifan Yang
Liyong Guo
Fangjun Kuang
Long Lin
Daniel Povey
344
24
0
14 Sep 2023
SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xiaoxiao Miao
Xin Eric Wang
Erica Cooper
Junichi Yamagishi
Nicholas W. D. Evans
Massimiliano Todisco
J. Bonastre
Mickael Rouvier
243
8
0
12 Sep 2023
LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech
Computer Speech and Language (CSL), 2023
Titouan Parcollet
H. Nguyen
Solène Evain
Marcely Zanon Boito
Adrien Pupier
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
262
26
0
11 Sep 2023
Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
ACM Multimedia (ACM MM), 2023
Haotian Wang
Yuxuan Xi
Hang Chen
Jun Du
Yan Song
...
Pengfei Hu
Ya Jiang
Shi Cheng
Jie Zhang
Yuzhe Weng
213
5
0
11 Sep 2023
ReZero: Region-customizable Sound Extraction
Rongzhi Gu
Yi Luo
147
34
0
31 Aug 2023
The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge
Ruoyu Wang
Maokui He
Jun Du
Hengshun Zhou
Shutong Niu
...
Mengzhi Wang
Genshun Wan
Jia Pan
Jianqing Gao
Chin-Hui Lee
235
16
0
28 Aug 2023
UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023
Yu Zheng
Yajun Zhang
Chuanying Niu
Yibin Zhan
Yanhua Long
Dongxing Xu
140
6
0
24 Aug 2023
AdVerb: Visually Guided Audio Dereverberation
IEEE International Conference on Computer Vision (ICCV), 2023
Sanjoy Chowdhury
Sreyan Ghosh
Subhrajyoti Dasgupta
Anton Ratnarajah
Utkarsh Tyagi
Tianyi Zhou
212
18
0
23 Aug 2023
Convoifilter: A case study of doing cocktail party speech recognition
Thai-Binh Nguyen
A. Waibel
243
2
0
22 Aug 2023
The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023
Zexin Cai
Weiqing Wang
Yikang Wang
Ming Li
144
10
0
20 Aug 2023
Graph Neural Network Backend for Speaker Recognition
Liang He
Rui Li
Mengqi Niu
168
0
0
17 Aug 2023
The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 2023
Ze Li
Yuke Lin
Xiaoyi Qin
Ning Jiang
Guoqing Zhao
Ming Li
172
7
0
17 Aug 2023
ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 2023
Mengjie Du
Xiang Fang
Jie Li
167
0
0
16 Aug 2023
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Xiaofei Wang
Manthan Thakker
Zhuo Chen
Naoyuki Kanda
Sefik Emre Eskimez
Sanyuan Chen
M. Tang
Shujie Liu
Jinyu Li
Takuya Yoshioka
315
112
0
14 Aug 2023
Large-Scale Learning on Overlapped Speech Detection: New Benchmark and New General System
Zhao-Yu Yin
Jingguang Tian
Xinhui Hu
Xinkang Xu
Yang Xiang
210
2
0
11 Aug 2023
Joint speech and overlap detection: a benchmark over multiple audio setup and speech domains
Martin Lebourdais
Théo Mariotte
Marie Tahon
Anthony Larcher
Antoine Laurent
Silvio Montrésor
S. Meignier
Jean-Hugh Thomas
VLM
102
6
0
24 Jul 2023
Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training
Interspeech (Interspeech), 2023
Gege Qi
YueFeng Chen
Xiaofeng Mao
Yang Liu
Ranjie Duan
Rong Zhang
Hui Xue
VLM
AAML
219
1
0
24 Jul 2023
PAS: Partial Additive Speech Data Augmentation Method for Noise Robust Speaker Verification
Wonbin Kim
Hyun-Seo Shin
Ju-ho Kim
Ju-Sung Heo
Chanmann Lim
Ha-Jin Yu
178
2
0
20 Jul 2023
Exploring Binary Classification Loss For Speaker Verification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Bing Han
Zhengyang Chen
Y. Qian
CVBM
168
16
0
17 Jul 2023
Representation Learning With Hidden Unit Clustering For Low Resource Speech Applications
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Varun Krishna
T. Sai
Sriram Ganapathy
SSL
161
3
0
14 Jul 2023
Self-supervised learning with diffusion-based multichannel speech enhancement for speaker verification under noisy conditions
Interspeech (Interspeech), 2023
Sandipana Dowerah
Ajinkya Kulkarni
Romain Serizel
D. Jouvet
DiffM
236
3
0
05 Jul 2023
Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Yikang Wang
Hiromitsu Nishizaki
Ming Li
215
1
0
04 Jul 2023
An End-to-End Multi-Module Audio Deepfake Generation System for ADD Challenge 2023
Sheng Zhao
Qi-ping Yuan
Yibo Duan
Zhuo Chen
148
2
0
03 Jul 2023
VoxWatch: An open-set speaker recognition benchmark on VoxCeleb
Raghuveer Peri
S. O. Sadjadi
D. Garcia-Romero
159
6
0
30 Jun 2023
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
Samuele Cornell
Sanjeev Khudanpur
Shinji Watanabe
Desh Raj
Xuankai Chang
...
Matthew Maciejewski
Yoshiki Masuyama
Zhong-Qiu Wang
S. Squartini
Sanjeev Khudanpur
239
76
0
23 Jun 2023
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yuchen Hu
Chen Chen
Ruizhe Li
Heqing Zou
Chng Eng Siong
GAN
208
11
0
18 Jun 2023
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yuchen Hu
Ruizhe Li
Cheng Chen
Chengwei Qin
Qiu-shi Zhu
Eng Siong Chng
223
14
0
18 Jun 2023
SURT 2.0: Advances in Transducer-based Multi-talker Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Desh Raj
Daniel Povey
Sanjeev Khudanpur
VLM
337
16
0
18 Jun 2023
CoverHunter: Cover Song Identification with Refined Attention and Alignments
IEEE International Conference on Multimedia and Expo (ICME), 2023
Yifan Zhang
Deyi Tuo
Yinan Xu
Xintong Han
167
9
0
15 Jun 2023
Speaker Verification Across Ages: Investigating Deep Speaker Embedding Sensitivity to Age Mismatch in Enrollment and Test Speech
Interspeech (Interspeech), 2023
Vishwanath Pratap Singh
Md. Sahidullah
Tomi Kinnunen
197
5
0
13 Jun 2023
Previous
1
2
3
4
5
6
...
12
13
14
Next
Page 5 of 14
Page
of 14
Go