Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2012.03411
Cited By
v1
v2 (latest)
MLS: A Large-Scale Multilingual Dataset for Speech Research
Interspeech (Interspeech), 2020
7 December 2020
Vineel Pratap
Qiantong Xu
Anuroop Sriram
Gabriel Synnaeve
R. Collobert
AuLLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"MLS: A Large-Scale Multilingual Dataset for Speech Research"
40 / 390 papers shown
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
Interspeech (Interspeech), 2022
Edresson Casanova
C. Shulby
Alexander Korolev
Arnaldo Cândido Júnior
A. S. Soares
S. Aluísio
M. Ponti
318
16
0
29 Mar 2022
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions
Interspeech (Interspeech), 2022
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
N. Tomashenko
140
16
0
28 Mar 2022
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
Interspeech (Interspeech), 2022
Ye Jia
Yifan Ding
Ankur Bapna
Colin Cherry
Yu Zhang
Alexis Conneau
Nobuyuki Morioka
234
24
0
24 Mar 2022
XTREME-S: Evaluating Cross-lingual Speech Representations
Interspeech (Interspeech), 2022
Alexis Conneau
Ankur Bapna
Yu Zhang
Min Ma
Patrick von Platen
...
Orhan Firat
Michael Auli
Sebastian Ruder
Jason Riesa
Melvin Johnson
VLM
AILaw
ELM
272
23
0
21 Mar 2022
Visual Speech Recognition for Multiple Languages in the Wild
Nature Machine Intelligence (Nat. Mach. Intell.), 2022
Pingchuan Ma
Stavros Petridis
Maja Pantic
VLM
416
194
0
26 Feb 2022
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation
The Speaker and Language Recognition Workshop (Odyssey), 2022
Hemlata Tak
Massimiliano Todisco
Xin Wang
Jee-weon Jung
Junichi Yamagishi
Nicholas W. D. Evans
358
254
0
24 Feb 2022
Self-supervised Learning with Random-projection Quantizer for Speech Recognition
International Conference on Machine Learning (ICML), 2022
Chung-Cheng Chiu
James Qin
Yu Zhang
Jiahui Yu
Yonghui Wu
SSL
264
225
0
03 Feb 2022
mSLAM: Massively multilingual joint pre-training for speech and text
Ankur Bapna
Colin Cherry
Yu Zhang
Ye Jia
Melvin Johnson
Yong Cheng
Simran Khanuja
Jason Riesa
Alexis Conneau
VLM
187
122
0
03 Feb 2022
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
International Conference on Machine Learning (ICML), 2021
Edresson Casanova
Julian Weber
C. Shulby
Arnaldo Cândido Júnior
Eren Golge
M. Ponti
673
550
0
04 Dec 2021
The People's Speech: A Large-Scale Diverse English Speech Recognition Dataset for Commercial Usage
Daniel Galvez
G. Diamos
Juan Ciro
Juan Felipe Cerón
Keith Achorn
Anjali Gopi
David Kanter
Maximilian Lam
Mark Mazumder
Vijay Janapa Reddi
288
124
0
17 Nov 2021
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Arun Babu
Changhan Wang
Andros Tjandra
Kushal Lakhotia
Qiantong Xu
...
Yatharth Saraf
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
SSL
409
920
0
17 Nov 2021
Joint Unsupervised and Supervised Training for Multilingual ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Junwen Bai
Yue Liu
Yu Zhang
Ankur Bapna
Nikhil Siddhartha
K. Sim
Tara N. Sainath
225
64
0
15 Nov 2021
Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
Automatic Speech Recognition & Understanding (ASRU), 2021
Peter Wu
Jiatong Shi
Yifan Zhong
Shinji Watanabe
A. Black
166
8
0
02 Nov 2021
Lhotse: a speech data representation library for the modern deep learning ecosystem
Willem Hagemann
Daniel Povey
Jan "Yenda" Trmal
Sanjeev Khudanpur
AuLLM
AI4TS
198
44
0
25 Oct 2021
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Arnaldo Cândido Júnior
Edresson Casanova
A. S. Soares
F. S. Oliveira
L. Oliveira
...
Daniel Peixoto Pinto da Silva
Fernando Gorgulho Fayet
B. Carlotto
L. Gris
S. Aluísio
168
16
0
14 Oct 2021
Advancing the dimensionality reduction of speaker embeddings for speaker diarisation: disentangling noise and informing speech activity
You Jin Kim
Hee-Soo Heo
Jee-weon Jung
Youngki Kwon
Bong-Jin Lee
Joon Son Chung
263
3
0
07 Oct 2021
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
408
290
0
07 Oct 2021
Building a Noisy Audio Dataset to Evaluate Machine Learning Approaches for Automatic Speech Recognition Systems
J. C. Duarte
S. Colcher
54
4
0
04 Oct 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
Jakob Poncelet
Hugo Van hamme
SSL
148
3
0
29 Sep 2021
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
Interspeech (Interspeech), 2021
Qiantong Xu
Alexei Baevski
Michael Auli
VLM
346
120
0
23 Sep 2021
Influence of ASR and Language Model on Alzheimer's Disease Detection
Joan Codina-Filbà
Guillermo Cámbara
Jordi Luque
Mireia Farrús
100
2
0
20 Sep 2021
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0
International Conference on Computational Processing of the Portuguese Language (PROPOR), 2021
L. Gris
Edresson Casanova
F. S. Oliveira
A. S. Soares
A. Júnior
168
22
0
23 Jul 2021
CarneliNet: Neural Mixture Model for Automatic Speech Recognition
A. Kalinov
Somshubra Majumdar
Jagadeesh Balam
Boris Ginsburg
MoE
114
3
0
22 Jul 2021
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task
International Workshop on Spoken Language Translation (IWSLT), 2021
Yun Tang
Hongyu Gong
Xian Li
Changhan Wang
J. Pino
Holger Schwenk
Naman Goyal
184
11
0
14 Jul 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
349
435
0
29 Jun 2021
HUI-Audio-Corpus-German: A high quality TTS dataset
Deutsche Jahrestagung für Künstliche Intelligenz (KI), 2021
Pascal Puchtler
Johannes Wirth
René Peinl
94
28
0
11 Jun 2021
Unsupervised Speech Recognition
Neural Information Processing Systems (NeurIPS), 2021
Alexei Baevski
Wei-Ning Hsu
Alexis Conneau
Michael Auli
SSL
430
293
0
24 May 2021
Including Signed Languages in Natural Language Processing
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Kayo Yin
Amit Moryossef
J. Hochgesang
Yoav Goldberg
Malihe Alikhani
207
131
0
11 May 2021
English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System
Guillermo Cámbara
Alex Peiró Lilja
Mireia Farrús
Jordi Luque
82
3
0
09 May 2021
Scaling End-to-End Models for Large-Scale Multilingual ASR
Automatic Speech Recognition & Understanding (ASRU), 2021
Yue Liu
Ruoming Pang
Tara N. Sainath
Anmol Gulati
Yu Zhang
James Qin
Parisa Haghani
Wenjie Huang
Min Ma
Junwen Bai
CLL
407
83
0
30 Apr 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
Interspeech (Interspeech), 2021
Solène Evain
H. Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
SSL
239
71
0
23 Apr 2021
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems
Journal of Artificial Intelligence Research (JAIR), 2021
E. Razumovskaia
Goran Glavaš
Olga Majewska
Edoardo Ponti
Anna Korhonen
Ivan Vulić
510
38
0
17 Apr 2021
A Toolbox for Construction and Analysis of Speech Datasets
Evelina Bakhturina
Vitaly Lavrukhin
Boris Ginsburg
153
13
0
11 Apr 2021
HMM-Free Encoder Pre-Training for Streaming RNN Transducer
Interspeech (Interspeech), 2021
Lu Huang
J. Sun
Yu Tang
Junfeng Hou
Jinkun Chen
Jun Zhang
Zejun Ma
167
3
0
02 Apr 2021
MediaSpeech: Multilanguage ASR Benchmark and Dataset
Rostislav Kolobov
Olga Okhapkina
Olga Omelchishina
A. Platunov
Roman Bedyakin
V. Moshkin
Dmitry Menshikov
N. Mikhaylovskiy
123
29
0
30 Mar 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Changhan Wang
M. Rivière
Ann Lee
Anne Wu
Chaitanya Talnikar
Daniel Haziza
Mary Williamson
J. Pino
Emmanuel Dupoux
SSL
602
626
0
02 Jan 2021
Neural Representations for Modeling Variation in Speech
Martijn Bartelds
Wietse de Vries
Faraz Sanal
Caitlin Richter
M. Liberman
Martijn B. Wieling
SSL
DRL
197
29
0
25 Nov 2020
Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech to Standard German Text Corpus
Michel Plüss
Lukas Neukom
Christian Scheller
Manfred Vogel
AILaw
142
29
0
06 Oct 2020
Unsupervised Cross-lingual Representation Learning for Speech Recognition
Interspeech (Interspeech), 2020
Alexis Conneau
Alexei Baevski
R. Collobert
Abdel-rahman Mohamed
Michael Auli
SSL
392
923
0
24 Jun 2020
TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese
Edresson Casanova
A. Júnior
C. Shulby
F. S. Oliveira
João Paulo Teixeira
M. Ponti
S. Aluísio
226
24
0
11 May 2020
Previous
1
2
3
4
5
6
7
8
Page 8 of 8