ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.03411
  4. Cited By
MLS: A Large-Scale Multilingual Dataset for Speech Research
v1v2 (latest)

MLS: A Large-Scale Multilingual Dataset for Speech Research

Interspeech (Interspeech), 2020
7 December 2020
Vineel Pratap
Qiantong Xu
Anuroop Sriram
Gabriel Synnaeve
R. Collobert
    AuLLM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "MLS: A Large-Scale Multilingual Dataset for Speech Research"

40 / 390 papers shown
ASR data augmentation in low-resource settings using cross-lingual
  multi-speaker TTS and cross-lingual voice conversion
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversionInterspeech (Interspeech), 2022
Edresson Casanova
C. Shulby
Alexander Korolev
Arnaldo Cândido Júnior
A. S. Soares
S. Aluísio
M. Ponti
318
16
0
29 Mar 2022
Analyzing Language-Independent Speaker Anonymization Framework under
  Unseen Conditions
Analyzing Language-Independent Speaker Anonymization Framework under Unseen ConditionsInterspeech (Interspeech), 2022
Xiaoxiao Miao
Xin Wang
Erica Cooper
Junichi Yamagishi
N. Tomashenko
140
16
0
28 Mar 2022
Leveraging unsupervised and weakly-supervised data to improve direct
  speech-to-speech translation
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translationInterspeech (Interspeech), 2022
Ye Jia
Yifan Ding
Ankur Bapna
Colin Cherry
Yu Zhang
Alexis Conneau
Nobuyuki Morioka
234
24
0
24 Mar 2022
XTREME-S: Evaluating Cross-lingual Speech Representations
XTREME-S: Evaluating Cross-lingual Speech RepresentationsInterspeech (Interspeech), 2022
Alexis Conneau
Ankur Bapna
Yu Zhang
Min Ma
Patrick von Platen
...
Orhan Firat
Michael Auli
Sebastian Ruder
Jason Riesa
Melvin Johnson
VLMAILawELM
272
23
0
21 Mar 2022
Visual Speech Recognition for Multiple Languages in the Wild
Visual Speech Recognition for Multiple Languages in the WildNature Machine Intelligence (Nat. Mach. Intell.), 2022
Pingchuan Ma
Stavros Petridis
Maja Pantic
VLM
416
194
0
26 Feb 2022
Automatic speaker verification spoofing and deepfake detection using
  wav2vec 2.0 and data augmentation
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentationThe Speaker and Language Recognition Workshop (Odyssey), 2022
Hemlata Tak
Massimiliano Todisco
Xin Wang
Jee-weon Jung
Junichi Yamagishi
Nicholas W. D. Evans
358
254
0
24 Feb 2022
Self-supervised Learning with Random-projection Quantizer for Speech
  Recognition
Self-supervised Learning with Random-projection Quantizer for Speech RecognitionInternational Conference on Machine Learning (ICML), 2022
Chung-Cheng Chiu
James Qin
Yu Zhang
Jiahui Yu
Yonghui Wu
SSL
264
225
0
03 Feb 2022
mSLAM: Massively multilingual joint pre-training for speech and text
mSLAM: Massively multilingual joint pre-training for speech and text
Ankur Bapna
Colin Cherry
Yu Zhang
Ye Jia
Melvin Johnson
Yong Cheng
Simran Khanuja
Jason Riesa
Alexis Conneau
VLM
187
122
0
03 Feb 2022
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice
  Conversion for everyone
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyoneInternational Conference on Machine Learning (ICML), 2021
Edresson Casanova
Julian Weber
C. Shulby
Arnaldo Cândido Júnior
Eren Golge
M. Ponti
673
550
0
04 Dec 2021
The People's Speech: A Large-Scale Diverse English Speech Recognition
  Dataset for Commercial Usage
The People's Speech: A Large-Scale Diverse English Speech Recognition Dataset for Commercial Usage
Daniel Galvez
G. Diamos
Juan Ciro
Juan Felipe Cerón
Keith Achorn
Anjali Gopi
David Kanter
Maximilian Lam
Mark Mazumder
Vijay Janapa Reddi
288
124
0
17 Nov 2021
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at
  Scale
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
Arun Babu
Changhan Wang
Andros Tjandra
Kushal Lakhotia
Qiantong Xu
...
Yatharth Saraf
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
SSL
409
920
0
17 Nov 2021
Joint Unsupervised and Supervised Training for Multilingual ASR
Joint Unsupervised and Supervised Training for Multilingual ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Junwen Bai
Yue Liu
Yu Zhang
Ankur Bapna
Nikhil Siddhartha
K. Sim
Tara N. Sainath
225
64
0
15 Nov 2021
Cross-lingual Transfer for Speech Processing using Acoustic Language
  Similarity
Cross-lingual Transfer for Speech Processing using Acoustic Language SimilarityAutomatic Speech Recognition & Understanding (ASRU), 2021
Peter Wu
Jiatong Shi
Yifan Zhong
Shinji Watanabe
A. Black
166
8
0
02 Nov 2021
Lhotse: a speech data representation library for the modern deep
  learning ecosystem
Lhotse: a speech data representation library for the modern deep learning ecosystem
Willem Hagemann
Daniel Povey
Jan "Yenda" Trmal
Sanjeev Khudanpur
AuLLMAI4TS
198
44
0
25 Oct 2021
CORAA: a large corpus of spontaneous and prepared speech manually
  validated for speech recognition in Brazilian Portuguese
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese
Arnaldo Cândido Júnior
Edresson Casanova
A. S. Soares
F. S. Oliveira
L. Oliveira
...
Daniel Peixoto Pinto da Silva
Fernando Gorgulho Fayet
B. Carlotto
L. Gris
S. Aluísio
168
16
0
14 Oct 2021
Advancing the dimensionality reduction of speaker embeddings for speaker
  diarisation: disentangling noise and informing speech activity
Advancing the dimensionality reduction of speaker embeddings for speaker diarisation: disentangling noise and informing speech activity
You Jin Kim
Hee-Soo Heo
Jee-weon Jung
Youngki Kwon
Bong-Jin Lee
Joon Son Chung
263
3
0
07 Oct 2021
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech
  Recognition
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition
Binbin Zhang
Hang Lv
Pengcheng Guo
Qijie Shao
Chao Yang
...
Hui Bu
Xiaoyu Chen
Chenchen Zeng
Di Wu
Zhendong Peng
408
290
0
07 Oct 2021
Building a Noisy Audio Dataset to Evaluate Machine Learning Approaches
  for Automatic Speech Recognition Systems
Building a Noisy Audio Dataset to Evaluate Machine Learning Approaches for Automatic Speech Recognition Systems
J. C. Duarte
S. Colcher
54
4
0
04 Oct 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish
  Dutch
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
Jakob Poncelet
Hugo Van hamme
SSL
148
3
0
29 Sep 2021
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
Simple and Effective Zero-shot Cross-lingual Phoneme RecognitionInterspeech (Interspeech), 2021
Qiantong Xu
Alexei Baevski
Michael Auli
VLM
346
120
0
23 Sep 2021
Influence of ASR and Language Model on Alzheimer's Disease Detection
Influence of ASR and Language Model on Alzheimer's Disease Detection
Joan Codina-Filbà
Guillermo Cámbara
Jordi Luque
Mireia Farrús
100
2
0
20 Sep 2021
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0International Conference on Computational Processing of the Portuguese Language (PROPOR), 2021
L. Gris
Edresson Casanova
F. S. Oliveira
A. S. Soares
A. Júnior
168
22
0
23 Jul 2021
CarneliNet: Neural Mixture Model for Automatic Speech Recognition
CarneliNet: Neural Mixture Model for Automatic Speech Recognition
A. Kalinov
Somshubra Majumdar
Jagadeesh Balam
Boris Ginsburg
MoE
114
3
0
22 Jul 2021
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual
  Shared Task
FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared TaskInternational Workshop on Spoken Language Translation (IWSLT), 2021
Yun Tang
Hongyu Gong
Xian Li
Changhan Wang
J. Pino
Holger Schwenk
Naman Goyal
184
11
0
14 Jul 2021
A Survey on Neural Speech Synthesis
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
349
435
0
29 Jun 2021
HUI-Audio-Corpus-German: A high quality TTS dataset
HUI-Audio-Corpus-German: A high quality TTS datasetDeutsche Jahrestagung für Künstliche Intelligenz (KI), 2021
Pascal Puchtler
Johannes Wirth
René Peinl
94
28
0
11 Jun 2021
Unsupervised Speech Recognition
Unsupervised Speech RecognitionNeural Information Processing Systems (NeurIPS), 2021
Alexei Baevski
Wei-Ning Hsu
Alexis Conneau
Michael Auli
SSL
430
293
0
24 May 2021
Including Signed Languages in Natural Language Processing
Including Signed Languages in Natural Language ProcessingAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Kayo Yin
Amit Moryossef
J. Hochgesang
Yoav Goldberg
Malihe Alikhani
207
131
0
11 May 2021
English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech
  Recognition System
English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System
Guillermo Cámbara
Alex Peiró Lilja
Mireia Farrús
Jordi Luque
82
3
0
09 May 2021
Scaling End-to-End Models for Large-Scale Multilingual ASR
Scaling End-to-End Models for Large-Scale Multilingual ASRAutomatic Speech Recognition & Understanding (ASRU), 2021
Yue Liu
Ruoming Pang
Tara N. Sainath
Anmol Gulati
Yu Zhang
James Qin
Parisa Haghani
Wenjie Huang
Min Ma
Junwen Bai
CLL
407
83
0
30 Apr 2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised
  Representation Learning from Speech
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from SpeechInterspeech (Interspeech), 2021
Solène Evain
H. Nguyen
Hang Le
Marcely Zanon Boito
Salima Mdhaffar
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
SSL
239
71
0
23 Apr 2021
Crossing the Conversational Chasm: A Primer on Natural Language
  Processing for Multilingual Task-Oriented Dialogue Systems
Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue SystemsJournal of Artificial Intelligence Research (JAIR), 2021
E. Razumovskaia
Goran Glavaš
Olga Majewska
Edoardo Ponti
Anna Korhonen
Ivan Vulić
510
38
0
17 Apr 2021
A Toolbox for Construction and Analysis of Speech Datasets
A Toolbox for Construction and Analysis of Speech Datasets
Evelina Bakhturina
Vitaly Lavrukhin
Boris Ginsburg
153
13
0
11 Apr 2021
HMM-Free Encoder Pre-Training for Streaming RNN Transducer
HMM-Free Encoder Pre-Training for Streaming RNN TransducerInterspeech (Interspeech), 2021
Lu Huang
J. Sun
Yu Tang
Junfeng Hou
Jinkun Chen
Jun Zhang
Zejun Ma
167
3
0
02 Apr 2021
MediaSpeech: Multilanguage ASR Benchmark and Dataset
MediaSpeech: Multilanguage ASR Benchmark and Dataset
Rostislav Kolobov
Olga Okhapkina
Olga Omelchishina
A. Platunov
Roman Bedyakin
V. Moshkin
Dmitry Menshikov
N. Mikhaylovskiy
123
29
0
30 Mar 2021
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation
  Learning, Semi-Supervised Learning and Interpretation
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and InterpretationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Changhan Wang
M. Rivière
Ann Lee
Anne Wu
Chaitanya Talnikar
Daniel Haziza
Mary Williamson
J. Pino
Emmanuel Dupoux
SSL
602
626
0
02 Jan 2021
Neural Representations for Modeling Variation in Speech
Neural Representations for Modeling Variation in Speech
Martijn Bartelds
Wietse de Vries
Faraz Sanal
Caitlin Richter
M. Liberman
Martijn B. Wieling
SSLDRL
197
29
0
25 Nov 2020
Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech
  to Standard German Text Corpus
Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech to Standard German Text Corpus
Michel Plüss
Lukas Neukom
Christian Scheller
Manfred Vogel
AILaw
142
29
0
06 Oct 2020
Unsupervised Cross-lingual Representation Learning for Speech
  Recognition
Unsupervised Cross-lingual Representation Learning for Speech RecognitionInterspeech (Interspeech), 2020
Alexis Conneau
Alexei Baevski
R. Collobert
Abdel-rahman Mohamed
Michael Auli
SSL
392
923
0
24 Jun 2020
TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian
  Portuguese
TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese
Edresson Casanova
A. Júnior
C. Shulby
F. S. Oliveira
João Paulo Teixeira
M. Ponti
S. Aluísio
226
24
0
11 May 2020
Previous
12345678
Page 8 of 8