Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2005.09267
Cited By
v1
v2 (latest)
Iterative Pseudo-Labeling for Speech Recognition
19 May 2020
Qiantong Xu
Tatiana Likhomanenko
Jacob Kahn
Awni Y. Hannun
Gabriel Synnaeve
R. Collobert
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Iterative Pseudo-Labeling for Speech Recognition"
50 / 84 papers shown
BiRQ: Bi-Level Self-Labeling Random Quantization for Self-Supervised Speech Recognition
Liuyuan Jiang
Xiaodong Cui
Brian Kingsbury
Tianyi Chen
Lisha Chen
SSL
182
0
0
18 Sep 2025
Pitch Accent Detection improves Pretrained Automatic Speech Recognition
David Sasu
Natalie Schluter
77
0
0
06 Aug 2025
Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
Andres Carofilis
Pradeep Rangappa
S. Madikeri
Shashi Kumar
Sergio Burdisso
...
Bidisha Sharma
Kadri Hacioğlu
Shankar Venkatesan
Saurabh Vyas
Andreas Stolcke
345
2
0
05 Jun 2025
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs
Neural Information Processing Systems (NeurIPS), 2024
A. Haliassos
Rodrigo Mira
Honglie Chen
Zoe Landgraf
Stavros Petridis
Maja Pantic
SSL
431
17
0
04 Nov 2024
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Zakaria Aldeneh
Takuya Higuchi
Jee-weon Jung
Li-Wei Chen
Stephen Shum
Ahmed Hussen Abdelaziz
Shinji Watanabe
Tatiana Likhomanenko
B. Theobald
VLM
SSL
323
4
0
16 Sep 2024
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data
Liang-Hsuan Tseng
Zih-Ching Chen
Wei-Shun Chang
Cheng-Kuang Lee
Tsung-Ren Huang
Hung-yi Lee
376
6
0
15 Jul 2024
Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer
Tahira Shehzadi
Ifza
Didier Stricker
Muhammad Zeshan Afzal
ViT
460
14
0
11 Jul 2024
Token-Weighted RNN-T for Learning from Flawed Data
Gil Keren
Wei Zhou
Ozlem Kalinli
366
1
0
26 Jun 2024
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement
Yifan Yang
Zheshu Song
Jianheng Zhuo
Mingyu Cui
Jinpeng Li
...
Shuai Fan
Kai Yu
Wei Zhang
Guoguo Chen
Xie Chen
652
42
0
17 Jun 2024
Revisiting ASR Error Correction with Specialized Models
Zijin Gu
Tatiana Likhomanenko
Richard He Bai
Erik McDermott
R. Collobert
Navdeep Jaitly
KELM
AuLLM
LRM
319
10
0
24 May 2024
A Large-Scale Evaluation of Speech Foundation Models
Shu-Wen Yang
Heng-Jui Chang
Zili Huang
Andy T. Liu
Cheng-I Jeff Lai
...
Kushal Lakhotia
Shang-Wen Li
Abdelrahman Mohamed
Shinji Watanabe
Hung-yi Lee
336
64
0
15 Apr 2024
Mai Hoómāuna i ka Ái: Language Models Improve Automatic Speech Recognition in Hawaiian
Kaavya Chaparala
Guido Zarrella
Bruce Torres Fischer
Larry Kimura
Oiwi Parker Jones
AuLLM
182
1
0
03 Apr 2024
Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition
R. N. Nandi
Mehadi Hasan Menon
Tareq Al Muntasir
Sagor Sarker
Quazi Sarwar Muhtaseem
Md. Tariqul Islam
Shammur A. Chowdhury
Firoj Alam
332
4
0
06 Nov 2023
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Andrew Rouditchenko
R. Collobert
Tatiana Likhomanenko
VLM
287
6
0
29 Sep 2023
Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks
Sizhou Chen
Songyang Gao
Sen Fang
304
0
0
14 Sep 2023
LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training
British Machine Vision Conference (BMVC), 2023
Silky Singh
Shripad Deshmukh
Mausoom Sarkar
Balaji Krishnamurthy
469
10
0
22 Aug 2023
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Hanjing Zhu
Dongji Gao
Gaofeng Cheng
Daniel Povey
Pengyuan Zhang
Yonghong Yan
NoLa
306
12
0
12 Aug 2023
A Novel Self-training Approach for Low-resource Speech Recognition
Interspeech (Interspeech), 2023
Satwinder Singh
Feng Hou
Ruili Wang
253
13
0
10 Aug 2023
How to Scale Your EMA
Neural Information Processing Systems (NeurIPS), 2023
Dan Busbridge
Jason Ramapuram
Pierre Ablin
Tatiana Likhomanenko
Eeshan Gunesh Dhekane
Xavier Suau
Russ Webb
342
26
0
25 Jul 2023
Scalable and Weakly Supervised Bank Transaction Classification
Liam Toran
Cory Van Der Walt
Alan Sammarone
Alex Keller
BDL
AI4TS
149
2
0
28 May 2023
Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
J. Silovský
Liuhui Deng
Arturo Argueta
Tresi Arvizo
Roger Hsiao
Sasha Kuznietsov
Yiu-Chang Lin
Xiaoqiang Xiao
Yuanyuan Zhang
296
3
0
23 May 2023
Unsupervised ASR via Cross-Lingual Pseudo-Labeling
Tatiana Likhomanenko
Loren Lugosch
R. Collobert
358
1
0
19 May 2023
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Martijn Bartelds
Nay San
Bradley McDonnell
Dan Jurafsky
Martijn B. Wieling
289
65
0
18 May 2023
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition
Xiaoyu Yang
Qiujia Li
Chuxu Zhang
P. Woodland
237
12
0
20 Mar 2023
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Yu Zhang
Wei Han
James Qin
Yongqiang Wang
Ankur Bapna
...
Pedro J. Moreno
Chung-Cheng Chiu
J. Schalkwyk
Franccoise Beaufays
Yonghui Wu
VLM
534
370
0
02 Mar 2023
MAC: A unified framework boosting low resource automatic speech recognition
Zeping Min
Qian Ge
Zhong Li
E. Weinan
365
1
0
05 Feb 2023
Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution Data
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Mozhdeh Gheini
Tatiana Likhomanenko
Matthias Sperber
Hendra Setiawan
281
7
0
20 Dec 2022
TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Lixin Cao
Jun Wang
Ben Yang
Jane Polak Scowcroft
Dong Yu
175
4
0
12 Dec 2022
Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit
Pengcheng Li
Genshun Wan
Fenglin Ding
Hang Chen
Jianqing Gao
Jia Pan
Cong Liu
SSL
257
1
0
07 Dec 2022
Progressive Multi-Scale Self-Supervised Learning for Speech Recognition
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022
Genshun Wan
Tan Liu
Hang Chen
Jia Pan
Cong Liu
Z. Ye
SSL
222
0
0
07 Dec 2022
EURO: ESPnet Unsupervised ASR Open-source Toolkit
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Dongji Gao
Jiatong Shi
Shun-Po Chuang
Leibny Paola García-Perera
Hung-yi Lee
Shinji Watanabe
Sanjeev Khudanpur
273
10
0
30 Nov 2022
Continuous Soft Pseudo-Labeling in ASR
Tatiana Likhomanenko
R. Collobert
Navdeep Jaitly
Samy Bengio
VLM
365
6
0
11 Nov 2022
More Speaking or More Speakers?
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Dan Berrebbi
R. Collobert
Navdeep Jaitly
Tatiana Likhomanenko
286
7
0
02 Nov 2022
InterMPL: Momentum Pseudo-Labeling with Intermediate CTC Loss
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yosuke Higuchi
Tetsuji Ogawa
Tetsunori Kobayashi
Shinji Watanabe
309
1
0
02 Nov 2022
Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition
Zezhong Jin
Dading Zhong
Xiao Song
Zhaoyi Liu
Naipeng Ye
Qingcheng Zeng
179
3
0
28 Oct 2022
Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
VLM
362
11
0
27 Oct 2022
Continuous Pseudo-Labeling from the Start
International Conference on Learning Representations (ICLR), 2022
Dan Berrebbi
R. Collobert
Samy Bengio
Navdeep Jaitly
Tatiana Likhomanenko
296
17
0
17 Oct 2022
Unsupervised domain adaptation for speech recognition with unsupervised error correction
Interspeech (Interspeech), 2022
Long Mai
Julie Carson-Berndsen
341
11
0
24 Sep 2022
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
Interspeech (Interspeech), 2022
Yicheng Du
Aditya Arie Nugraha
Kouhei Sekiguchi
Yoshiaki Bando
Mathieu Fontaine
Kazuyoshi Yoshii
150
0
0
15 Jul 2022
Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training
Interspeech (Interspeech), 2022
Chengyi Wang
Yiming Wang
Yu Wu
Sanyuan Chen
Jinyu Li
Shujie Liu
Furu Wei
SSL
265
21
0
21 Jun 2022
Boosting Cross-Domain Speech Recognition with Self-Supervision
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Hanjing Zhu
Gaofeng Cheng
Yongfeng Zhang
Wenxin Hou
Pengyuan Zhang
Yonghong Yan
408
24
0
20 Jun 2022
Decoupled Federated Learning for ASR with Non-IID Data
Interspeech (Interspeech), 2022
Hanjing Zhu
Yongfeng Zhang
Gaofeng Cheng
Pengyuan Zhang
Yonghong Yan
267
15
0
18 Jun 2022
Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Interspeech (Interspeech), 2022
Bowen Zhang
Songjun Cao
Xiaoming Zhang
Yike Zhang
Long Ma
T. Shinozaki
SSL
272
6
0
16 Jun 2022
Self-Supervised Speech Representation Learning: A Review
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL
AI4TS
789
475
0
21 May 2022
Towards End-to-end Unsupervised Speech Recognition
Spoken Language Technology Workshop (SLT), 2022
Alexander H. Liu
Wei-Ning Hsu
Michael Auli
Alexei Baevski
SSL
266
85
0
05 Apr 2022
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition
Ye Du
Jie Zhang
Qiu-shi Zhu
Lirong Dai
Ming Wu
Xin Fang
Zhouwang Yang
192
2
0
05 Apr 2022
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
Interspeech (Interspeech), 2022
Mu Yang
K. Hirschi
S. Looney
Okim Kang
John H. L. Hansen
369
23
0
29 Mar 2022
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing
IEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
322
72
0
17 Feb 2022
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
International Conference on Machine Learning (ICML), 2022
Alexei Baevski
Wei-Ning Hsu
Qiantong Xu
Arun Babu
Jiatao Gu
Michael Auli
SSL
VLM
ViT
876
1,100
0
07 Feb 2022
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training
International Conference on Learning Representations (ICLR), 2022
Wenyong Huang
Zhenhe Zhang
Y. Yeung
Xin Jiang
Qun Liu
333
28
0
25 Jan 2022
1
2
Next
Page 1 of 2