ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.09267
  4. Cited By
Iterative Pseudo-Labeling for Speech Recognition
v1v2 (latest)

Iterative Pseudo-Labeling for Speech Recognition

19 May 2020
Qiantong Xu
Tatiana Likhomanenko
Jacob Kahn
Awni Y. Hannun
Gabriel Synnaeve
R. Collobert
    VLM
ArXiv (abs)PDFHTML

Papers citing "Iterative Pseudo-Labeling for Speech Recognition"

50 / 83 papers shown
Title
Pitch Accent Detection improves Pretrained Automatic Speech Recognition
Pitch Accent Detection improves Pretrained Automatic Speech Recognition
David Sasu
Natalie Schluter
24
0
0
06 Aug 2025
Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering
Andres Carofilis
Pradeep Rangappa
S. Madikeri
Shashi Kumar
Sergio Burdisso
...
Bidisha Sharma
Kadri Hacioğlu
Shankar Venkatesan
Saurabh Vyas
Andreas Stolcke
180
0
0
05 Jun 2025
Unified Speech Recognition: A Single Model for Auditory, Visual, and
  Audiovisual Inputs
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs
A. Haliassos
Rodrigo Mira
Honglie Chen
Zoe Landgraf
Stavros Petridis
Maja Pantic
SSL
127
9
0
04 Nov 2024
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels
Zakaria Aldeneh
Takuya Higuchi
Jee-weon Jung
Li-Wei Chen
Stephen Shum
Ahmed Hussen Abdelaziz
Shinji Watanabe
Tatiana Likhomanenko
B. Theobald
VLMSSL
136
1
0
16 Sep 2024
Leave No Knowledge Behind During Knowledge Distillation: Towards
  Practical and Effective Knowledge Distillation for Code-Switching ASR Using
  Realistic Data
Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data
Liang-Hsuan Tseng
Zih-Ching Chen
Wei-Shun Chang
Cheng-Kuang Lee
Tsung-Ren Huang
Hung-yi Lee
132
5
0
15 Jul 2024
Semi-Supervised Object Detection: A Survey on Progress from CNN to
  Transformer
Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer
Tahira Shehzadi
Ifza
Didier Stricker
Muhammad Zeshan Afzal
ViT
156
1
0
11 Jul 2024
Token-Weighted RNN-T for Learning from Flawed Data
Token-Weighted RNN-T for Learning from Flawed Data
Gil Keren
Wei Zhou
Ozlem Kalinli
99
0
0
26 Jun 2024
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement
Yifan Yang
Zheshu Song
Jianheng Zhuo
Mingyu Cui
Jinpeng Li
...
Shuai Fan
Kai Yu
Wei Zhang
Guoguo Chen
Xie Chen
218
19
0
17 Jun 2024
Denoising LM: Pushing the Limits of Error Correction Models for Speech
  Recognition
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Zijin Gu
Tatiana Likhomanenko
Richard He Bai
Erik McDermott
R. Collobert
Navdeep Jaitly
AuLLM
130
7
0
24 May 2024
A Large-Scale Evaluation of Speech Foundation Models
A Large-Scale Evaluation of Speech Foundation Models
Shu-Wen Yang
Heng-Jui Chang
Zili Huang
Andy T. Liu
Cheng-I Jeff Lai
...
Kushal Lakhotia
Shang-Wen Li
Abdelrahman Mohamed
Shinji Watanabe
Hung-yi Lee
128
38
0
15 Apr 2024
Mai Hoómāuna i ka Ái: Language Models Improve Automatic Speech
  Recognition in Hawaiian
Mai Hoómāuna i ka Ái: Language Models Improve Automatic Speech Recognition in Hawaiian
Kaavya Chaparala
Guido Zarrella
Bruce Torres Fischer
Larry Kimura
Oiwi Parker Jones
AuLLM
77
0
0
03 Apr 2024
Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition
Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition
R. N. Nandi
Mehadi Hasan Menon
Tareq Al Muntasir
Sagor Sarker
Quazi Sarwar Muhtaseem
Md. Tariqul Islam
Shammur A. Chowdhury
Firoj Alam
111
3
0
06 Nov 2023
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Andrew Rouditchenko
R. Collobert
Tatiana Likhomanenko
VLM
118
3
0
29 Sep 2023
Echotune: A Modular Extractor Leveraging the Variable-Length Nature of
  Speech in ASR Tasks
Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks
Sizhou Chen
Songyang Gao
Sen Fang
62
0
0
14 Sep 2023
LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and
  Bootstrapped Self-training
LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training
Silky Singh
Shripad Deshmukh
Mausoom Sarkar
Balaji Krishnamurthy
129
10
0
22 Aug 2023
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech
  Recognition
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Hanjing Zhu
Dongji Gao
Gaofeng Cheng
Daniel Povey
Pengyuan Zhang
Yonghong Yan
NoLa
98
4
0
12 Aug 2023
A Novel Self-training Approach for Low-resource Speech Recognition
A Novel Self-training Approach for Low-resource Speech Recognition
Satwinder Singh
Feng Hou
Ruili Wang
98
11
0
10 Aug 2023
How to Scale Your EMA
How to Scale Your EMA
Dan Busbridge
Jason Ramapuram
Pierre Ablin
Tatiana Likhomanenko
Eeshan Gunesh Dhekane
Xavier Suau
Russ Webb
100
22
0
25 Jul 2023
Scalable and Weakly Supervised Bank Transaction Classification
Scalable and Weakly Supervised Bank Transaction Classification
Liam Toran
Cory Van Der Walt
Alan Sammarone
Alex Keller
BDLAI4TS
14
1
0
28 May 2023
Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for
  Low-Resource Speech Recognition with Transducers
Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
J. Silovský
Liuhui Deng
Arturo Argueta
Tresi Arvizo
Roger Hsiao
Sasha Kuznietsov
Yiu-Chang Lin
Xiaoqiang Xiao
Yuanyuan Zhang
85
3
0
23 May 2023
Unsupervised ASR via Cross-Lingual Pseudo-Labeling
Unsupervised ASR via Cross-Lingual Pseudo-Labeling
Tatiana Likhomanenko
Loren Lugosch
R. Collobert
119
0
0
19 May 2023
Making More of Little Data: Improving Low-Resource Automatic Speech
  Recognition Using Data Augmentation
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation
Martijn Bartelds
Nay San
Bradley McDonnell
Dan Jurafsky
Martijn B. Wieling
139
45
0
18 May 2023
Knowledge Distillation from Multiple Foundation Models for End-to-End
  Speech Recognition
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition
Xiaoyu Yang
Qiujia Li
Chuxu Zhang
P. Woodland
116
8
0
20 Mar 2023
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Yu Zhang
Wei Han
James Qin
Yongqiang Wang
Ankur Bapna
...
Pedro J. Moreno
Chung-Cheng Chiu
J. Schalkwyk
Franccoise Beaufays
Yonghui Wu
VLM
242
299
0
02 Mar 2023
MAC: A unified framework boosting low resource automatic speech
  recognition
MAC: A unified framework boosting low resource automatic speech recognition
Zeping Min
Qian Ge
Zhong Li
E. Weinan
141
1
0
05 Feb 2023
Joint Speech Transcription and Translation: Pseudo-Labeling with
  Out-of-Distribution Data
Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution Data
Mozhdeh Gheini
Tatiana Likhomanenko
Matthias Sperber
Hendra Setiawan
118
5
0
20 Dec 2022
TriNet: stabilizing self-supervised learning from complete or slow
  collapse on ASR
TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR
Lixin Cao
Jun Wang
Ben Yang
Jane Polak Scowcroft
Dong Yu
90
4
0
12 Dec 2022
Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit
Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit
Pengcheng Li
Genshun Wan
Fenglin Ding
Hang Chen
Jianqing Gao
Jia Pan
Cong Liu
SSL
85
1
0
07 Dec 2022
Progressive Multi-Scale Self-Supervised Learning for Speech Recognition
Progressive Multi-Scale Self-Supervised Learning for Speech Recognition
Genshun Wan
Tan Liu
Hang Chen
Jia Pan
Cong Liu
Z. Ye
SSL
61
0
0
07 Dec 2022
EURO: ESPnet Unsupervised ASR Open-source Toolkit
EURO: ESPnet Unsupervised ASR Open-source Toolkit
Dongji Gao
Jiatong Shi
Shun-Po Chuang
Leibny Paola García-Perera
Hung-yi Lee
Shinji Watanabe
Sanjeev Khudanpur
137
8
0
30 Nov 2022
Continuous Soft Pseudo-Labeling in ASR
Continuous Soft Pseudo-Labeling in ASR
Tatiana Likhomanenko
R. Collobert
Navdeep Jaitly
Samy Bengio
VLM
117
3
0
11 Nov 2022
More Speaking or More Speakers?
More Speaking or More Speakers?
Dan Berrebbi
R. Collobert
Navdeep Jaitly
Tatiana Likhomanenko
75
6
0
02 Nov 2022
InterMPL: Momentum Pseudo-Labeling with Intermediate CTC Loss
InterMPL: Momentum Pseudo-Labeling with Intermediate CTC Loss
Yosuke Higuchi
Tetsuji Ogawa
Tetsunori Kobayashi
Shinji Watanabe
109
1
0
02 Nov 2022
Filter and evolve: progressive pseudo label refining for semi-supervised
  automatic speech recognition
Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition
Zezhong Jin
Dading Zhong
Xiao Song
Zhaoyi Liu
Naipeng Ye
Qingcheng Zeng
97
2
0
28 Oct 2022
Make More of Your Data: Minimal Effort Data Augmentation for Automatic
  Speech Recognition and Translation
Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation
Tsz Kin Lam
Shigehiko Schamoni
Stefan Riezler
VLM
105
10
0
27 Oct 2022
Continuous Pseudo-Labeling from the Start
Continuous Pseudo-Labeling from the Start
Dan Berrebbi
R. Collobert
Samy Bengio
Navdeep Jaitly
Tatiana Likhomanenko
100
16
0
17 Oct 2022
Unsupervised domain adaptation for speech recognition with unsupervised
  error correction
Unsupervised domain adaptation for speech recognition with unsupervised error correction
Long Mai
Julie Carson-Berndsen
113
9
0
24 Sep 2022
Direction-Aware Joint Adaptation of Neural Speech Enhancement and
  Recognition in Real Multiparty Conversational Environments
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments
Yicheng Du
Aditya Arie Nugraha
Kouhei Sekiguchi
Yoshiaki Bando
Mathieu Fontaine
Kazuyoshi Yoshii
69
0
0
15 Jul 2022
Supervision-Guided Codebooks for Masked Prediction in Speech
  Pre-training
Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training
Chengyi Wang
Yiming Wang
Yu Wu
Sanyuan Chen
Jinyu Li
Shujie Liu
Furu Wei
SSL
119
20
0
21 Jun 2022
Boosting Cross-Domain Speech Recognition with Self-Supervision
Boosting Cross-Domain Speech Recognition with Self-Supervision
Hanjing Zhu
Gaofeng Cheng
Yongfeng Zhang
Wenxin Hou
Pengyuan Zhang
Yonghong Yan
133
17
0
20 Jun 2022
Decoupled Federated Learning for ASR with Non-IID Data
Decoupled Federated Learning for ASR with Non-IID Data
Hanjing Zhu
Yongfeng Zhang
Gaofeng Cheng
Pengyuan Zhang
Yonghong Yan
109
12
0
18 Jun 2022
Censer: Curriculum Semi-supervised Learning for Speech Recognition Based
  on Self-supervised Pre-training
Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training
Bowen Zhang
Songjun Cao
Xiaoming Zhang
Yike Zhang
Long Ma
T. Shinozaki
SSL
88
6
0
16 Jun 2022
Self-Supervised Speech Representation Learning: A Review
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSLAI4TS
358
395
0
21 May 2022
Towards End-to-end Unsupervised Speech Recognition
Towards End-to-end Unsupervised Speech Recognition
Alexander H. Liu
Wei-Ning Hsu
Michael Auli
Alexei Baevski
SSL
96
75
0
05 Apr 2022
A Complementary Joint Training Approach Using Unpaired Speech and Text
  for Low-Resource Automatic Speech Recognition
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition
Ye Du
Jie Zhang
Qiu-shi Zhu
Lirong Dai
Ming Wu
Xin Fang
Zhouwang Yang
75
2
0
05 Apr 2022
Improving Mispronunciation Detection with Wav2vec2-based Momentum
  Pseudo-Labeling for Accentedness and Intelligibility Assessment
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
Mu Yang
K. Hirschi
S. Looney
Okim Kang
John H. L. Hansen
118
17
0
29 Mar 2022
RemixIT: Continual self-training of speech enhancement models via
  bootstrapped remixing
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
90
58
0
17 Feb 2022
data2vec: A General Framework for Self-supervised Learning in Speech,
  Vision and Language
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Alexei Baevski
Wei-Ning Hsu
Qiantong Xu
Arun Babu
Jiatao Gu
Michael Auli
SSLVLMViT
222
919
0
07 Feb 2022
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning
  for Speech Pre-Training
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training
Wenyong Huang
Zhenhe Zhang
Y. Yeung
Xin Jiang
Qun Liu
142
24
0
25 Jan 2022
On the Use of External Data for Spoken Named Entity Recognition
On the Use of External Data for Spoken Named Entity Recognition
Ankita Pasad
Felix Wu
Suwon Shon
Karen Livescu
Kyu Jeong Han
95
16
0
14 Dec 2021
12
Next