v1v2 (latest)

Iterative Pseudo-Labeling for Speech Recognition

19 May 2020

Papers citing "Iterative Pseudo-Labeling for Speech Recognition"

50 / 84 papers shown

BiRQ: Bi-Level Self-Labeling Random Quantization for Self-Supervised Speech Recognition

182

18 Sep 2025

Pitch Accent Detection improves Pretrained Automatic Speech Recognition

David Sasu

Natalie Schluter

06 Aug 2025

Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering

...

345

05 Jun 2025

Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual InputsNeural Information Processing Systems (NeurIPS), 2024

431

04 Nov 2024

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-LabelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Ahmed Hussen Abdelaziz

Shinji Watanabe

Tatiana Likhomanenko

B. Theobald

VLM SSL

323

16 Sep 2024

Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data

Hung-yi Lee

376

15 Jul 2024

Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer

Tahira Shehzadi

Ifza

Didier Stricker

Muhammad Zeshan Afzal

ViT

460

11 Jul 2024

Token-Weighted RNN-T for Learning from Flawed Data

Gil Keren

Wei Zhou

Ozlem Kalinli

366

26 Jun 2024

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement

...

652

17 Jun 2024

Revisiting ASR Error Correction with Specialized Models

319

24 May 2024

A Large-Scale Evaluation of Speech Foundation Models

...

Shinji Watanabe

Hung-yi Lee

336

15 Apr 2024

Mai Hoómāuna i ka Ái: Language Models Improve Automatic Speech Recognition in Hawaiian

182

03 Apr 2024

Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition

Quazi Sarwar Muhtaseem

Md. Tariqul Islam

Shammur A. Chowdhury

Firoj Alam

332

06 Nov 2023

AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition

287

29 Sep 2023

Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks

Sizhou Chen

Songyang Gao

Sen Fang

304

14 Sep 2023

LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-trainingBritish Machine Vision Conference (BMVC), 2023

469

22 Aug 2023

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Pengyuan Zhang

306

12 Aug 2023

A Novel Self-training Approach for Low-resource Speech RecognitionInterspeech (Interspeech), 2023

Satwinder Singh

Feng Hou

Ruili Wang

253

10 Aug 2023

How to Scale Your EMANeural Information Processing Systems (NeurIPS), 2023

Dan Busbridge

Jason Ramapuram

Pierre Ablin

Tatiana Likhomanenko

Eeshan Gunesh Dhekane

Xavier Suau

Russ Webb

342

25 Jul 2023

Scalable and Weakly Supervised Bank Transaction Classification

149

28 May 2023

Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers

296

23 May 2023

Unsupervised ASR via Cross-Lingual Pseudo-Labeling

Tatiana Likhomanenko

Loren Lugosch

R. Collobert

358

19 May 2023

Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data AugmentationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Dan Jurafsky

289

18 May 2023

Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition

Xiaoyu Yang

Qiujia Li

Chuxu Zhang

P. Woodland

237

20 Mar 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

...

534

370

02 Mar 2023

MAC: A unified framework boosting low resource automatic speech recognition

365

05 Feb 2023

Joint Speech Transcription and Translation: Pseudo-Labeling with Out-of-Distribution DataAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

281

20 Dec 2022

TriNet: stabilizing self-supervised learning from complete or slow collapse on ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

175

12 Dec 2022

Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit

Jianqing Gao

257

07 Dec 2022

Progressive Multi-Scale Self-Supervised Learning for Speech RecognitionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2022

222

07 Dec 2022

EURO: ESPnet Unsupervised ASR Open-source ToolkitIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Dongji Gao

Jiatong Shi

Shun-Po Chuang

Leibny Paola García-Perera

Hung-yi Lee

Shinji Watanabe

Sanjeev Khudanpur

273

30 Nov 2022

Continuous Soft Pseudo-Labeling in ASR

365

11 Nov 2022

More Speaking or More Speakers?IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

286

02 Nov 2022

InterMPL: Momentum Pseudo-Labeling with Intermediate CTC LossIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

309

02 Nov 2022

Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition

179

28 Oct 2022

Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and TranslationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

362

27 Oct 2022

Continuous Pseudo-Labeling from the StartInternational Conference on Learning Representations (ICLR), 2022

296

17 Oct 2022

Unsupervised domain adaptation for speech recognition with unsupervised error correctionInterspeech (Interspeech), 2022

Long Mai

Julie Carson-Berndsen

341

24 Sep 2022

Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational EnvironmentsInterspeech (Interspeech), 2022

150

15 Jul 2022

Supervision-Guided Codebooks for Masked Prediction in Speech Pre-trainingInterspeech (Interspeech), 2022

265

21 Jun 2022

Boosting Cross-Domain Speech Recognition with Self-SupervisionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Pengyuan Zhang

408

20 Jun 2022

Decoupled Federated Learning for ASR with Non-IID DataInterspeech (Interspeech), 2022

Pengyuan Zhang

267

18 Jun 2022

Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-trainingInterspeech (Interspeech), 2022

272

16 Jun 2022

Self-Supervised Speech Representation Learning: A ReviewIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

Abdel-rahman Mohamed

Hung-yi Lee

Lasse Borgholt

Jakob Drachmann Havtorn

...

789

475

21 May 2022

Towards End-to-end Unsupervised Speech RecognitionSpoken Language Technology Workshop (SLT), 2022

266

05 Apr 2022

A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition

192

05 Apr 2022

Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility AssessmentInterspeech (Interspeech), 2022

369

29 Mar 2022

RemixIT: Continual self-training of speech enhancement models via bootstrapped remixingIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

Yossi Adi

322

17 Feb 2022

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and LanguageInternational Conference on Machine Learning (ICML), 2022

876

1,100

07 Feb 2022

SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-TrainingInternational Conference on Learning Representations (ICLR), 2022

Wenyong Huang

Zhenhe Zhang

Y. Yeung

Xin Jiang

Qun Liu

333

25 Jan 2022