Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition

16 June 2021

ArXiv (abs)PDF HTML Github

Papers citing "Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition"

36 / 36 papers shown

LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models Using in-the-wild Data

Wen Ding

Fan Qian

385

05 Jun 2025

CR-CTC: Consistency regularization on CTC for improved speech recognitionInternational Conference on Learning Representations (ICLR), 2024

477

17 Feb 2025

Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition

337

15 Dec 2024

Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual InputsNeural Information Processing Systems (NeurIPS), 2024

431

04 Nov 2024

Bridging the Gaps: Utilizing Unlabeled Face Recognition Datasets to Boost Semi-Supervised Facial Expression Recognition

289

23 Oct 2024

Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with WhisperConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Iuliia Thorbecke

Juan Zuluaga-Gomez

389

20 Sep 2024

Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data

395

22 Jul 2024

Token-Weighted RNN-T for Learning from Flawed Data

Gil Keren

Wei Zhou

Ozlem Kalinli

366

26 Jun 2024

Self-Train Before You Transcribe

Robert Flynn

Anton Ragni

327

17 Jun 2024

Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

Kevin Zhang

Luka Chkhetiani

Francis McCann Ramirez

...

216

10 Apr 2024

AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition

287

29 Sep 2023

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Pengyuan Zhang

307

12 Aug 2023

A Novel Self-training Approach for Low-resource Speech RecognitionInterspeech (Interspeech), 2023

Satwinder Singh

Feng Hou

Ruili Wang

253

10 Aug 2023

Unsupervised ASR via Cross-Lingual Pseudo-Labeling

Tatiana Likhomanenko

Loren Lugosch

R. Collobert

358

19 May 2023

Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition

Xiaoyu Yang

Qiujia Li

Chuxu Zhang

P. Woodland

237

20 Mar 2023

Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern GreekIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Georgios Paraskevopoulos

Theodoros Kouzelis

Georgios Rouvalis

Athanasios Katsamanis

Vassilis Katsouros

Alexandros Potamianos

VLM

357

31 Dec 2022

Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-DistillationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Jianqing Gao

296

06 Dec 2022

Continuous Soft Pseudo-Labeling in ASR

365

11 Nov 2022

More Speaking or More Speakers?IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

286

02 Nov 2022

InterMPL: Momentum Pseudo-Labeling with Intermediate CTC LossIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

310

02 Nov 2022

Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition

179

28 Oct 2022

Continuous Pseudo-Labeling from the StartInternational Conference on Learning Representations (ICLR), 2022

296

17 Oct 2022

Semi-supervised Vision Transformers at ScaleNeural Information Processing Systems (NeurIPS), 2022

321

11 Aug 2022

Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational EnvironmentsInterspeech (Interspeech), 2022

150

15 Jul 2022

Boosting Cross-Domain Speech Recognition with Self-SupervisionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Pengyuan Zhang

408

20 Jun 2022

Decoupled Federated Learning for ASR with Non-IID DataInterspeech (Interspeech), 2022

Pengyuan Zhang

267

18 Jun 2022

Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-trainingInterspeech (Interspeech), 2022

272

16 Jun 2022

Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-TranscribingInterspeech (Interspeech), 2022

362

14 May 2022

Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility AssessmentInterspeech (Interspeech), 2022

369

29 Mar 2022

RemixIT: Continual self-training of speech enhancement models via bootstrapped remixingIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022

Yossi Adi

322

17 Feb 2022

Pseudo-Labeling for Massively Multilingual Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

360

30 Oct 2021

Continual self-training with bootstrapped remixing for speech enhancement

Yossi Adi

353

19 Oct 2021

Word Order Does Not Matter For Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

291

12 Oct 2021

Advancing Momentum Pseudo-Labeling with Conformer and Initialization StrategyIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

213

11 Oct 2021

Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASRInterspeech (Interspeech), 2021

Pengyuan Zhang

295

09 Oct 2021

Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognitionAutomatic Speech Recognition & Understanding (ASRU), 2021

304

14 Jun 2021