ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.08922
  4. Cited By
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition

16 June 2021
Yosuke Higuchi
Niko Moritz
Jonathan Le Roux
Takaaki Hori
    VLM
ArXiv (abs)PDFHTMLGithub

Papers citing "Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition"

36 / 36 papers shown
LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models Using in-the-wild Data
LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models Using in-the-wild Data
Wen Ding
Fan Qian
385
1
0
05 Jun 2025
CR-CTC: Consistency regularization on CTC for improved speech recognition
CR-CTC: Consistency regularization on CTC for improved speech recognitionInternational Conference on Learning Representations (ICLR), 2024
Zengwei Yao
Wei Kang
Xiaoyu Yang
Fangjun Kuang
Liyong Guo
Han Zhu
Zengrui Jin
Zhaoqing Li
Long Lin
Daniel Povey
477
19
0
17 Feb 2025
Transliterated Zero-Shot Domain Adaptation for Automatic Speech
  Recognition
Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition
Han Zhu
Gaofeng Cheng
Qingwei Zhao
Pengyuan Zhang
VLM
337
0
0
15 Dec 2024
Unified Speech Recognition: A Single Model for Auditory, Visual, and
  Audiovisual Inputs
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual InputsNeural Information Processing Systems (NeurIPS), 2024
A. Haliassos
Rodrigo Mira
Honglie Chen
Zoe Landgraf
Stavros Petridis
Maja Pantic
SSL
431
17
0
04 Nov 2024
Bridging the Gaps: Utilizing Unlabeled Face Recognition Datasets to
  Boost Semi-Supervised Facial Expression Recognition
Bridging the Gaps: Utilizing Unlabeled Face Recognition Datasets to Boost Semi-Supervised Facial Expression Recognition
Jie Song
Mengqiao He
Jinhua Feng
Bo Shen
289
2
0
23 Oct 2024
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation
  with Whisper
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with WhisperConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Iuliia Thorbecke
Juan Zuluaga-Gomez
Esaú Villatoro-Tello
Shashi Kumar
Pradeep Rangappa
Sergio Burdisso
P. Motlícek
Karthik Pandia
A. Ganapathiraju
389
1
0
20 Sep 2024
Is user feedback always informative? Retrieval Latent Defending for
  Semi-Supervised Domain Adaptation without Source Data
Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data
Junha Song
Tae Soo Kim
Junha Kim
Gunhee Nam
Thijs Kooi
Jaegul Choo
395
4
0
22 Jul 2024
Token-Weighted RNN-T for Learning from Flawed Data
Token-Weighted RNN-T for Learning from Flawed Data
Gil Keren
Wei Zhou
Ozlem Kalinli
366
1
0
26 Jun 2024
Self-Train Before You Transcribe
Self-Train Before You Transcribe
Robert Flynn
Anton Ragni
327
0
0
17 Jun 2024
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping
Kevin Zhang
Luka Chkhetiani
Francis McCann Ramirez
Yash Khare
Andrea Vanzo
...
Ruben Bousbib
Taufiquzzaman Peyash
Michael Nguyen
Dillon Pulliam
Domenic Donato
216
5
0
10 Apr 2024
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Andrew Rouditchenko
R. Collobert
Tatiana Likhomanenko
VLM
287
6
0
29 Sep 2023
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech
  Recognition
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Hanjing Zhu
Dongji Gao
Gaofeng Cheng
Daniel Povey
Pengyuan Zhang
Yonghong Yan
NoLa
307
12
0
12 Aug 2023
A Novel Self-training Approach for Low-resource Speech Recognition
A Novel Self-training Approach for Low-resource Speech RecognitionInterspeech (Interspeech), 2023
Satwinder Singh
Feng Hou
Ruili Wang
253
13
0
10 Aug 2023
Unsupervised ASR via Cross-Lingual Pseudo-Labeling
Unsupervised ASR via Cross-Lingual Pseudo-Labeling
Tatiana Likhomanenko
Loren Lugosch
R. Collobert
358
1
0
19 May 2023
Knowledge Distillation from Multiple Foundation Models for End-to-End
  Speech Recognition
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition
Xiaoyu Yang
Qiujia Li
Chuxu Zhang
P. Woodland
237
12
0
20 Mar 2023
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition
  Systems A case study for Modern Greek
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern GreekIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Georgios Paraskevopoulos
Theodoros Kouzelis
Georgios Rouvalis
Athanasios Katsamanis
Vassilis Katsouros
Alexandros Potamianos
VLM
357
14
0
31 Dec 2022
Self-Supervised Audio-Visual Speech Representations Learning By
  Multimodal Self-Distillation
Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-DistillationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Jing-Xuan Zhang
Genshun Wan
Zhenhua Ling
Jia Pan
Jianqing Gao
Cong Liu
SSL
296
16
0
06 Dec 2022
Continuous Soft Pseudo-Labeling in ASR
Continuous Soft Pseudo-Labeling in ASR
Tatiana Likhomanenko
R. Collobert
Navdeep Jaitly
Samy Bengio
VLM
365
6
0
11 Nov 2022
More Speaking or More Speakers?
More Speaking or More Speakers?IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Dan Berrebbi
R. Collobert
Navdeep Jaitly
Tatiana Likhomanenko
286
7
0
02 Nov 2022
InterMPL: Momentum Pseudo-Labeling with Intermediate CTC Loss
InterMPL: Momentum Pseudo-Labeling with Intermediate CTC LossIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yosuke Higuchi
Tetsuji Ogawa
Tetsunori Kobayashi
Shinji Watanabe
310
1
0
02 Nov 2022
Filter and evolve: progressive pseudo label refining for semi-supervised
  automatic speech recognition
Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition
Zezhong Jin
Dading Zhong
Xiao Song
Zhaoyi Liu
Naipeng Ye
Qingcheng Zeng
179
3
0
28 Oct 2022
Continuous Pseudo-Labeling from the Start
Continuous Pseudo-Labeling from the StartInternational Conference on Learning Representations (ICLR), 2022
Dan Berrebbi
R. Collobert
Samy Bengio
Navdeep Jaitly
Tatiana Likhomanenko
296
17
0
17 Oct 2022
Semi-supervised Vision Transformers at Scale
Semi-supervised Vision Transformers at ScaleNeural Information Processing Systems (NeurIPS), 2022
Zhaowei Cai
Avinash Ravichandran
Paolo Favaro
Manchen Wang
Davide Modolo
Rahul Bhotika
Zhuowen Tu
Stefano Soatto
ViT
321
74
0
11 Aug 2022
Direction-Aware Joint Adaptation of Neural Speech Enhancement and
  Recognition in Real Multiparty Conversational Environments
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational EnvironmentsInterspeech (Interspeech), 2022
Yicheng Du
Aditya Arie Nugraha
Kouhei Sekiguchi
Yoshiaki Bando
Mathieu Fontaine
Kazuyoshi Yoshii
150
0
0
15 Jul 2022
Boosting Cross-Domain Speech Recognition with Self-Supervision
Boosting Cross-Domain Speech Recognition with Self-SupervisionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Hanjing Zhu
Gaofeng Cheng
Yongfeng Zhang
Wenxin Hou
Pengyuan Zhang
Yonghong Yan
408
24
0
20 Jun 2022
Decoupled Federated Learning for ASR with Non-IID Data
Decoupled Federated Learning for ASR with Non-IID DataInterspeech (Interspeech), 2022
Hanjing Zhu
Yongfeng Zhang
Gaofeng Cheng
Pengyuan Zhang
Yonghong Yan
267
15
0
18 Jun 2022
Censer: Curriculum Semi-supervised Learning for Speech Recognition Based
  on Self-supervised Pre-training
Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-trainingInterspeech (Interspeech), 2022
Bowen Zhang
Songjun Cao
Xiaoming Zhang
Yike Zhang
Long Ma
T. Shinozaki
SSL
272
6
0
16 Jun 2022
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence
  ASR via Speech Chain Reconstruction and Self-Transcribing
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-TranscribingInterspeech (Interspeech), 2022
Heli Qi
Sashi Novitasari
S. Sakti
Satoshi Nakamura
AI4TS
362
2
0
14 May 2022
Improving Mispronunciation Detection with Wav2vec2-based Momentum
  Pseudo-Labeling for Accentedness and Intelligibility Assessment
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility AssessmentInterspeech (Interspeech), 2022
Mu Yang
K. Hirschi
S. Looney
Okim Kang
John H. L. Hansen
369
23
0
29 Mar 2022
RemixIT: Continual self-training of speech enhancement models via
  bootstrapped remixing
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixingIEEE Journal on Selected Topics in Signal Processing (IEEE JSTSP), 2022
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Paris Smaragdis
Anurag Kumar
CLL
322
72
0
17 Feb 2022
Pseudo-Labeling for Massively Multilingual Speech Recognition
Pseudo-Labeling for Massively Multilingual Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Loren Lugosch
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
VLM
360
35
0
30 Oct 2021
Continual self-training with bootstrapped remixing for speech
  enhancement
Continual self-training with bootstrapped remixing for speech enhancement
Efthymios Tzinis
Yossi Adi
V. Ithapu
Buye Xu
Anurag Kumar
353
18
0
19 Oct 2021
Word Order Does Not Matter For Speech Recognition
Word Order Does Not Matter For Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Vineel Pratap
Qiantong Xu
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
291
4
0
12 Oct 2021
Advancing Momentum Pseudo-Labeling with Conformer and Initialization
  Strategy
Advancing Momentum Pseudo-Labeling with Conformer and Initialization StrategyIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yosuke Higuchi
Niko Moritz
Jonathan Le Roux
Takaaki Hori
213
13
0
11 Oct 2021
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASRInterspeech (Interspeech), 2021
Hanjing Zhu
Li Wang
Yongfeng Zhang
Gaofeng Cheng
Pengyuan Zhang
Yonghong Yan
SSLVLM
295
11
0
09 Oct 2021
Kaizen: Continuously improving teacher using Exponential Moving Average
  for semi-supervised speech recognition
Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognitionAutomatic Speech Recognition & Understanding (ASRU), 2021
Vimal Manohar
Tatiana Likhomanenko
Qiantong Xu
Wei-Ning Hsu
R. Collobert
Yatharth Saraf
Geoffrey Zweig
Abdel-rahman Mohamed
304
31
0
14 Jun 2021
1
Page 1 of 1