ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.03098
  4. Cited By
CTC Variations Through New WFST Topologies
v1v2v3 (latest)

CTC Variations Through New WFST Topologies

6 October 2021
A. Laptev
Somshubra Majumdar
Boris Ginsburg
ArXiv (abs)PDFHTMLGithub (1332★)

Papers citing "CTC Variations Through New WFST Topologies"

15 / 15 papers shown
Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction
Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction
Christophe Van Gysel
Maggie Wu
Lyan Verwimp
Caglar Tirkaz
Marco Bertola
Zhihong Lei
Youssef Oualil
199
0
0
06 Jun 2025
Enhancing GOP in CTC-Based Mispronunciation Detection with Phonological Knowledge
Enhancing GOP in CTC-Based Mispronunciation Detection with Phonological Knowledge
Aditya Kamlesh Parikh
Cristian Tejedor-García
C. Cucchiarini
H. Strik
324
1
0
02 Jun 2025
RNN-Transducer-based Losses for Speech Recognition on Noisy Targets
RNN-Transducer-based Losses for Speech Recognition on Noisy Targets
Vladimir Bataev
449
1
0
09 Apr 2025
GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech
  Recognition
GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition
Daniel Galvez
Tim Kaldewey
276
5
0
08 Nov 2023
Learning from Flawed Data: Weakly Supervised Automatic Speech
  Recognition
Learning from Flawed Data: Weakly Supervised Automatic Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2023
Dongji Gao
Hainan Xu
Desh Raj
Leibny Paola García Perera
Daniel Povey
Sanjeev Khudanpur
246
7
0
26 Sep 2023
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech
  Recognition
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Hanjing Zhu
Dongji Gao
Gaofeng Cheng
Daniel Povey
Pengyuan Zhang
Yonghong Yan
NoLa
314
12
0
12 Aug 2023
Bypass Temporal Classification: Weakly Supervised Automatic Speech
  Recognition with Imperfect Transcripts
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect TranscriptsInterspeech (Interspeech), 2023
Dongji Gao
Sanjeev Khudanpur
Hainan Xu
Leibny Paola García
Daniel Povey
Sanjeev Khudanpur
330
12
0
01 Jun 2023
Weakly-supervised forced alignment of disfluent speech using
  phoneme-level modeling
Weakly-supervised forced alignment of disfluent speech using phoneme-level modelingInterspeech (Interspeech), 2023
Theodoros Kouzelis
Georgios Paraskevopoulos
Athanasios Katsamanis
Vassilis Katsouros
347
9
0
30 May 2023
Blank-regularized CTC for Frame Skipping in Neural Transducer
Blank-regularized CTC for Frame Skipping in Neural TransducerInterspeech (Interspeech), 2023
Yifan Yang
Xiaoyu Yang
Liyong Guo
Zengwei Yao
Wei Kang
Fangjun Kuang
Long Lin
Xie Chen
Daniel Povey
263
13
0
19 May 2023
DiffVoice: Text-to-Speech with Latent Diffusion
DiffVoice: Text-to-Speech with Latent DiffusionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Zhijun Liu
Yiwei Guo
K. Yu
DiffM
229
27
0
23 Apr 2023
Powerful and Extensible WFST Framework for RNN-Transducer Losses
Powerful and Extensible WFST Framework for RNN-Transducer LossesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
A. Laptev
Vladimir Bataev
Igor Gitman
Boris Ginsburg
348
6
0
18 Mar 2023
End-to-End Speech Recognition: A Survey
End-to-End Speech Recognition: A SurveyIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schluter
Shinji Watanabe
VLM
362
276
0
03 Mar 2023
Text-only domain adaptation for end-to-end ASR using integrated
  text-to-mel-spectrogram generator
Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generatorInterspeech (Interspeech), 2023
Vladimir Bataev
Roman Korostik
Evgeny Shabalin
Vitaly Lavrukhin
Boris Ginsburg
VLM
308
19
0
27 Feb 2023
Blank Collapse: Compressing CTC emission for the faster decoding
Blank Collapse: Compressing CTC emission for the faster decodingInterspeech (Interspeech), 2022
Minkyu Jung
Ohhyeok Kwon
S. Seo
Soonshin Seo
339
4
0
31 Oct 2022
Star Temporal Classification: Sequence Classification with Partially
  Labeled Data
Star Temporal Classification: Sequence Classification with Partially Labeled Data
Vineel Pratap
Awni Y. Hannun
Gabriel Synnaeve
R. Collobert
224
10
0
28 Jan 2022
1
Page 1 of 1