Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2110.03098
Cited By
v1
v2
v3 (latest)
CTC Variations Through New WFST Topologies
6 October 2021
A. Laptev
Somshubra Majumdar
Boris Ginsburg
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1332★)
Papers citing
"CTC Variations Through New WFST Topologies"
15 / 15 papers shown
Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction
Christophe Van Gysel
Maggie Wu
Lyan Verwimp
Caglar Tirkaz
Marco Bertola
Zhihong Lei
Youssef Oualil
199
0
0
06 Jun 2025
Enhancing GOP in CTC-Based Mispronunciation Detection with Phonological Knowledge
Aditya Kamlesh Parikh
Cristian Tejedor-García
C. Cucchiarini
H. Strik
324
1
0
02 Jun 2025
RNN-Transducer-based Losses for Speech Recognition on Noisy Targets
Vladimir Bataev
449
1
0
09 Apr 2025
GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition
Daniel Galvez
Tim Kaldewey
276
5
0
08 Nov 2023
Learning from Flawed Data: Weakly Supervised Automatic Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2023
Dongji Gao
Hainan Xu
Desh Raj
Leibny Paola García Perera
Daniel Povey
Sanjeev Khudanpur
246
7
0
26 Sep 2023
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Hanjing Zhu
Dongji Gao
Gaofeng Cheng
Daniel Povey
Pengyuan Zhang
Yonghong Yan
NoLa
314
12
0
12 Aug 2023
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts
Interspeech (Interspeech), 2023
Dongji Gao
Sanjeev Khudanpur
Hainan Xu
Leibny Paola García
Daniel Povey
Sanjeev Khudanpur
330
12
0
01 Jun 2023
Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling
Interspeech (Interspeech), 2023
Theodoros Kouzelis
Georgios Paraskevopoulos
Athanasios Katsamanis
Vassilis Katsouros
347
9
0
30 May 2023
Blank-regularized CTC for Frame Skipping in Neural Transducer
Interspeech (Interspeech), 2023
Yifan Yang
Xiaoyu Yang
Liyong Guo
Zengwei Yao
Wei Kang
Fangjun Kuang
Long Lin
Xie Chen
Daniel Povey
263
13
0
19 May 2023
DiffVoice: Text-to-Speech with Latent Diffusion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Zhijun Liu
Yiwei Guo
K. Yu
DiffM
229
27
0
23 Apr 2023
Powerful and Extensible WFST Framework for RNN-Transducer Losses
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
A. Laptev
Vladimir Bataev
Igor Gitman
Boris Ginsburg
348
6
0
18 Mar 2023
End-to-End Speech Recognition: A Survey
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schluter
Shinji Watanabe
VLM
362
276
0
03 Mar 2023
Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator
Interspeech (Interspeech), 2023
Vladimir Bataev
Roman Korostik
Evgeny Shabalin
Vitaly Lavrukhin
Boris Ginsburg
VLM
308
19
0
27 Feb 2023
Blank Collapse: Compressing CTC emission for the faster decoding
Interspeech (Interspeech), 2022
Minkyu Jung
Ohhyeok Kwon
S. Seo
Soonshin Seo
339
4
0
31 Oct 2022
Star Temporal Classification: Sequence Classification with Partially Labeled Data
Vineel Pratap
Awni Y. Hannun
Gabriel Synnaeve
R. Collobert
224
10
0
28 Jan 2022
1
Page 1 of 1