v1v2v3 (latest)

CTC Variations Through New WFST Topologies

6 October 2021

A. Laptev

Somshubra Majumdar

Boris Ginsburg

ArXiv (abs)PDF HTML Github (1332★)

Papers citing "CTC Variations Through New WFST Topologies"

15 / 15 papers shown

Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction

199

06 Jun 2025

Enhancing GOP in CTC-Based Mispronunciation Detection with Phonological Knowledge

Aditya Kamlesh Parikh

Cristian Tejedor-García

C. Cucchiarini

H. Strik

324

02 Jun 2025

RNN-Transducer-based Losses for Speech Recognition on Noisy Targets

Vladimir Bataev

449

09 Apr 2025

GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition

Daniel Galvez

Tim Kaldewey

276

08 Nov 2023

Learning from Flawed Data: Weakly Supervised Automatic Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2023

Dongji Gao

Hainan Xu

Desh Raj

Leibny Paola García Perera

Daniel Povey

Sanjeev Khudanpur

246

26 Sep 2023

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Pengyuan Zhang

314

12 Aug 2023

Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect TranscriptsInterspeech (Interspeech), 2023

Hainan Xu

Sanjeev Khudanpur

330

01 Jun 2023

Weakly-supervised forced alignment of disfluent speech using phoneme-level modelingInterspeech (Interspeech), 2023

Theodoros Kouzelis

Georgios Paraskevopoulos

Athanasios Katsamanis

Vassilis Katsouros

347

30 May 2023

Blank-regularized CTC for Frame Skipping in Neural TransducerInterspeech (Interspeech), 2023

Yifan Yang

Xiaoyu Yang

Liyong Guo

Zengwei Yao

Wei Kang

Fangjun Kuang

Long Lin

Xie Chen

Daniel Povey

263

19 May 2023

DiffVoice: Text-to-Speech with Latent DiffusionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Zhijun Liu

Yiwei Guo

K. Yu

DiffM

229

23 Apr 2023

Powerful and Extensible WFST Framework for RNN-Transducer LossesIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

A. Laptev

Vladimir Bataev

Igor Gitman

Boris Ginsburg

348

18 Mar 2023

End-to-End Speech Recognition: A SurveyIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

362

276

03 Mar 2023

Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generatorInterspeech (Interspeech), 2023

Boris Ginsburg

308

27 Feb 2023

Blank Collapse: Compressing CTC emission for the faster decodingInterspeech (Interspeech), 2022

339

31 Oct 2022

Star Temporal Classification: Sequence Classification with Partially Labeled Data

224

28 Jan 2022