Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR

9 November 2020

Jun Liu

Papers citing "Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR"

14 / 14 papers shown

Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model EstimationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

187

05 May 2023

Neural Transducer Training: Reduced Memory Consumption with Sample-wise ComputationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Stefan Braun

Erik McDermott

Roger Hsiao

173

29 Nov 2022

Anchored Speech Recognition with Neural TransducersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Ozlem Kalinli

294

20 Oct 2022

Learning a Dual-Mode Speech Recognition Model via Self-PruningSpoken Language Technology Workshop (SLT), 2022

Raghuraman Krishnamoorthi

Ozlem Kalinli

SSL

351

25 Jul 2022

Pruned RNN-T for fast, memory-efficient ASR trainingInterspeech (Interspeech), 2022

Fangjun Kuang

Liyong Guo

Wei Kang

Long Lin

Mingshuang Luo

Zengwei Yao

Daniel Povey

273

23 Jun 2022

Squeezeformer: An Efficient Transformer for Automatic Speech RecognitionNeural Information Processing Systems (NeurIPS), 2022

Sehoon Kim

Nicholas Lee

397

134

02 Jun 2022

Memory-Efficient Training of RNN-Transducer with Sampled SoftmaxInterspeech (Interspeech), 2022

Jaesong Lee

Lukas Lee

Shinji Watanabe

357

31 Mar 2022

Recent Advances in End-to-End Automatic Speech RecognitionAPSIPA Transactions on Signal and Information Processing (TASIP), 2021

Jinyu Li

VLM

498

440

02 Nov 2021

CTC Variations Through New WFST Topologies

A. Laptev

Somshubra Majumdar

Boris Ginsburg

353

06 Oct 2021

A Configurable Multilingual Model is All You Need to Recognize All Languages

282

13 Jul 2021

On lattice-free boosted MMI training of HMM and CTC-based full-context ASR modelsAutomatic Speech Recognition & Understanding (ASRU), 2021

341

09 Jul 2021

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow FusionInterspeech (Interspeech), 2021

...

Ozlem Kalinli

284

127

05 Apr 2021

Improving RNN Transducer Based ASR with Auxiliary Tasks

361

05 Nov 2020

Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applicationsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

340

27 Oct 2020