2011.04785
Cited By
Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR
9 November 2020
Xiaohui Zhang
Frank Zhang
Chunxi Liu
Kjell Schubert
Julian Chan
Pradyot Prakash
Jun Liu
Ching-Feng Yeh
Fuchun Peng
Yatharth Saraf
Geoffrey Zweig
Papers citing
"Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR"
14 / 14 papers shown
Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Nilaksh Das
Monica Sunkara
S. Bodapati
Jason (Jinglun) Cai
Devang Kulshreshtha
Jeffrey J. Farris
Katrin Kirchhoff
187
4
0
05 May 2023
Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Stefan Braun
Erik McDermott
Roger Hsiao
173
1
0
29 Nov 2022
Anchored Speech Recognition with Neural Transducers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Desh Raj
Junteng Jia
Jay Mahadeokar
Chunyang Wu
Niko Moritz
Xiaohui Zhang
Ozlem Kalinli
294
2
0
20 Oct 2022
Learning a Dual-Mode Speech Recognition Model via Self-Pruning
Spoken Language Technology Workshop (SLT), 2022
Chunxi Liu
Yuan Shangguan
Haichuan Yang
Yangyang Shi
Raghuraman Krishnamoorthi
Ozlem Kalinli
351
7
0
25 Jul 2022
Pruned RNN-T for fast, memory-efficient ASR training
Interspeech, 2022
Fangjun Kuang
Liyong Guo
Wei Kang
Long Lin
Mingshuang Luo
Zengwei Yao
Daniel Povey
273
90
0
23 Jun 2022
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Neural Information Processing Systems (NeurIPS), 2022
Sehoon Kim
A. Gholami
Albert Eaton Shaw
Nicholas Lee
K. Mangalam
Jitendra Malik
Michael W. Mahoney
Kurt Keutzer
397
134
0
02 Jun 2022
Memory-Efficient Training of RNN-Transducer with Sampled Softmax
Interspeech, 2022
Jaesong Lee
Lukas Lee
Shinji Watanabe
357
8
0
31 Mar 2022
Recent Advances in End-to-End Automatic Speech Recognition
APSIPA Transactions on Signal and Information Processing (TASIP), 2021
Jinyu Li
498
440
0
02 Nov 2021
CTC Variations Through New WFST Topologies
A. Laptev
Somshubra Majumdar
Boris Ginsburg
353
24
0
06 Oct 2021
A Configurable Multilingual Model is All You Need to Recognize All Languages
Long Zhou
Jinyu Li
Eric Sun
Shujie Liu
282
47
0
13 Jul 2021
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models
Automatic Speech Recognition & Understanding (ASRU), 2021
Xiaohui Zhang
Vimal Manohar
David C. Zhang
Frank Zhang
Yangyang Shi
Nayan Singhal
Julian Chan
Fuchun Peng
Yatharth Saraf
M. Seltzer
341
14
0
09 Jul 2021
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
Interspeech, 2021
Duc Le
Mahaveer Jain
Gil Keren
Suyoun Kim
Yangyang Shi
...
Yuan Shangguan
Christian Fuegen
Ozlem Kalinli
Yatharth Saraf
M. Seltzer
284
127
0
05 Apr 2021
Improving RNN Transducer Based ASR with Auxiliary Tasks
Chunxi Liu
Frank Zhang
Duc Le
Suyoun Kim
Yatharth Saraf
Geoffrey Zweig
361
49
0
05 Nov 2020
Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yongqiang Wang
Yangyang Shi
Frank Zhang
Chunyang Wu
Julian Chan
Ching-Feng Yeh
Alex Xiao
340
28
0
27 Oct 2020