Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus

24 August 2020
Cal Peyser
S. Mavandadi
Tara N. Sainath
J. Apfel
Ruoming Pang
Shankar Kumar
arXiv: 2008.10491

Papers citing "Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus"

28 / 28 papers shown
Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR
Savitha Murthy
D. Sitaram
173
1
0
16 Mar 2024
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models
Rohit Prabhavalkar
Zhong Meng
Weiran Wang
Adam Stooke
Xingyu Cai
Yanzhang He
Arun Narayanan
Dongseong Hwang
Tara N. Sainath
Pedro J. Moreno
250
11
0
27 Feb 2024
Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer
Automatic Speech Recognition & Understanding (ASRU), 2023
Jin Qiu
Lu Huang
Boyu Li
Jun Zhang
Lu Lu
Zejun Ma
357
8
0
15 Nov 2023
Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants
International Journal of Speech Technology (IJST), 2023
Youyuan Zhang
Sashank Gondala
Thiago Fraga-Silva
Christophe Van Gysel
318
3
0
02 Nov 2023
Large-scale Language Model Rescoring on Long-form Data
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Tongzhou Chen
Cyril Allauzen
Yinghui Huang
Daniel S. Park
David Rybach
...
Rodrigo Cabrera
Kartik Audhkhasi
Bhuvana Ramabhadran
Pedro J. Moreno
Michael Riley
333
25
0
13 Jun 2023
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer
Interspeech, 2023
Lu Huang
Yangqiu Song
Jun Zhang
Lu Lu
Zejun Ma
335
4
0
07 Jun 2023
A Deliberation-based Joint Acoustic and Text Decoder
Interspeech, 2021
S. Mavandadi
Tara N. Sainath
Ke Hu
Zelin Wu
172
7
0
23 Mar 2023
Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xiaoqiang Wang
Yanqing Liu
Jinyu Li
Sheng Zhao
279
10
0
22 Feb 2023
Massively Multilingual Shallow Fusion with Large Language Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ke Hu
Tara N. Sainath
Yue Liu
Nan Du
Yanping Huang
Andrew M. Dai
Yu Zhang
Rodrigo Cabrera
Zhiwen Chen
Trevor Strohman
226
17
0
17 Feb 2023
Dual Learning for Large Vocabulary On-Device ASR
Spoken Language Technology Workshop (SLT), 2023
Cal Peyser
Ronny Huang
Tara N. Sainath
Rohit Prabhavalkar
M. Picheny
K. Cho
SSL
211
1
0
11 Jan 2023
Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition
Interspeech, 2022
Yukun Feng
Ming Tu
Rui Xia
Chuanzeng Huang
Yuxuan Wang
RALM
265
0
0
30 Dec 2022
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition
Interspeech, 2022
Yist Y. Lin
Tao Han
Haihua Xu
Van Tung Pham
Yerbolat Khassanov
Tze Yuang Chong
Yi He
Lu Lu
Zejun Ma
217
4
0
28 Oct 2022
LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Yan Jia
Mihee Hong
Jingyu Hou
Kailong Ren
Sifan Ma
Jin Wang
Fangzhen Peng
Yinglin Ji
Lin Yang
Junjie Wang
291
1
0
14 Oct 2022
JOIST: A Joint Speech and Text Streaming Model For ASR
Spoken Language Technology Workshop (SLT), 2022
Tara N. Sainath
Rohit Prabhavalkar
Ankur Bapna
Yu Zhang
Zhouyuan Huo
Zhehuai Chen
Yue Liu
Weiran Wang
Trevor Strohman
RALM
AuLLM
218
37
0
13 Oct 2022
Improving Deliberation by Text-Only and Semi-Supervised Training
Interspeech, 2022
Ke Hu
Tara N. Sainath
Yanzhang He
Rohit Prabhavalkar
Trevor Strohman
S. Mavandadi
Weiran Wang
287
12
0
29 Jun 2022
Improving Rare Word Recognition with LM-aware MWER Training
Interspeech, 2022
Weiran Wang
Tongzhou Chen
Tara N. Sainath
Ehsan Variani
Rohit Prabhavalkar
...
S. Mavandadi
Cal Peyser
Trevor Strohman
Yanzhang He
David Rybach
KELM
243
13
0
15 Apr 2022
Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition
Interspeech, 2022
Wenjie Huang
Cal Peyser
Tara N. Sainath
Ruoming Pang
Trevor Strohman
Shankar Kumar
361
16
0
09 Mar 2022
Adaptive Discounting of Implicit Language Models in RNN-Transducers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Vinit Unni
Shreya Khare
Ashish R. Mittal
Preethi Jyothi
Sunita Sarawagi
Samarth Bharadwaj
228
5
0
21 Feb 2022
A Likelihood Ratio based Domain Adaptation Method for E2E Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chhavi Choudhury
Ankur Gandhe
Xiaohan Ding
I. Bulyko
263
11
0
10 Jan 2022
Recent Advances in End-to-End Automatic Speech Recognition
APSIPA Transactions on Signal and Information Processing (TASIP), 2021
Jinyu Li
VLM
569
444
0
02 Nov 2021
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
Zhong Meng
Yashesh Gaur
Naoyuki Kanda
Jinyu Li
Xie Chen
Yu Wu
Yifan Gong
AuLLM
317
35
0
06 Oct 2021
Injecting Text in Self-Supervised Speech Pretraining
Automatic Speech Recognition & Understanding (ASRU), 2021
Zhehuai Chen
Yu Zhang
Andrew Rosenberg
Bhuvana Ramabhadran
Gary Wang
Pedro J. Moreno
SSL
256
38
0
27 Aug 2021
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Interspeech, 2021
Zhong Meng
Yu-Huan Wu
Naoyuki Kanda
Liang Lu
Xie Chen
Guoli Ye
Eric Sun
Jinyu Li
Jiawei Liu
MoMe
224
22
0
04 Jun 2021
Lookup-Table Recurrent Language Models for Long Tail Speech Recognition
Interspeech, 2021
Wenjie Huang
Tara N. Sainath
Cal Peyser
Shankar Kumar
David Rybach
Trevor Strohman
RALM
LMTD
283
8
0
09 Apr 2021
Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion
Vijay Ravi
Yile Gu
Ankur Gandhe
Ariya Rastrow
Linda Liu
Denis Filimonov
Scott Novotney
I. Bulyko
286
11
0
30 Nov 2020
Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Xianrui Zheng
Yulan Liu
Deniz Gunceler
D. Willett
349
89
0
23 Nov 2020
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
Interspeech, 2020
Liang Lu
Zhong Meng
Naoyuki Kanda
Jinyu Li
Jiawei Liu
267
12
0
23 Oct 2020
Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance
Tingzhi Mao
Yerbolat Khassanov
Van Tung Pham
Haihua Xu
Hao-Ming Huang
Aishan Wumaier
Chng Eng Siong
126
0
0
23 Oct 2020