
Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus

24 August 2020

Papers citing "Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus"

28 / 28 papers shown

Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR

Savitha Murthy

D. Sitaram

173

16 Mar 2024

Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models

250

27 Feb 2024

Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer

Automatic Speech Recognition & Understanding (ASRU), 2023

357

15 Nov 2023

Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants

International Journal of Speech Technology (IJST), 2023

318

02 Nov 2023

Large-scale Language Model Rescoring on Long-form Data

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

...

Kartik Audhkhasi

Bhuvana Ramabhadran

Pedro J. Moreno

Michael Riley

333

13 Jun 2023

Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer

Interspeech, 2023

335

07 Jun 2023

A Deliberation-based Joint Acoustic and Text Decoder

Interspeech, 2021

172

23 Mar 2023

Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

279

22 Feb 2023

Massively Multilingual Shallow Fusion with Large Language Models

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

226

17 Feb 2023

Dual Learning for Large Vocabulary On-Device ASR

Spoken Language Technology Workshop (SLT), 2023

211

11 Jan 2023

Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition

Interspeech, 2022

Yuxuan Wang

265

30 Dec 2022

Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition

Interspeech, 2022

Yerbolat Khassanov

217

28 Oct 2022

LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge

International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022

291

14 Oct 2022

JOIST: A Joint Speech and Text Streaming Model For ASR

Spoken Language Technology Workshop (SLT), 2022

Zhehuai Chen

218

13 Oct 2022

Improving Deliberation by Text-Only and Semi-Supervised Training

Interspeech, 2022

287

29 Jun 2022

Improving Rare Word Recognition with LM-aware MWER Training

Interspeech, 2022

...

243

15 Apr 2022

Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition

Interspeech, 2022

361

09 Mar 2022

Adaptive Discounting of Implicit Language Models in RNN-Transducers

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

228

21 Feb 2022

A Likelihood Ratio based Domain Adaptation Method for E2E Models

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

263

10 Jan 2022

Recent Advances in End-to-End Automatic Speech Recognition

APSIPA Transactions on Signal and Information Processing (TASIP), 2021

Jinyu Li

VLM

569

444

02 Nov 2021

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition

Xie Chen

317

06 Oct 2021

Injecting Text in Self-Supervised Speech Pretraining

Automatic Speech Recognition & Understanding (ASRU), 2021

Zhehuai Chen

Yu Zhang

Andrew Rosenberg

Bhuvana Ramabhadran

Gary Wang

Pedro J. Moreno

SSL

256

27 Aug 2021

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Interspeech, 2021

Xie Chen

224

04 Jun 2021

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition

Interspeech, 2021

283

09 Apr 2021

Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion

286

30 Nov 2020

Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

349

23 Nov 2020

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer

Interspeech, 2020

267

23 Oct 2020

Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance

Yerbolat Khassanov

126

23 Oct 2020