Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition

3 November 2020

Xie Chen

Papers citing "Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition"

50 / 89 papers shown

Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation

171

26 Nov 2025

AsyncSwitch: Asynchronous Text-Speech Adaptation for Code-Switched ASR

Tuan Nguyen

Huy-Dat Tran

175

17 Jun 2025

Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction

196

06 Jun 2025

NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding

157

28 May 2025

SegAug: CTC-Aligned Segmented Augmentation For Robust RNN-Transducer Based Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

275

20 Feb 2025

Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

220

17 Jan 2025

Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient RetrievalIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024

563

01 Nov 2024

Alignment-Free Training for Transducer-based Multi-Talker ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Marc Delcroix

269

30 Sep 2024

Unifying Global and Near-Context Biasing in a Single Trie PassInternational Conference on Text, Speech and Dialogue (TSD), 2024

...

415

20 Sep 2024

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion RecognitionSpoken Language Technology Workshop (SLT), 2024

Chao-Han Huck Yang

Taejin Park

Yuan Gong

Yuanchao Li

Zhehuai Chen

...

Peter Bell

Shinji Watanabe

359

15 Sep 2024

ASR Error Correction using Large Language ModelsIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024

354

14 Sep 2024

SALSA: Speedy ASR-LLM Synchronous AggregationInterspeech (Interspeech), 2024

398

29 Aug 2024

LLM Internal States Reveal Hallucination Risk Faced With a Query

Delong Chen

391

03 Jul 2024

Text Injection for Neural Contextual Biasing

Bhuvana Ramabhadran

359

05 Jun 2024

"Pass the butter": A study on desktop-classic multitasking robotic arm based on advanced YOLOv7 and BERT

209

27 May 2024

Revisiting ASR Error Correction with Specialized Models

319

24 May 2024

Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function ApproximatorAsia and South Pacific Design Automation Conference (ASP-DAC), 2024

196

03 May 2024

Effective internal language model training and fusion for factorized transducer modelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Ozlem Kalinli

274

02 Apr 2024

Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition

248

28 Mar 2024

Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech RecognisersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

351

22 Jan 2024

Iterative Shallow Fusion of Backward Language Model for End-to-End Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

175

17 Oct 2023

Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization

...

258

16 Oct 2023

Acoustic Model Fusion for End-to-end Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2023

...

262

10 Oct 2023

Neural Language Model Pruning for Automatic Speech Recognition

263

05 Oct 2023

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation

312

16 Sep 2023

t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation CapabilityIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Zhuo Chen

212

15 Sep 2023

Hybrid Attention-based Encoder-decoder Model for Efficient Language Model AdaptationSpoken Language Technology Workshop (SLT), 2023

273

14 Sep 2023

Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic RepresentationInterspeech (Interspeech), 2023

Zhiyong Wu

201

04 Sep 2023

Decoupled Structure for Improved Adaptability of End-to-End ModelsSpeech Communication (Speech Commun.), 2023

Keqi Deng

P. Woodland

AuLLM

302

25 Aug 2023

Text Injection for Capitalization and Turn-Taking Prediction in Speech ModelsInterspeech (Interspeech), 2023

211

14 Aug 2023

Improving RNN-Transducers with Acoustic LookAheadInterspeech (Interspeech), 2023

314

11 Jul 2023

Can Generative Large Language Models Perform ASR Error Correction?

Rao Ma

Potsawee Manakul

376

09 Jul 2023

Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2023

278

28 Jun 2023

Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You ThinkInterspeech (Interspeech), 2023

189

15 Jun 2023

Improving Language Model Integration for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

251

08 Jun 2023

Text-only Domain Adaptation using Unified Speech-Text Representation in TransducerInterspeech (Interspeech), 2023

335

07 Jun 2023

Can Contextual Biasing Remain Effective with Whisper and GPT-2?Interspeech (Interspeech), 2023

Guangzhi Sun

Xianrui Zheng

Chuxu Zhang

P. Woodland

228

02 Jun 2023

Adapting an Unadaptable ASR SystemInterspeech (Interspeech), 2023

Rao Ma

Mengjie Qian

Mark Gales

Kate Knill

378

01 Jun 2023

Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer GeneratorIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Guangzhi Sun

Chuxu Zhang

P. Woodland

273

30 May 2023

External Language Model Integration for Factorized Neural Transducers

Michael Levit

S. Parthasarathy

Cem Aksoylar

Mohammad Sadegh Rasooli

Shuangyu Chang

289

26 May 2023

Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative DecodingInterspeech (Interspeech), 2023

Xinyuan Qian

171

23 May 2023

Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model EstimationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

218

05 May 2023

CB-Conformer: Contextual biasing Conformer for biased word recognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Zhiyong Wu

Shiyin Kang

Helen Meng

318

19 Apr 2023

Approximate Nearest Neighbour Phrase Mining for Contextual Speech RecognitionInterspeech (Interspeech), 2023

Maurits J. R. Bleeker

P. Swietojanski

Stefan Braun

Xiaodan Zhuang

285

18 Apr 2023

End-to-End Speech Recognition: A SurveyIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

361

276

03 Mar 2023

Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech RecognitionNeural Networks (Neural Netw.), 2023

Leyuan Qu

C. Weber

S. Wermter

197

20 Feb 2023

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Andrew Rosenberg

Bhuvana Ramabhadran

AuLLM VLM

260

16 Feb 2023

Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual SoftmaxIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Keqi Deng

P. Woodland

AuLLM KELM

220

16 Feb 2023

Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical DistillationInterspeech (Interspeech), 2023

Minglun Han

Bo Xu

261

30 Jan 2023

Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

326

05 Dec 2022