An analysis of incorporating an external language model into a sequence-to-sequence model

6 December 2017

Papers citing "An analysis of incorporating an external language model into a sequence-to-sequence model"

50 / 175 papers shown

ASR Error Correction in Low-Resource Burmese with Alignment-Enhanced Transformers using Phonetic Features

130

26 Nov 2025

Zero- and One-Shot Data Augmentation for Sentence-Level Dysarthric Speech Recognition in Constrained Scenarios

133

19 Oct 2025

Automatic Speech Recognition in the Modern Era: Architectures, Training, and Evaluation

125

11 Oct 2025

Denoising GER: A Noise-Robust Generative Error Correction with LLM for Speech Recognition

136

04 Sep 2025

Supporting SENCOTEN Language Documentation Efforts with Automatic Speech Recognition

142

14 Jul 2025

Mixture of LoRA Experts with Multi-Modal and Multi-Granularity LLM Generative Error Correction for Accented Speech RecognitionIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025

297

12 Jul 2025

Audio-3DVG: Unified Audio -- Point Cloud Fusion for 3D Visual Grounding

226

01 Jul 2025

Context Biasing for Pronunciations-Orthography Mismatch in Automatic Speech Recognition

Christian Huber

Alexander Waibel

143

23 Jun 2025

Improving Named Entity Transcription with Contextual LLM-based Revision

325

12 Jun 2025

FlanEC: Exploring Flan-T5 for Post-ASR Error CorrectionSpoken Language Technology Workshop (SLT), 2024

Moreno La Quatra

Valerio Mario Salerno

Yu Tsao

Sabato Marco Siniscalchi

411

22 Jan 2025

Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

124

08 Jan 2025

LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific TransliterationAAAI Conference on Artificial Intelligence (AAAI), 2024

Sangmin Lee

Woo-Jin Chung Hong-Goo Kang

Hong-Goo Kang

471

19 Dec 2024

Alignment-Free Training for Transducer-based Multi-Talker ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Marc Delcroix

217

30 Sep 2024

Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with WhisperConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Iuliia Thorbecke

Juan Zuluaga-Gomez

328

20 Sep 2024

Unifying Global and Near-Context Biasing in a Single Trie PassInternational Conference on Text, Speech and Dialogue (TSD), 2024

...

334

20 Sep 2024

SALSA: Speedy ASR-LLM Synchronous AggregationInterspeech (Interspeech), 2024

335

29 Aug 2024

XCB: an effective contextual biasing approach to bias cross-lingual phrases in speech recognition

168

20 Aug 2024

An efficient text augmentation approach for contextualized Mandarin speech recognitionInterspeech (Interspeech), 2024

Naijun Zheng

Xucheng Wan

Kai Liu

Ziqing Du

Zhou Huan

187

14 Jun 2024

Enhancing CTC-based speech recognition with diverse modeling units

Zhen Huang

339

05 Jun 2024

Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

245

24 May 2024

Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Robust and Instruction-Aware ASR and OCR

486

23 May 2024

Contextualized Automatic Speech Recognition with Dynamic Vocabulary

Shinji Watanabe

289

22 May 2024

MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech RecognitionIEEE Signal Processing Letters (SPL), 2024

Qijie Shao

Lei Xie

353

06 May 2024

Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview

Heyang Liu

Yu Wang

Yanfeng Wang

278

01 Mar 2024

It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition

Chen Chen

Ruizhe Li

Yuchen Hu

Sabato Marco Siniscalchi

Pin-Yu Chen

Ensiong Chng

Chao-Han Huck Yang

227

08 Feb 2024

Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive StudyIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

251

23 Jan 2024

Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech RecognisersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

277

22 Jan 2024

Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search

Shinji Watanabe

221

19 Jan 2024

Retrieve and Copy: Scaling ASR Personalization to Large CatalogsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Sai Muralidhar Jayanthi

208

14 Nov 2023

Improving Seq2Seq Grammatical Error Correction via Decoding InterventionsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Min Zhang

Ji Zhang

Fei Huang

220

23 Oct 2023

Multi-stage Large Language Model Correction for Speech Recognition

287

17 Oct 2023

Iterative Shallow Fusion of Backward Language Model for End-to-End Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

152

17 Oct 2023

Correction Focused Language Model Training for Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Yingyi Ma

Zhe Liu

Ozlem Kalinli

KELM

280

17 Oct 2023

Acoustic Model Fusion for End-to-end Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2023

...

203

10 Oct 2023

Forgetting Private Textual Sequences in Language Models via Leave-One-Out EnsembleIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Zhe Liu

Ozlem Kalinli

MU KELM

231

28 Sep 2023

HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language ModelsNeural Information Processing Systems (NeurIPS), 2023

Cheng Chen

Yuchen Hu

Chao-Han Huck Yang

Sabato Marco Siniscalchi

Pin-Yu Chen

Eng Siong Chng

212

27 Sep 2023

Improved Factorized Neural Transducer Model For text-only Domain AdaptationInterspeech (Interspeech), 2023

Jing Liu

Jianwei Yu

Xie Chen

326

18 Sep 2023

t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation CapabilityIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Zhuo Chen

163

15 Sep 2023

PromptASR for contextualized ASR with controllable styleIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Xiaoyu Yang

Wei Kang

Zengwei Yao

Yifan Yang

Liyong Guo

Fangjun Kuang

Long Lin

Daniel Povey

341

14 Sep 2023

Recovering from Privacy-Preserving Masking with Large Language ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Ozlem Kalinli

230

12 Sep 2023

Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic RepresentationInterspeech (Interspeech), 2023

Zhiyong Wu

165

04 Sep 2023

Decoupled Structure for Improved Adaptability of End-to-End ModelsSpeech Communication (Speech Commun.), 2023

Keqi Deng

P. Woodland

AuLLM

204

25 Aug 2023

Text Injection for Capitalization and Turn-Taking Prediction in Speech ModelsInterspeech (Interspeech), 2023

157

14 Aug 2023

A Novel Self-training Approach for Low-resource Speech RecognitionInterspeech (Interspeech), 2023

Satwinder Singh

Feng Hou

Ruili Wang

206

10 Aug 2023

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech RecognitionInterspeech (Interspeech), 2023

Shinji Watanabe

185

24 Jul 2023

Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical StudyInternational Conference on Neural Information Processing (ICONIP), 2023

Zeping Min

Jinbo Wang

AuLLM

197

13 Jul 2023

Multilingual Contextual Adapters To Improve Custom Word Recognition In Low-resource LanguagesInterspeech (Interspeech), 2023

217

03 Jul 2023

Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2023

253

28 Jun 2023

Large-scale Language Model Rescoring on Long-form DataIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

...

Kartik Audhkhasi

Bhuvana Ramabhadran

Pedro J. Moreno

Michael Riley

184

13 Jun 2023

VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Minglun Han

Bo Xu

185

31 May 2023