Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1712.01996
Cited By
An analysis of incorporating an external language model into a sequence-to-sequence model
6 December 2017
Anjuli Kannan
Yonghui Wu
Patrick Nguyen
Tara N. Sainath
Zhiwen Chen
Rohit Prabhavalkar
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"An analysis of incorporating an external language model into a sequence-to-sequence model"
50 / 175 papers shown
ASR Error Correction in Low-Resource Burmese with Alignment-Enhanced Transformers using Phonetic Features
Ye Bhone Lin
Thura Aung
Ye Kyaw Thu
Thazin Myint Oo
130
0
0
26 Nov 2025
Zero- and One-Shot Data Augmentation for Sentence-Level Dysarthric Speech Recognition in Constrained Scenarios
Shiyao Wang
Shiwan Zhao
Jiaming Zhou
Yong Qin
133
0
0
19 Oct 2025
Automatic Speech Recognition in the Modern Era: Architectures, Training, and Evaluation
Md. Nayeem
Md Shamse Tabrej
Kabbojit Jit Deb
Shaonti Goswami
Md. Azizul Hakim
AI4TS
VLM
125
3
0
11 Oct 2025
Denoising GER: A Noise-Robust Generative Error Correction with LLM for Speech Recognition
Yanyan Liu
Minqiang Xu
Yihao Chen
Liang He
Lei Fang
Sian Fang
Lin Liu
VLM
136
2
0
04 Sep 2025
Supporting SENCOTEN Language Documentation Efforts with Automatic Speech Recognition
Mengzhe Geng
Patrick Littell
Aidan Pine
PENÁĆ
Marc Tessier
Roland Kuhn
142
0
0
14 Jul 2025
Mixture of LoRA Experts with Multi-Modal and Multi-Granularity LLM Generative Error Correction for Accented Speech Recognition
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Bingshen Mu
Kun Wei
Pengcheng Guo
Lei Xie
297
7
0
12 Jul 2025
Audio-3DVG: Unified Audio -- Point Cloud Fusion for 3D Visual Grounding
Duc Cao-Dinh
Khai Le-Duc
Anh Dao
Bach Phan Tat
Chris Ngo
Duy M. H. Nguyen
Nguyen X. Khanh
Thanh Nguyen-Tang
226
0
0
01 Jul 2025
Context Biasing for Pronunciations-Orthography Mismatch in Automatic Speech Recognition
Christian Huber
Alexander Waibel
143
0
0
23 Jun 2025
Improving Named Entity Transcription with Contextual LLM-based Revision
V. Trinh
Xinlu He
Jacob Whitehill
KELM
325
0
0
12 Jun 2025
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction
Spoken Language Technology Workshop (SLT), 2024
Moreno La Quatra
Valerio Mario Salerno
Yu Tsao
Sabato Marco Siniscalchi
411
5
0
22 Jan 2025
Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Wei Zhang
Tian-Hao Zhang
Chao Luo
Hui Zhou
Chao Yang
Xinyuan Qian
Xu-cheng Yin
124
0
0
08 Jan 2025
LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration
AAAI Conference on Artificial Intelligence (AAAI), 2024
Sangmin Lee
Woo-Jin Chung Hong-Goo Kang
Hong-Goo Kang
471
1
0
19 Dec 2024
Alignment-Free Training for Transducer-based Multi-Talker ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Takafumi Moriya
Shota Horiguchi
Marc Delcroix
Ryo Masumura
Takanori Ashihara
Hiroshi Sato
Kohei Matsuura
Masato Mimura
217
6
0
30 Sep 2024
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Iuliia Thorbecke
Juan Zuluaga-Gomez
Esaú Villatoro-Tello
Shashi Kumar
Pradeep Rangappa
Sergio Burdisso
P. Motlícek
Karthik Pandia
A. Ganapathiraju
328
0
0
20 Sep 2024
Unifying Global and Near-Context Biasing in a Single Trie Pass
International Conference on Text, Speech and Dialogue (TSD), 2024
Iuliia Thorbecke
Esaú Villatoro-Tello
Juan Zuluaga-Gomez
Shashi Kumar
Sergio Burdisso
...
A. Ganapathiraju
P. Motlícek
Karthik Pandia
Kadri Hacioğlu
Andreas Stolcke
334
0
0
20 Sep 2024
SALSA: Speedy ASR-LLM Synchronous Aggregation
Interspeech (Interspeech), 2024
Ashish R. Mittal
Darshan Prabhu
Sunita Sarawagi
Preethi Jyothi
335
10
0
29 Aug 2024
XCB: an effective contextual biasing approach to bias cross-lingual phrases in speech recognition
Xucheng Wan
Naijun Zheng
Kai Liu
Huan Zhou
168
0
0
20 Aug 2024
An efficient text augmentation approach for contextualized Mandarin speech recognition
Interspeech (Interspeech), 2024
Naijun Zheng
Xucheng Wan
Kai Liu
Ziqing Du
Zhou Huan
187
2
0
14 Jun 2024
Enhancing CTC-based speech recognition with diverse modeling units
Shiyi Han
Zhihong Lei
Mingbin Xu
Xingyu Na
Zhen Huang
339
1
0
05 Jun 2024
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition
Zijin Gu
Tatiana Likhomanenko
Richard He Bai
Erik McDermott
R. Collobert
Navdeep Jaitly
AuLLM
245
10
0
24 May 2024
Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Robust and Instruction-Aware ASR and OCR
Chan-Jan Hsu
Yi-Chang Chen
Feng-Ting Liao
Pei-Chen Ho
Yu-Hsiang Wang
Po-Chun Hsu
Da-shan Shiu
486
3
0
23 May 2024
Contextualized Automatic Speech Recognition with Dynamic Vocabulary
Yui Sudo
Yosuke Fukumoto
Muhammad Shakeel
Yifan Peng
Shinji Watanabe
289
7
0
22 May 2024
MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition
IEEE Signal Processing Letters (SPL), 2024
Bingshen Mu
Yangze Li
Qijie Shao
Kun Wei
Xucheng Wan
Naijun Zheng
Huan Zhou
Lei Xie
353
19
0
06 May 2024
Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
Heyang Liu
Yu Wang
Yanfeng Wang
278
0
0
01 Mar 2024
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
Chen Chen
Ruizhe Li
Yuchen Hu
Sabato Marco Siniscalchi
Pin-Yu Chen
Ensiong Chng
Chao-Han Huck Yang
227
32
0
08 Feb 2024
Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Wenjie Huang
Cyril Allauzen
Tongzhou Chen
Kilol Gupta
Ke Hu
James Qin
Yu Zhang
Yongqiang Wang
Shuo-yiin Chang
Tara N. Sainath
MoMe
251
16
0
23 Jan 2024
Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Michael Hentschel
Yuta Nishikawa
Tatsuya Komatsu
Yusuke Fujita
277
5
0
22 Jan 2024
Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search
Yui Sudo
Muhammad Shakeel
Yosuke Fukumoto
Yifan Peng
Shinji Watanabe
221
16
0
19 Jan 2024
Retrieve and Copy: Scaling ASR Personalization to Large Catalogs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Sai Muralidhar Jayanthi
Devang Kulshreshtha
Saket Dingliwal
S. Ronanki
S. Bodapati
208
9
0
14 Nov 2023
Improving Seq2Seq Grammatical Error Correction via Decoding Interventions
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Houquan Zhou
Yumeng Liu
Zhenghua Li
Min Zhang
Bo Zhang
Chen Li
Ji Zhang
Fei Huang
220
13
0
23 Oct 2023
Multi-stage Large Language Model Correction for Speech Recognition
Jie Pu
Thai-Son Nguyen
Sebastian Stüker
LRM
287
14
0
17 Oct 2023
Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
A. Ogawa
Takafumi Moriya
Naoyuki Kamo
Naohiro Tawara
Marc Delcroix
152
3
0
17 Oct 2023
Correction Focused Language Model Training for Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yingyi Ma
Zhe Liu
Ozlem Kalinli
KELM
280
6
0
17 Oct 2023
Acoustic Model Fusion for End-to-end Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2023
Zhihong Lei
Mingbin Xu
Shiyi Han
Leo Liu
Zhen Huang
...
Yuanyuan Zhang
Ernest Pusateri
Mirko Hannemann
Yaqiao Deng
Man-Hung Siu
203
6
0
10 Oct 2023
Forgetting Private Textual Sequences in Language Models via Leave-One-Out Ensemble
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Zhe Liu
Ozlem Kalinli
MU
KELM
231
5
0
28 Sep 2023
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
Neural Information Processing Systems (NeurIPS), 2023
Cheng Chen
Yuchen Hu
Chao-Han Huck Yang
Sabato Marco Siniscalchi
Pin-Yu Chen
Eng Siong Chng
212
61
0
27 Sep 2023
Improved Factorized Neural Transducer Model For text-only Domain Adaptation
Interspeech (Interspeech), 2023
Jing Liu
Jianwei Yu
Xie Chen
326
2
0
18 Sep 2023
t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jian Wu
Naoyuki Kanda
Takuya Yoshioka
Rui Zhao
Zhuo Chen
Jinyu Li
163
6
0
15 Sep 2023
PromptASR for contextualized ASR with controllable style
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xiaoyu Yang
Wei Kang
Zengwei Yao
Yifan Yang
Liyong Guo
Fangjun Kuang
Long Lin
Daniel Povey
341
24
0
14 Sep 2023
Recovering from Privacy-Preserving Masking with Large Language Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
A. Vats
Zhe Liu
Peng Su
Debjyoti Paul
Yingyi Ma
Yutong Pang
Zeeshan Ahmed
Ozlem Kalinli
230
12
0
12 Sep 2023
Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation
Interspeech (Interspeech), 2023
Jiaxu Zhu
Weinan Tong
Yaoxun Xu
Chang Song
Zhiyong Wu
Zhao You
Jane Polak Scowcroft
Dong Yu
Helen M. Meng
165
0
0
04 Sep 2023
Decoupled Structure for Improved Adaptability of End-to-End Models
Speech Communication (Speech Commun.), 2023
Keqi Deng
P. Woodland
AuLLM
204
6
0
25 Aug 2023
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Interspeech (Interspeech), 2023
Shaan Bijwadia
Shuo-yiin Chang
Weiran Wang
Zhong Meng
Hao Zhang
Tara N. Sainath
157
3
0
14 Aug 2023
A Novel Self-training Approach for Low-resource Speech Recognition
Interspeech (Interspeech), 2023
Satwinder Singh
Feng Hou
Ruili Wang
206
13
0
10 Aug 2023
Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition
Interspeech (Interspeech), 2023
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
185
4
0
24 Jul 2023
Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study
International Conference on Neural Information Processing (ICONIP), 2023
Zeping Min
Jinbo Wang
AuLLM
197
19
0
13 Jul 2023
Multilingual Contextual Adapters To Improve Custom Word Recognition In Low-resource Languages
Interspeech (Interspeech), 2023
Devang Kulshreshtha
Saket Dingliwal
Brady C. Houston
S. Bodapati
217
6
0
03 Jul 2023
Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2023
Yuang Li
Yu-Huan Wu
Jinyu Li
Shujie Liu
253
61
0
28 Jun 2023
Large-scale Language Model Rescoring on Long-form Data
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Tongzhou Chen
Cyril Allauzen
Yinghui Huang
Daniel S. Park
David Rybach
...
Rodrigo Cabrera
Kartik Audhkhasi
Bhuvana Ramabhadran
Pedro J. Moreno
Michael Riley
184
25
0
13 Jun 2023
VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ziyi Ni
Minglun Han
Feilong Chen
Linghui Meng
Jing Shi
Shuang Xu
Bo Xu
185
3
0
31 May 2023
1
2
3
4
Next