Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.01991
Cited By
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
3 November 2020
Zhong Meng
S. Parthasarathy
Eric Sun
Yashesh Gaur
Naoyuki Kanda
Liang Lu
Xie Chen
Rui Zhao
Jinyu Li
Jiawei Liu
AuLLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition"
50 / 89 papers shown
Voice, Bias, and Coreference: An Interpretability Study of Gender in Speech Translation
Lina Conti
Dennis Fucci
Marco Gaido
Matteo Negri
Guillaume Wisniewski
L. Bentivogli
171
1
0
26 Nov 2025
AsyncSwitch: Asynchronous Text-Speech Adaptation for Code-Switched ASR
Tuan Nguyen
Huy-Dat Tran
175
0
0
17 Jun 2025
Phonetically-Augmented Discriminative Rescoring for Voice Search Error Correction
Christophe Van Gysel
Maggie Wu
Lyan Verwimp
Caglar Tirkaz
Marco Bertola
Zhihong Lei
Youssef Oualil
196
0
0
06 Jun 2025
NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding
Vladimir Bataev
A. Andrusenko
Lilit Grigoryan
A. Laptev
Vitaly Lavrukhin
Boris Ginsburg
157
0
0
28 May 2025
SegAug: CTC-Aligned Segmented Augmentation For Robust RNN-Transducer Based Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Khanh Le
Tuan Vu Ho
Dung Tran
Duc Thanh Chau
275
2
0
20 Feb 2025
Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Takaaki Hori
Martin Kocour
Adnan Haider
Erik McDermott
Xiaodan Zhuang
AuLLM
220
9
0
17 Jan 2025
Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024
Nikolaos Flemotomos
Roger Hsiao
P. Swietojanski
Takaaki Hori
Dogan Can
Xiaodan Zhuang
563
3
0
01 Nov 2024
Alignment-Free Training for Transducer-based Multi-Talker ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Takafumi Moriya
Shota Horiguchi
Marc Delcroix
Ryo Masumura
Takanori Ashihara
Hiroshi Sato
Kohei Matsuura
Masato Mimura
269
9
0
30 Sep 2024
Unifying Global and Near-Context Biasing in a Single Trie Pass
International Conference on Text, Speech and Dialogue (TSD), 2024
Iuliia Thorbecke
Esaú Villatoro-Tello
Juan Zuluaga-Gomez
Shashi Kumar
Sergio Burdisso
...
A. Ganapathiraju
P. Motlícek
Karthik Pandia
Kadri Hacioğlu
Andreas Stolcke
415
0
0
20 Sep 2024
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Spoken Language Technology Workshop (SLT), 2024
Chao-Han Huck Yang
Taejin Park
Yuan Gong
Yuanchao Li
Zhehuai Chen
...
Eng Siong Chng
Peter Bell
Catherine Lai
Shinji Watanabe
A. Stolcke
AuLLM
ELM
359
14
0
15 Sep 2024
ASR Error Correction using Large Language Models
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024
Rao Ma
Mengjie Qian
Mark Gales
Kate Knill
KELM
354
33
0
14 Sep 2024
SALSA: Speedy ASR-LLM Synchronous Aggregation
Interspeech (Interspeech), 2024
Ashish R. Mittal
Darshan Prabhu
Sunita Sarawagi
Preethi Jyothi
398
12
0
29 Aug 2024
LLM Internal States Reveal Hallucination Risk Faced With a Query
Ziwei Ji
Delong Chen
Etsuko Ishii
Samuel Cahyawijaya
Yejin Bang
Bryan Wilie
Pascale Fung
HILM
LRM
391
79
0
03 Jul 2024
Text Injection for Neural Contextual Biasing
Zhong Meng
Zelin Wu
Rohit Prabhavalkar
Cal Peyser
Weiran Wang
Nanxin Chen
Tara N. Sainath
Bhuvana Ramabhadran
359
6
0
05 Jun 2024
"Pass the butter": A study on desktop-classic multitasking robotic arm based on advanced YOLOv7 and BERT
Haohua Que
Wenbin Pan
Jie Xu
Hao Luo
Pei Wang
Li Zhang
209
1
0
27 May 2024
Revisiting ASR Error Correction with Specialized Models
Zijin Gu
Tatiana Likhomanenko
Richard He Bai
Erik McDermott
R. Collobert
Navdeep Jaitly
KELM
AuLLM
LRM
319
10
0
24 May 2024
Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator
Asia and South Pacific Design Automation Conference (ASP-DAC), 2024
Xincheng Feng
Guodong Shen
Jianhao Hu
Meng Li
Ngai Wong
196
2
0
03 May 2024
Effective internal language model training and fusion for factorized transducer model
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jinxi Guo
Niko Moritz
Yingyi Ma
Frank Seide
Chunyang Wu
Jay Mahadeokar
Ozlem Kalinli
Christian Fuegen
Michael Seltzer
274
4
0
02 Apr 2024
Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
Siyuan Shen
Yu Gao
Feng Liu
Hanyang Wang
Aimin Zhou
248
24
0
28 Mar 2024
Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Michael Hentschel
Yuta Nishikawa
Tatsuya Komatsu
Yusuke Fujita
351
5
0
22 Jan 2024
Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
A. Ogawa
Takafumi Moriya
Naoyuki Kamo
Naohiro Tawara
Marc Delcroix
175
3
0
17 Oct 2023
Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization
Zhihong Lei
Ernest Pusateri
Shiyi Han
Leo Liu
Mingbin Xu
...
R. Travadi
Youyuan Zhang
Mirko Hannemann
Man-Hung Siu
Zhen Huang
258
10
0
16 Oct 2023
Acoustic Model Fusion for End-to-end Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2023
Zhihong Lei
Mingbin Xu
Shiyi Han
Leo Liu
Zhen Huang
...
Yuanyuan Zhang
Ernest Pusateri
Mirko Hannemann
Yaqiao Deng
Man-Hung Siu
262
6
0
10 Oct 2023
Neural Language Model Pruning for Automatic Speech Recognition
Leonardo Emili
Thiago Fraga-Silva
Ernest Pusateri
M. Nußbaum-Thom
Youssef Oualil
263
3
0
05 Oct 2023
Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
VLM
AuLLM
RALM
312
11
0
16 Sep 2023
t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Jian Wu
Naoyuki Kanda
Takuya Yoshioka
Rui Zhao
Zhuo Chen
Jinyu Li
212
6
0
15 Sep 2023
Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation
Spoken Language Technology Workshop (SLT), 2023
Shaoshi Ling
Guoli Ye
Rui Zhao
Yifan Gong
VLM
273
2
0
14 Sep 2023
Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation
Interspeech (Interspeech), 2023
Jiaxu Zhu
Weinan Tong
Yaoxun Xu
Chang Song
Zhiyong Wu
Zhao You
Jane Polak Scowcroft
Dong Yu
Helen M. Meng
201
0
0
04 Sep 2023
Decoupled Structure for Improved Adaptability of End-to-End Models
Speech Communication (Speech Commun.), 2023
Keqi Deng
P. Woodland
AuLLM
302
7
0
25 Aug 2023
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models
Interspeech (Interspeech), 2023
Shaan Bijwadia
Shuo-yiin Chang
Weiran Wang
Zhong Meng
Hao Zhang
Tara N. Sainath
211
3
0
14 Aug 2023
Improving RNN-Transducers with Acoustic LookAhead
Interspeech (Interspeech), 2023
Vinit Unni
Ashish R. Mittal
Preethi Jyothi
Sunita Sarawagi
314
4
0
11 Jul 2023
Can Generative Large Language Models Perform ASR Error Correction?
Rao Ma
Mengjie Qian
Potsawee Manakul
Mark Gales
Kate Knill
AuLLM
KELM
376
83
0
09 Jul 2023
Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2023
Yuang Li
Yu-Huan Wu
Jinyu Li
Shujie Liu
278
71
0
28 Jun 2023
Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think
Interspeech (Interspeech), 2023
Tina Raissi
Christoph Luscher
Moritz Gunz
Ralf Schluter
Hermann Ney
BDL
189
5
0
15 Jun 2023
Improving Language Model Integration for Neural Machine Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Christian Herold
Yingbo Gao
Mohammad Zeineldeen
Hermann Ney
251
3
0
08 Jun 2023
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer
Interspeech (Interspeech), 2023
Lu Huang
Yangqiu Song
Jun Zhang
Lu Lu
Zejun Ma
335
4
0
07 Jun 2023
Can Contextual Biasing Remain Effective with Whisper and GPT-2?
Interspeech (Interspeech), 2023
Guangzhi Sun
Xianrui Zheng
Chuxu Zhang
P. Woodland
228
28
0
02 Jun 2023
Adapting an Unadaptable ASR System
Interspeech (Interspeech), 2023
Rao Ma
Mengjie Qian
Mark Gales
Kate Knill
378
4
0
01 Jun 2023
Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Guangzhi Sun
Chuxu Zhang
P. Woodland
273
9
0
30 May 2023
External Language Model Integration for Factorized Neural Transducers
Michael Levit
S. Parthasarathy
Cem Aksoylar
Mohammad Sadegh Rasooli
Shuangyu Chang
289
2
0
26 May 2023
Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding
Interspeech (Interspeech), 2023
Tianren Zhang
Haibo Qin
Zhibing Lai
Songlu Chen
Qi Liu
Feng Chen
Xinyuan Qian
Xu-Cheng Yin
171
1
0
23 May 2023
Mask The Bias: Improving Domain-Adaptive Generalization of CTC-based ASR with Internal Language Model Estimation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Nilaksh Das
Monica Sunkara
S. Bodapati
Jason (Jinglun) Cai
Devang Kulshreshtha
Jeffrey J. Farris
Katrin Kirchhoff
218
4
0
05 May 2023
CB-Conformer: Contextual biasing Conformer for biased word recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yaoxun Xu
Baiji Liu
Qiaochu Huang and
Xingcheng Song
Zhiyong Wu
Shiyin Kang
Helen Meng
318
19
0
19 Apr 2023
Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition
Interspeech (Interspeech), 2023
Maurits J. R. Bleeker
P. Swietojanski
Stefan Braun
Xiaodan Zhuang
285
9
0
18 Apr 2023
End-to-End Speech Recognition: A Survey
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schluter
Shinji Watanabe
VLM
361
276
0
03 Mar 2023
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Neural Networks (Neural Netw.), 2023
Leyuan Qu
C. Weber
S. Wermter
197
13
0
20 Feb 2023
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Zhong Meng
Weiran Wang
Rohit Prabhavalkar
Tara N. Sainath
Tongzhou Chen
Ehsan Variani
Yu Zhang
Yue Liu
Andrew Rosenberg
Bhuvana Ramabhadran
AuLLM
VLM
260
13
0
16 Feb 2023
Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Keqi Deng
P. Woodland
AuLLM
KELM
220
13
0
16 Feb 2023
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
Interspeech (Interspeech), 2023
Minglun Han
Feilong Chen
Jing Shi
Shuang Xu
Bo Xu
VLM
261
17
0
30 Jan 2023
Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rui Zhao
Jian Xue
P. Parthasarathy
Veljko Miljanic
Jinyu Li
326
16
0
05 Dec 2022
1
2
Next
Page 1 of 2