1712.01996
Cited By
An analysis of incorporating an external language model into a sequence-to-sequence model
6 December 2017
Anjuli Kannan
Yonghui Wu
Patrick Nguyen
Tara N. Sainath
Zhiwen Chen
Rohit Prabhavalkar
Papers citing
"An analysis of incorporating an external language model into a sequence-to-sequence model"
50 / 175 papers shown
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation
Interspeech (Interspeech), 2023
Yui Sudo
K. Hata
K. Nakadai
182
5
0
29 May 2023
Blank-regularized CTC for Frame Skipping in Neural Transducer
Interspeech (Interspeech), 2023
Yifan Yang
Xiaoyu Yang
Liyong Guo
Zengwei Yao
Wei Kang
Fangjun Kuang
Long Lin
Xie Chen
Daniel Povey
148
11
0
19 May 2023
CB-Conformer: Contextual biasing Conformer for biased word recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yaoxun Xu
Baiji Liu
Qiaochu Huang
Xingcheng Song
Zhiyong Wu
Shiyin Kang
Helen Meng
282
18
0
19 Apr 2023
PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
R. Pandey
Roger Ren
Qi Luo
Jing Liu
Ariya Rastrow
Ankur Gandhe
Denis Filimonov
Grant P. Strimel
A. Stolcke
I. Bulyko
147
15
0
30 Mar 2023
A Deliberation-based Joint Acoustic and Text Decoder
Interspeech (Interspeech), 2021
S. Mavandadi
Tara N. Sainath
Ke Hu
Zelin Wu
137
7
0
23 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
APSIPA Transactions on Signal and Information Processing (TASIP), 2023
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
279
53
0
10 Mar 2023
End-to-End Speech Recognition: A Survey
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schlüter
Shinji Watanabe
VLM
294
248
0
03 Mar 2023
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Neural Networks (Neural Netw.), 2023
Leyuan Qu
C. Weber
S. Wermter
150
13
0
20 Feb 2023
Massively Multilingual Shallow Fusion with Large Language Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ke Hu
Tara N. Sainath
Yue Liu
Nan Du
Yanping Huang
Andrew M. Dai
Yu Zhang
Rodrigo Cabrera
Zhiwen Chen
Trevor Strohman
173
17
0
17 Feb 2023
Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Keqi Deng
P. Woodland
AuLLM
KELM
175
13
0
16 Feb 2023
Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Ho-Lam Chung
Junan Li
Pengfei Liu
Wai-Kim Leung
Xixin Wu
Helen Meng
232
5
0
02 Feb 2023
Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition
Interspeech (Interspeech), 2022
Yukun Feng
Ming Tu
Rui Xia
Chuanzeng Huang
Yuxuan Wang
RALM
226
0
0
30 Dec 2022
Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rui Zhao
Jian Xue
P. Parthasarathy
Veljko Miljanic
Jinyu Li
236
16
0
05 Dec 2022
Adaptive Multi-Corpora Language Model Training for Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yingyi Ma
Zhe Liu
Xuedong Zhang
188
3
0
09 Nov 2022
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Ao Zhang
F. Yu
Kaixun Huang
Linfu Xie
Longbiao Wang
Eng Siong Chng
Hui Bu
Binbin Zhang
Wei Chen
Xin Xu
201
5
0
03 Nov 2022
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Suyoun Kim
Ke Li
Lucas Kabela
Rongqing Huang
Jiedan Zhu
Ozlem Kalinli
Duc Le
202
8
0
31 Oct 2022
Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ashish R. Mittal
D. Sivasubramanian
Rishabh K. Iyer
Preethi Jyothi
Ganesh Ramakrishnan
162
4
0
30 Oct 2022
BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yosuke Higuchi
Brian Yan
Siddhant Arora
Tetsuji Ogawa
Tetsunori Kobayashi
Shinji Watanabe
256
31
0
29 Oct 2022
SAN: a robust end-to-end ASR model architecture
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zeping Min
Qian Ge
Guanhua Huang
125
2
0
27 Oct 2022
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Pradip Pramanick
Chayan Sarkar
221
8
0
21 Oct 2022
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Spoken Language Technology Workshop (SLT), 2022
Zhehuai Chen
Ankur Bapna
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Pedro J. Moreno
Nanxin Chen
240
17
0
18 Oct 2022
Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting
Saket Dingliwal
Monica Sunkara
S. Bodapati
S. Ronanki
Jeffrey J. Farris
Katrin Kirchhoff
235
0
0
18 Oct 2022
JOIST: A Joint Speech and Text Streaming Model For ASR
Spoken Language Technology Workshop (SLT), 2022
Tara N. Sainath
Rohit Prabhavalkar
Ankur Bapna
Yu Zhang
Zhouyuan Huo
Zhehuai Chen
Yue Liu
Weiran Wang
Trevor Strohman
RALM
AuLLM
198
37
0
13 Oct 2022
Mitigating Unintended Memorization in Language Models via Alternating Teaching
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhe Liu
Xuedong Zhang
Fuchun Peng
130
5
0
13 Oct 2022
Scaling Up Deliberation for Multilingual ASR
Spoken Language Technology Workshop (SLT), 2022
Ke Hu
Yue Liu
Tara N. Sainath
LRM
309
10
0
11 Oct 2022
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Interspeech (Interspeech), 2022
Hayato Futami
Hirofumi Inaguma
Sei Ueno
Masato Mimura
S. Sakai
Tatsuya Kawahara
KELM
249
13
0
08 Sep 2022
Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Kaushal Bhogale
A. Raman
Tahir Javed
Sumanth Doddapaneni
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
159
30
0
26 Aug 2022
Adversarial Attacks on ASR Systems: An Overview
International Conference on Data Science in Cyberspace (ICDSC), 2022
Xiao Zhang
Hao Tan
Xuan Huang
Denghui Zhang
Keke Tang
Zhaoquan Gu
AAML
132
3
0
03 Aug 2022
Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data
Interspeech (Interspeech), 2022
Naoki Makishima
Satoshi Suzuki
Atsushi Ando
Ryo Masumura
324
6
0
11 Jul 2022
UserLibri: A Dataset for ASR Personalization Using Only Text
Interspeech (Interspeech), 2022
Theresa Breiner
Swaroop Indra Ramaswamy
Ehsan Variani
Shefali Garg
Rajiv Mathews
K. Sim
Kilol Gupta
Mingqing Chen
Lara McConnaughey
143
17
0
02 Jul 2022
Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems
Interspeech (Interspeech), 2021
Jesús Andrés-Ferrer
Dario Albesano
P. Zhan
Paul Vozila
107
6
0
29 Jun 2022
On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode
International Conference on Signal Processing and Communications (ICSPC), 2022
Raviraj Joshi
Subodh Kumar
114
2
0
26 Jun 2022
A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Raviraj Joshi
Ashutosh Kumar Singh
185
11
0
22 Jun 2022
Residual Language Model for End-to-end Speech Recognition
Interspeech (Interspeech), 2022
E. Tsunoo
Yosuke Kashiwagi
Chaitanya Narisetty
Shinji Watanabe
148
11
0
15 Jun 2022
Deep Learning for Visual Speech Analysis: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Yike Guo
Xin Xu
M. Pietikäinen
Tianpeng Liu
VLM
321
53
0
22 May 2022
Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Guangzhi Sun
Chuxu Zhang
P. Woodland
208
16
0
18 May 2022
Detecting Unintended Memorization in Language-Model-Fused ASR
Interspeech (Interspeech), 2022
Wenjie Huang
Steve Chien
Om Thakkar
Rajiv Mathews
193
11
0
20 Apr 2022
Improving Rare Word Recognition with LM-aware MWER Training
Interspeech (Interspeech), 2022
Weiran Wang
Tongzhou Chen
Tara N. Sainath
Ehsan Variani
Rohit Prabhavalkar
...
S. Mavandadi
Cal Peyser
Trevor Strohman
Yanzhang He
David Rybach
KELM
182
13
0
15 Apr 2022
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition
Ye Du
Jie Zhang
Qiu-shi Zhu
Lirong Dai
Ming Wu
Xin Fang
Zhouwang Yang
145
2
0
05 Apr 2022
Scaling Language Model Size in Cross-Device Federated Learning
Jae Hun Ro
Theresa Breiner
Lara McConnaughey
Mingqing Chen
A. Suresh
Shankar Kumar
Rajiv Mathews
FedML
148
34
0
31 Mar 2022
An Empirical Study of Language Model Integration for Transducer based Speech Recognition
Interspeech (Interspeech), 2022
Huahuan Zheng
Keyu An
Zhijian Ou
Chen Huang
Ke Ding
Guanglu Wan
200
5
0
31 Mar 2022
End-to-end contextual ASR based on posterior distribution adaptation for hybrid CTC/attention system
Zheng Zhang
Pan Zhou
151
7
0
18 Feb 2022
AISHELL-NER: Named Entity Recognition from Chinese Speech
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Boli Chen
Guangwei Xu
Xiaobin Wang
Pengjun Xie
Meishan Zhang
Fei Huang
126
39
0
17 Feb 2022
Joint Speech Recognition and Audio Captioning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chaitanya Narisetty
E. Tsunoo
Xuankai Chang
Yosuke Kashiwagi
Michael Hentschel
Shinji Watanabe
145
10
0
03 Feb 2022
Neural-FST Class Language Model for End-to-End Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
A. Bruguier
Duc Le
Rohit Prabhavalkar
Dangna Li
Zhe Liu
Bo Wang
Eun Chang
Fuchun Peng
Ozlem Kalinli
M. Seltzer
242
6
0
28 Jan 2022
Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR
Interspeech (Interspeech), 2022
Yufei Liu
Rao Ma
Haihua Xu
Yi He
Zejun Ma
Weibin Zhang
155
15
0
26 Jan 2022
A Likelihood Ratio based Domain Adaptation Method for E2E Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chhavi Choudhury
Ankur Gandhe
Xiaohan Ding
I. Bulyko
142
11
0
10 Jan 2022
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
VLM
90
32
0
14 Dec 2021
Context-Aware Transformer Transducer for Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Feng-Ju Chang
Jing Liu
Martin H. Radfar
Athanasios Mouchtaris
M. Omologo
Ariya Rastrow
Siegfried Kunzmann
194
96
0
05 Nov 2021
Advances and Challenges in Deep Lip Reading
Marzieh Oghbaie
Arian Sabaghi
Kooshan Hashemifard
Mohammad Akbari
VLM
143
17
0
15 Oct 2021