An analysis of incorporating an external language model into a sequence-to-sequence model

6 December 2017
Anjuli Kannan
Yonghui Wu
Patrick Nguyen
Tara N. Sainath
Zhiwen Chen
Rohit Prabhavalkar

Papers citing "An analysis of incorporating an external language model into a sequence-to-sequence model"

50 / 175 papers shown
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation
Interspeech, 2023
Yui Sudo
K. Hata
K. Nakadai
182
5
0
29 May 2023
Blank-regularized CTC for Frame Skipping in Neural Transducer
Interspeech, 2023
Yifan Yang
Xiaoyu Yang
Liyong Guo
Zengwei Yao
Wei Kang
Fangjun Kuang
Long Lin
Xie Chen
Daniel Povey
148
11
0
19 May 2023
CB-Conformer: Contextual biasing Conformer for biased word recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yaoxun Xu
Baiji Liu
Qiaochu Huang
Xingcheng Song
Zhiyong Wu
Shiyin Kang
Helen Meng
282
18
0
19 Apr 2023
PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
R. Pandey
Roger Ren
Qi Luo
Jing Liu
Ariya Rastrow
Ankur Gandhe
Denis Filimonov
Grant P. Strimel
A. Stolcke
I. Bulyko
147
15
0
30 Mar 2023
A Deliberation-based Joint Acoustic and Text Decoder
Interspeech, 2021
S. Mavandadi
Tara N. Sainath
Ke Hu
Zelin Wu
137
7
0
23 Mar 2023
An Overview on Language Models: Recent Developments and Outlook
APSIPA Transactions on Signal and Information Processing (TASIP), 2023
Chengwei Wei
Yun Cheng Wang
Bin Wang
C.-C. Jay Kuo
279
53
0
10 Mar 2023
End-to-End Speech Recognition: A Survey
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schlüter
Shinji Watanabe
VLM
294
248
0
03 Mar 2023
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition
Neural Networks (Neural Netw.), 2023
Leyuan Qu
C. Weber
S. Wermter
150
13
0
20 Feb 2023
Massively Multilingual Shallow Fusion with Large Language Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ke Hu
Tara N. Sainath
Yue Liu
Nan Du
Yanping Huang
Andrew M. Dai
Yu Zhang
Rodrigo Cabrera
Zhiwen Chen
Trevor Strohman
173
17
0
17 Feb 2023
Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Keqi Deng
P. Woodland
AuLLM
KELM
175
13
0
16 Feb 2023
Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Ho-Lam Chung
Junan Li
Pengfei Liu
Wai-Kim Leung
Xixin Wu
Helen Meng
232
5
0
02 Feb 2023
Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition
Interspeech, 2022
Yukun Feng
Ming Tu
Rui Xia
Chuanzeng Huang
Yuxuan Wang
RALM
226
0
0
30 Dec 2022
Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Rui Zhao
Jian Xue
P. Parthasarathy
Veljko Miljanic
Jinyu Li
236
16
0
05 Dec 2022
Adaptive Multi-Corpora Language Model Training for Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yingyi Ma
Zhe Liu
Xuedong Zhang
188
3
0
09 Nov 2022
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Ao Zhang
F. Yu
Kaixun Huang
Linfu Xie
Longbiao Wang
Eng Siong Chng
Hui Bu
Binbin Zhang
Wei Chen
Xin Xu
201
5
0
03 Nov 2022
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Suyoun Kim
Ke Li
Lucas Kabela
Rongqing Huang
Jiedan Zhu
Ozlem Kalinli
Duc Le
202
8
0
31 Oct 2022
Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ashish R. Mittal
D. Sivasubramanian
Rishabh K. Iyer
Preethi Jyothi
Ganesh Ramakrishnan
162
4
0
30 Oct 2022
BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yosuke Higuchi
Brian Yan
Siddhant Arora
Tetsuji Ogawa
Tetsunori Kobayashi
Shinji Watanabe
256
31
0
29 Oct 2022
SAN: a robust end-to-end ASR model architecture
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zeping Min
Qian Ge
Guanhua Huang
125
2
0
27 Oct 2022
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Pradip Pramanick
Chayan Sarkar
221
8
0
21 Oct 2022
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Spoken Language Technology Workshop (SLT), 2022
Zhehuai Chen
Ankur Bapna
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Pedro J. Moreno
Nanxin Chen
240
17
0
18 Oct 2022
Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting
Saket Dingliwal
Monica Sunkara
S. Bodapati
S. Ronanki
Jeffrey J. Farris
Katrin Kirchhoff
235
0
0
18 Oct 2022
JOIST: A Joint Speech and Text Streaming Model For ASR
Spoken Language Technology Workshop (SLT), 2022
Tara N. Sainath
Rohit Prabhavalkar
Ankur Bapna
Yu Zhang
Zhouyuan Huo
Zhehuai Chen
Yue Liu
Weiran Wang
Trevor Strohman
RALM
AuLLM
198
37
0
13 Oct 2022
Mitigating Unintended Memorization in Language Models via Alternating Teaching
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhe Liu
Xuedong Zhang
Fuchun Peng
130
5
0
13 Oct 2022
Scaling Up Deliberation for Multilingual ASR
Spoken Language Technology Workshop (SLT), 2022
Ke Hu
Yue Liu
Tara N. Sainath
LRM
309
10
0
11 Oct 2022
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Interspeech, 2022
Hayato Futami
Hirofumi Inaguma
Sei Ueno
Masato Mimura
S. Sakai
Tatsuya Kawahara
KELM
249
13
0
08 Sep 2022
Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Kaushal Bhogale
A. Raman
Tahir Javed
Sumanth Doddapaneni
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
159
30
0
26 Aug 2022
Adversarial Attacks on ASR Systems: An Overview
International Conference on Data Science in Cyberspace (ICDSC), 2022
Xiao Zhang
Hao Tan
Xuan Huang
Denghui Zhang
Keke Tang
Zhaoquan Gu
AAML
132
3
0
03 Aug 2022
Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data
Interspeech, 2022
Naoki Makishima
Satoshi Suzuki
Atsushi Ando
Ryo Masumura
324
6
0
11 Jul 2022
UserLibri: A Dataset for ASR Personalization Using Only Text
Interspeech, 2022
Theresa Breiner
Swaroop Indra Ramaswamy
Ehsan Variani
Shefali Garg
Rajiv Mathews
K. Sim
Kilol Gupta
Mingqing Chen
Lara McConnaughey
143
17
0
02 Jul 2022
Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems
Interspeech, 2021
Jesús Andrés-Ferrer
Dario Albesano
P. Zhan
Paul Vozila
107
6
0
29 Jun 2022
On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode
International Conference on Signal Processing and Communications (ICSPC), 2022
Raviraj Joshi
Subodh Kumar
114
2
0
26 Jun 2022
A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data
Raviraj Joshi
Ashutosh Kumar Singh
185
11
0
22 Jun 2022
Residual Language Model for End-to-end Speech Recognition
Interspeech, 2022
E. Tsunoo
Yosuke Kashiwagi
Chaitanya Narisetty
Shinji Watanabe
148
11
0
15 Jun 2022
Deep Learning for Visual Speech Analysis: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Changchong Sheng
Gangyao Kuang
L. Bai
Chen Hou
Yike Guo
Xin Xu
M. Pietikäinen
Tianpeng Liu
VLM
321
53
0
22 May 2022
Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Guangzhi Sun
Chuxu Zhang
P. Woodland
208
16
0
18 May 2022
Detecting Unintended Memorization in Language-Model-Fused ASR
Interspeech, 2022
Wenjie Huang
Steve Chien
Om Thakkar
Rajiv Mathews
193
11
0
20 Apr 2022
Improving Rare Word Recognition with LM-aware MWER Training
Interspeech, 2022
Weiran Wang
Tongzhou Chen
Tara N. Sainath
Ehsan Variani
Rohit Prabhavalkar
...
S. Mavandadi
Cal Peyser
Trevor Strohman
Yanzhang He
David Rybach
KELM
182
13
0
15 Apr 2022
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition
Ye Du
Jie Zhang
Qiu-shi Zhu
Lirong Dai
Ming Wu
Xin Fang
Zhouwang Yang
145
2
0
05 Apr 2022
Scaling Language Model Size in Cross-Device Federated Learning
Jae Hun Ro
Theresa Breiner
Lara McConnaughey
Mingqing Chen
A. Suresh
Shankar Kumar
Rajiv Mathews
FedML
148
34
0
31 Mar 2022
An Empirical Study of Language Model Integration for Transducer based Speech Recognition
Interspeech, 2022
Huahuan Zheng
Keyu An
Zhijian Ou
Chen Huang
Ke Ding
Guanglu Wan
200
5
0
31 Mar 2022
End-to-end contextual asr based on posterior distribution adaptation for hybrid ctc/attention system
Zheng Zhang
Pan Zhou
151
7
0
18 Feb 2022
AISHELL-NER: Named Entity Recognition from Chinese Speech
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Boli Chen
Guangwei Xu
Xiaobin Wang
Pengjun Xie
Meishan Zhang
Fei Huang
126
39
0
17 Feb 2022
Joint Speech Recognition and Audio Captioning
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chaitanya Narisetty
E. Tsunoo
Xuankai Chang
Yosuke Kashiwagi
Michael Hentschel
Shinji Watanabe
145
10
0
03 Feb 2022
Neural-FST Class Language Model for End-to-End Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
A. Bruguier
Duc Le
Rohit Prabhavalkar
Dangna Li
Zhe Liu
Bo Wang
Eun Chang
Fuchun Peng
Ozlem Kalinli
M. Seltzer
242
6
0
28 Jan 2022
Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR
Interspeech, 2022
Yufei Liu
Rao Ma
Haihua Xu
Yi He
Zejun Ma
Weibin Zhang
155
15
0
26 Jan 2022
A Likelihood Ratio based Domain Adaptation Method for E2E Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chhavi Choudhury
Ankur Gandhe
Xiaohan Ding
I. Bulyko
142
11
0
10 Jan 2022
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model
Keqi Deng
Songjun Cao
Yike Zhang
Long Ma
VLM
90
32
0
14 Dec 2021
Context-Aware Transformer Transducer for Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2021
Feng-Ju Chang
Jing Liu
Martin H. Radfar
Athanasios Mouchtaris
M. Omologo
Ariya Rastrow
Siegfried Kunzmann
194
96
0
05 Nov 2021
Advances and Challenges in Deep Lip Reading
Marzieh Oghbaie
Arian Sabaghi
Kooshan Hashemifard
Mohammad Akbari
VLM
143
17
0
15 Oct 2021