An analysis of incorporating an external language model into a sequence-to-sequence model

6 December 2017

Papers citing "An analysis of incorporating an external language model into a sequence-to-sequence model"

50 / 175 papers shown

Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation

Interspeech (Interspeech), 2023

Yui Sudo

K. Hata

K. Nakadai

182

29 May 2023

Blank-regularized CTC for Frame Skipping in Neural Transducer

Interspeech (Interspeech), 2023

Yifan Yang

Xiaoyu Yang

Liyong Guo

Zengwei Yao

Wei Kang

Fangjun Kuang

Long Lin

Xie Chen

Daniel Povey

148

19 May 2023

CB-Conformer: Contextual biasing Conformer for biased word recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Zhiyong Wu

Shiyin Kang

Helen Meng

282

19 Apr 2023

PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

147

30 Mar 2023

A Deliberation-based Joint Acoustic and Text Decoder

Interspeech (Interspeech), 2021

137

23 Mar 2023

An Overview on Language Models: Recent Developments and Outlook

APSIPA Transactions on Signal and Information Processing (TASIP), 2023

Chengwei Wei

Yun Cheng Wang

Bin Wang

C.-C. Jay Kuo

279

10 Mar 2023

End-to-End Speech Recognition: A Survey

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

294

248

03 Mar 2023

Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition

Neural Networks (Neural Netw.), 2023

Leyuan Qu

C. Weber

S. Wermter

150

20 Feb 2023

Massively Multilingual Shallow Fusion with Large Language Models

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

173

17 Feb 2023

Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Keqi Deng

P. Woodland

175

16 Feb 2023

Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition

International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022

232

02 Feb 2023

Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition

Interspeech (Interspeech), 2022

Yuxuan Wang

226

30 Dec 2022

Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

236

05 Dec 2022

Adaptive Multi-Corpora Language Model Training for Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Yingyi Ma

Zhe Liu

Xuedong Zhang

188

09 Nov 2022

The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results

International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022

Ao Zhang

Longbiao Wang

Hui Bu

Binbin Zhang

Wei Chen

Xin Xu

201

03 Nov 2022

Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Ozlem Kalinli

202

31 Oct 2022

Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Ganesh Ramakrishnan

162

30 Oct 2022

BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

256

29 Oct 2022

SAN: a robust end-to-end ASR model architecture

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Zeping Min

Qian Ge

Guanhua Huang

125

27 Oct 2022

Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Pradip Pramanick

Chayan Sarkar

221

21 Oct 2022

Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

Spoken Language Technology Workshop (SLT), 2022

Zhehuai Chen

Andrew Rosenberg

Bhuvana Ramabhadran

240

18 Oct 2022

Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting

235

18 Oct 2022

JOIST: A Joint Speech and Text Streaming Model For ASR

Spoken Language Technology Workshop (SLT), 2022

Zhehuai Chen

198

13 Oct 2022

Mitigating Unintended Memorization in Language Models via Alternating Teaching

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Zhe Liu

Xuedong Zhang

Fuchun Peng

130

13 Oct 2022

Scaling Up Deliberation for Multilingual ASR

Spoken Language Technology Workshop (SLT), 2022

309

11 Oct 2022

Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM

Interspeech (Interspeech), 2022

249

08 Sep 2022

Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Mitesh M. Khapra

159

26 Aug 2022

Adversarial Attacks on ASR Systems: An Overview

International Conference on Data Science in Cyberspace (ICDSC), 2022

132

03 Aug 2022

Speaker consistency loss and step-wise optimization for semi-supervised joint training of TTS and ASR using unpaired text data

Interspeech (Interspeech), 2022

Naoki Makishima

Satoshi Suzuki

Atsushi Ando

Ryo Masumura

324

11 Jul 2022

UserLibri: A Dataset for ASR Personalization Using Only Text

Interspeech (Interspeech), 2022

Theresa Breiner

Swaroop Indra Ramaswamy

143

02 Jul 2022

Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems

Interspeech (Interspeech), 2021

107

29 Jun 2022

On Comparison of Encoders for Attention based End to End Speech Recognition in Standalone and Rescoring Mode

International Conference on Signal Processing and Communications (ICSPC), 2022

Raviraj Joshi

Subodh Kumar

114

26 Jun 2022

A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data

Raviraj Joshi

Ashutosh Kumar Singh

185

22 Jun 2022

Residual Language Model for End-to-end Speech Recognition

Interspeech (Interspeech), 2022

148

15 Jun 2022

Deep Learning for Visual Speech Analysis: A Survey

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

321

22 May 2022

Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Guangzhi Sun

Chuxu Zhang

P. Woodland

208

18 May 2022

Detecting Unintended Memorization in Language-Model-Fused ASR

Interspeech (Interspeech), 2022

193

20 Apr 2022

Improving Rare Word Recognition with LM-aware MWER Training

Interspeech (Interspeech), 2022

...

182

15 Apr 2022

A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition

145

05 Apr 2022

Scaling Language Model Size in Cross-Device Federated Learning

148

31 Mar 2022

An Empirical Study of Language Model Integration for Transducer based Speech Recognition

Interspeech (Interspeech), 2022

Zhijian Ou

200

31 Mar 2022

End-to-end contextual ASR based on posterior distribution adaptation for hybrid CTC/attention system

Zheng Zhang

Pan Zhou

151

18 Feb 2022

AISHELL-NER: Named Entity Recognition from Chinese Speech

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Fei Huang

126

17 Feb 2022

Joint Speech Recognition and Audio Captioning

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

145

03 Feb 2022

Neural-FST Class Language Model for End-to-End Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Ozlem Kalinli

242

28 Jan 2022

Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR

Interspeech (Interspeech), 2022

Rao Ma

155

26 Jan 2022

A Likelihood Ratio based Domain Adaptation Method for E2E Models

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

142

10 Jan 2022

Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model

14 Dec 2021

Context-Aware Transformer Transducer for Speech Recognition

Automatic Speech Recognition & Understanding (ASRU), 2021

Feng-Ju Chang

Jing Liu

Martin H. Radfar

Athanasios Mouchtaris

M. Omologo

Ariya Rastrow

Siegfried Kunzmann

194

05 Nov 2021

Advances and Challenges in Deep Lip Reading

Mohammad Akbari

143

15 Oct 2021