v1v2 (latest)

Librispeech Transducer Model with Internal Language Model Prior Correction

Interspeech (Interspeech), 2021

7 April 2021

Papers citing "Librispeech Transducer Model with Internal Language Model Prior Correction"

26 / 26 papers shown

OrdMoE: Preference Alignment via Hierarchical Expert Group Ranking in Multimodal Mixture-of-Experts LLMs

120

24 Nov 2025

Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation

...

349

28 Oct 2025

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

...

393

26 Oct 2025

M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance

...

578

26 Feb 2025

Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Chien-Chun Wang

Li-Wei Chen

Cheng-Kang Chou

Hung-Shin Lee

Berlin Chen

Hsin-Min Wang

242

19 Sep 2024

Effective internal language model training and fusion for factorized transducer modelIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Ozlem Kalinli

189

02 Apr 2024

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation

229

16 Sep 2023

Chunked Attention-based Encoder-Decoder Model for Streaming Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

306

15 Sep 2023

Hybrid Attention-based Encoder-decoder Model for Efficient Language Model AdaptationSpoken Language Technology Workshop (SLT), 2023

206

14 Sep 2023

Improving Language Model Integration for Neural Machine TranslationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

189

08 Jun 2023

End-to-End Speech Recognition: A SurveyIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

285

243

03 Mar 2023

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Andrew Rosenberg

Bhuvana Ramabhadran

AuLLM VLM

207

16 Feb 2023

Internal Language Model Estimation based Adaptive Language Model Fusion for Domain AdaptationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Rao Ma

170

02 Nov 2022

Modular Hybrid Autoregressive TransducerSpoken Language Technology Workshop (SLT), 2022

...

Bhuvana Ramabhadran

181

31 Oct 2022

AutoLV: Automatic Lecture Video GeneratorInternational Conference on Information Photonics (ICIP), 2022

216

19 Sep 2022

Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition

134

09 Jul 2022

Residual Language Model for End-to-end Speech RecognitionInterspeech (Interspeech), 2022

143

15 Jun 2022

Domain Adaptation of low-resource Target-Domain models using well-trained ASR Conformer ModelsSpoken Language Technology Workshop (SLT), 2022

Vrunda N. Sukhadia

S. Umesh

286

18 Feb 2022

Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASRInterspeech (Interspeech), 2022

Rao Ma

140

26 Jan 2022

A Study of Transducer based End-to-End ASR with ESPnet: Architecture, Auxiliary Loss and Decoding StrategiesAutomatic Speech Recognition & Understanding (ASRU), 2021

260

14 Jan 2022

PM-MMUT: Boosted Phone-Mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition

273

13 Dec 2021

Recent Advances in End-to-End Automatic Speech RecognitionAPSIPA Transactions on Signal and Information Processing (TASIP), 2021

Jinyu Li

VLM

424

425

02 Nov 2021

On Language Model Integration for RNN Transducer based Speech Recognition

253

13 Oct 2021

Back from the future: bidirectional CTC decoding using future information in speech recognition

Namkyu Jung

Geon-min Kim

Han-Gyu Kim

227

07 Oct 2021

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition

Xie Chen

213

06 Oct 2021

Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition ArchitecturesAutomatic Speech Recognition & Understanding (ASRU), 2021

170

12 Apr 2021