Attention based on-device streaming speech recognition with large speech corpus

Automatic Speech Recognition & Understanding (ASRU), 2019

2 January 2020

Kwangyoun Kim

Papers citing "Attention based on-device streaming speech recognition with large speech corpus"

An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition

Spoken Language Technology Workshop (SLT), 2024

Hsin-Wei Wang

225

10 Sep 2024

DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition

Bi-Cheng Yan

252

26 Mar 2024

Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

Interspeech (Interspeech), 2023

Yifan Yang

Xie Chen

271

14 Sep 2023

End-to-End Speech Recognition: A Survey

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

353

268

03 Mar 2023

Streaming Parrotron for on-device speech-to-speech conversion

Interspeech (Interspeech), 2022

353

25 Oct 2022

Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition

Interspeech (Interspeech), 2022

238

01 Oct 2022

E-Branchformer: Branchformer with Enhanced merging for speech recognition

Spoken Language Technology Workshop (SLT), 2022

Kwangyoun Kim

461

163

30 Sep 2022

Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition

International Conference on Data Science and Advanced Analytics (DSAA), 2022

208

30 Sep 2022

Unified Modeling of Multi-Domain Multi-Device ASR Systems

International Conference on Text, Speech and Dialogue (TSD), 2022

229

13 May 2022

Neural-FST Class Language Model for End-to-End Speech Recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Ozlem Kalinli

293

28 Jan 2022

Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

188

25 Jan 2022

Two-Pass End-to-End ASR Model Compression

Automatic Speech Recognition & Understanding (ASRU), 2021

131

08 Jan 2022

Recent Advances in End-to-End Automatic Speech Recognition

APSIPA Transactions on Signal and Information Processing (TASIP), 2021

Jinyu Li

487

440

02 Nov 2021

Noisy Training Improves E2E ASR for the Edge

Ozlem Kalinli

264

09 Jul 2021

Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition

195

02 Jul 2021

Streaming end-to-end speech recognition with jointly trained neural feature enhancement

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

214

04 May 2021

WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition

190

08 Apr 2021

Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios

Interspeech (Interspeech), 2021

Ozlem Kalinli

170

06 Apr 2021

Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

Hirofumi Inaguma

Tatsuya Kawahara

398

28 Feb 2021

Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition

Intelligent Systems with Applications (ISA), 2021

Priyabrata Karmakar

S. Teng

Guojun Lu

164

14 Feb 2021

A review of on-device fully neural end-to-end automatic speech recognition algorithms

Asilomar Conference on Signals, Systems and Computers (Asilomar), 2020

267

14 Dec 2020

Alignment Restricted Streaming Recurrent Neural Network Transducer

253

05 Nov 2020

Iterative Compression of End-to-End ASR Model using AutoML

Interspeech (Interspeech), 2020

...

Alberto Gil C. P. Ramos

121

06 Aug 2020

Sequential Routing Framework: Fully Capsule Network-based Speech Recognition

Computer Speech and Language (CSL), 2020

Kwangyoun Kim

239

23 Jul 2020

Streaming Transformer ASR with Blockwise Synchronous Beam Search

E. Tsunoo

Yosuke Kashiwagi

Shinji Watanabe

387

25 Jun 2020

CTC-synchronous Training for Monotonic Attention Model

Hirofumi Inaguma

Masato Mimura

Tatsuya Kawahara

204

10 May 2020

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

370

10 Apr 2020

Small energy masking for improved neural network training for end-to-end speech recognition

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Chanwoo Kim

Kwangyoun Kim

S. Indurthi

149

15 Feb 2020

Power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition

Automatic Speech Recognition & Understanding (ASRU), 2019

Chanwoo Kim

Mehul Kumar

Kwangyoun Kim

Dhananjaya N. Gowda

177

22 Dec 2019

End-to-end training of a large vocabulary end-to-end speech recognition system

Automatic Speech Recognition & Understanding (ASRU), 2019

Kwangyoun Kim

...

190

22 Dec 2019

ShrinkML: End-to-End ASR Model Compression Using Reinforcement Learning

Interspeech (Interspeech), 2019

Łukasz Dudziak

Mohamed S. Abdelfattah

339

08 Jul 2019