Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2001.00577
Cited By
Attention based on-device streaming speech recognition with large speech corpus
Automatic Speech Recognition & Understanding (ASRU), 2019
2 January 2020
Kwangyoun Kim
Kyungmin Lee
Dhananjaya N. Gowda
Junmo Park
Sungsoo Kim
Sichen Jin
Young-Yoon Lee
Jinsu Yeo
Daehyun Kim
Seokyeong Jung
Jungin Lee
Myoungji Han
Chanwoo Kim
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Attention based on-device streaming speech recognition with large speech corpus"
31 / 31 papers shown
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Spoken Language Technology Workshop (SLT), 2024
Yi-Cheng Wang
Li-Ting Pai
Bi-Cheng Yan
Hsin-Wei Wang
Chi-Han Lin
Berlin Chen
225
2
0
10 Sep 2024
DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
Yi-Cheng Wang
Hsin-Wei Wang
Bi-Cheng Yan
Chi-Han Lin
Berlin Chen
252
3
0
26 Mar 2024
Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer
Interspeech (Interspeech), 2023
Peng Wang
Yifan Yang
Zheng Liang
Tian Tan
Shiliang Zhang
Xie Chen
271
1
0
14 Sep 2023
End-to-End Speech Recognition: A Survey
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Rohit Prabhavalkar
Takaaki Hori
Tara N. Sainath
Ralf Schluter
Shinji Watanabe
VLM
353
268
0
03 Mar 2023
Streaming Parrotron for on-device speech-to-speech conversion
Interspeech (Interspeech), 2022
Oleg Rybakov
Fadi Biadsy
Xia Zhang
Liyang Jiang
Phoenix Meadowlark
Shivani Agrawal
353
4
0
25 Oct 2022
Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Interspeech (Interspeech), 2022
Jash Rathod
Nauman Dawalatabad
Shatrughan Singh
Dhananjaya N. Gowda
238
14
0
01 Oct 2022
E-Branchformer: Branchformer with Enhanced merging for speech recognition
Spoken Language Technology Workshop (SLT), 2022
Kwangyoun Kim
Felix Wu
Yifan Peng
Jing Pan
Prashant Sridhar
Kyu Jeong Han
Shinji Watanabe
461
163
0
30 Sep 2022
Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition
International Conference on Data Science and Advanced Analytics (DSAA), 2022
Chendong Zhao
Jianzong Wang
Wentao Wei
Xiaoyang Qu
Haoqian Wang
Jing Xiao
208
2
0
30 Sep 2022
Unified Modeling of Multi-Domain Multi-Device ASR Systems
International Conference on Text, Speech and Dialogue (TSD), 2022
Soumyajit Mitra
Swayambhu Nath Ray
Bharat Padi
Arunasish Sen
Raghavendra Bilgi
Harish Arsikere
Shalini Ghosh
A. Srinivasamurthy
Sri Garimella
229
4
0
13 May 2022
Neural-FST Class Language Model for End-to-End Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
A. Bruguier
Duc Le
Rohit Prabhavalkar
Dangna Li
Zhe Liu
Bo Wang
Eun Chang
Fuchun Peng
Ozlem Kalinli
M. Seltzer
293
6
0
28 Jan 2022
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
E. Tsunoo
Chaitanya Narisetty
Michael Hentschel
Yosuke Kashiwagi
Shinji Watanabe
188
3
0
25 Jan 2022
Two-Pass End-to-End ASR Model Compression
Automatic Speech Recognition & Understanding (ASRU), 2021
Nauman Dawalatabad
Tushar Vatsal
Ashutosh Gupta
Sungsoo Kim
Shatrughan Singh
Dhananjaya N. Gowda
Chanwoo Kim
131
6
0
08 Jan 2022
Recent Advances in End-to-End Automatic Speech Recognition
APSIPA Transactions on Signal and Information Processing (TASIP), 2021
Jinyu Li
VLM
487
440
0
02 Nov 2021
Noisy Training Improves E2E ASR for the Edge
Dilin Wang
Yuan Shangguan
Haichuan Yang
P. Chuang
Jiatong Zhou
Meng Li
Ganesh Venkatesh
Ozlem Kalinli
Vikas Chandra
264
4
0
09 Jul 2021
Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Timo Lohrenz
P. Schwarz
Zhengyang Li
Tim Fingscheidt
195
11
0
02 Jul 2021
Streaming end-to-end speech recognition with jointly trained neural feature enhancement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Chanwoo Kim
Abhinav Garg
Dhananjaya N. Gowda
Seongkyu Mun
C. Han
AuLLM
214
6
0
04 May 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition
Zhichao Wang
Wenwen Yang
Pan Zhou
Wei Chen
RALM
190
18
0
08 Apr 2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios
Interspeech (Interspeech), 2021
Jay Mahadeokar
Yangyang Shi
Yuan Shangguan
Chunyang Wu
Alex Xiao
Hang Su
Duc Le
Ozlem Kalinli
Christian Fuegen
M. Seltzer
170
3
0
06 Apr 2021
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Hirofumi Inaguma
Tatsuya Kawahara
398
19
0
28 Feb 2021
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition
Intelligent Systems with Applications (ISA), 2021
Priyabrata Karmakar
S. Teng
Guojun Lu
164
37
0
14 Feb 2021
A review of on-device fully neural end-to-end automatic speech recognition algorithms
Asilomar Conference on Signals, Systems and Computers (Asilomar), 2020
Chanwoo Kim
Dhananjaya N. Gowda
Dongsoo Lee
Jiyeon Kim
Ankur Kumar
Sungsoo Kim
Abhinav Garg
C. Han
267
31
0
14 Dec 2020
Alignment Restricted Streaming Recurrent Neural Network Transducer
Jay Mahadeokar
Yuan Shangguan
Duc Le
Gil Keren
Hang Su
Thong Le
Ching-Feng Yeh
Christian Fuegen
M. Seltzer
AI4TS
253
69
0
05 Nov 2020
Iterative Compression of End-to-End ASR Model using AutoML
Interspeech (Interspeech), 2020
Abhinav Mehrotra
Łukasz Dudziak
Jinsu Yeo
Young-Yoon Lee
Ravichander Vipperla
...
Samin S. Ishtiaq
Alberto Gil C. P. Ramos
Sangjeong Lee
Daehyun Kim
Nicholas D. Lane
OffRL
121
9
0
06 Aug 2020
Sequential Routing Framework: Fully Capsule Network-based Speech Recognition
Computer Speech and Language (CSL), 2020
Kyungmin Lee
Hyunwhan Joe
Hyeon-Seon Lim
Kwangyoun Kim
Sungsoo Kim
C. Han
H. Kim
239
5
0
23 Jul 2020
Streaming Transformer ASR with Blockwise Synchronous Beam Search
E. Tsunoo
Yosuke Kashiwagi
Shinji Watanabe
387
11
0
25 Jun 2020
CTC-synchronous Training for Monotonic Attention Model
Hirofumi Inaguma
Masato Mimura
Tatsuya Kawahara
204
7
0
10 May 2020
Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Hirofumi Inaguma
Yashesh Gaur
Liang Lu
Jinyu Li
Jiawei Liu
AI4TS
370
49
0
10 Apr 2020
Small energy masking for improved neural network training for end-to-end speech recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Chanwoo Kim
Kwangyoun Kim
S. Indurthi
149
9
0
15 Feb 2020
power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition
Automatic Speech Recognition & Understanding (ASRU), 2019
Chanwoo Kim
Mehul Kumar
Kwangyoun Kim
Dhananjaya N. Gowda
177
9
0
22 Dec 2019
end-to-end training of a large vocabulary end-to-end speech recognition system
Automatic Speech Recognition & Understanding (ASRU), 2019
Chanwoo Kim
Sungsoo Kim
Kwangyoun Kim
Mehul Kumar
Jiyeon Kim
...
Eunhyang Kim
Minkyoo Shin
Shatrughan Singh
Larry Heck
Dhananjaya N. Gowda
190
27
0
22 Dec 2019
ShrinkML: End-to-End ASR Model Compression Using Reinforcement Learning
Interspeech (Interspeech), 2019
Łukasz Dudziak
Mohamed S. Abdelfattah
Ravichander Vipperla
Stefanos Laskaridis
Nicholas D. Lane
OffRL
339
20
0
08 Jul 2019
1
Page 1 of 1