arXiv:2001.08290
Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
15 January 2020
Haoran Miao
Gaofeng Cheng
Changfeng Gao
Pengyuan Zhang
Yonghong Yan
Papers citing
"Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture"
31 / 31 papers shown
Spiralformer: Low Latency Encoder for Streaming Speech Recognition with Circular Layer Skipping and Early Exiting
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
92
0
0
01 Oct 2025
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Junhyeok Kim
Min Soo Kim
Jiwan Chung
Jungbin Cho
Jisoo Kim
Sungwoong Kim
Gyeongbo Sim
Youngjae Yu
EgoV
145
3
0
17 Feb 2025
Research on an improved Conformer end-to-end Speech Recognition Model with R-Drop Structure
Weidong Ji
Shijie Zan
Guohui Zhou
Xu Wang
SyDa
174
1
0
14 Jun 2023
Streaming Speech-to-Confusion Network Speech Recognition
Interspeech (Interspeech), 2023
Denis Filimonov
Prabhat Pandey
Ariya Rastrow
Ankur Gandhe
A. Stolcke
HAI
178
0
0
02 Jun 2023
Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation
Interspeech (Interspeech), 2023
Hanbyul Kim
S. Seo
Lukas Lee
Seolki Baek
105
3
0
02 Jun 2023
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yuguang Yang
Yu Pan
Jingjing Yin
Jiangyu Han
Lei Ma
Heng Lu
111
14
0
15 Mar 2023
UFO2: A unified pre-training framework for online and offline speech recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Li Fu
Siqi Li
Qingtao Li
L. Deng
Fangzhu Li
Lu Fan
Meng Chen
Xiaodong He
OffRL
273
8
0
26 Oct 2022
Attention Enhanced Citrinet for Speech Recognition
Interspeech (Interspeech), 2022
Xianchao Wu
182
1
0
01 Sep 2022
Deep Sparse Conformer for Speech Recognition
Interspeech (Interspeech), 2022
Xianchao Wu
110
2
0
01 Sep 2022
Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Jicheng Zhang
Yizhou Peng
Haihua Xu
Yi He
Chng Eng Siong
Hao-Ming Huang
AuLLM
227
7
0
09 Jul 2022
Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Interspeech (Interspeech), 2022
Zehan Li
Haoran Miao
Keqi Deng
Gaofeng Cheng
Sanli Tian
Ta Li
Yonghong Yan
KELM
179
5
0
06 Jul 2022
Boosting Cross-Domain Speech Recognition with Self-Supervision
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Hanjing Zhu
Gaofeng Cheng
Yongfeng Zhang
Wenxin Hou
Pengyuan Zhang
Yonghong Yan
326
22
0
20 Jun 2022
CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR
Interspeech (Interspeech), 2022
Keyu An
Huahuan Zheng
Zhijian Ou
Hongyu Xiang
Ke Ding
Guanglu Wan
AI4TS
153
21
0
31 Mar 2022
WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
Interspeech (Interspeech), 2022
Binbin Zhang
Di Wu
Zhendong Peng
Xingcheng Song
Zhuoyuan Yao
Hang Lv
Linfu Xie
Chao Yang
Fuping Pan
Jianwei Niu
VLM
256
127
0
29 Mar 2022
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
E. Tsunoo
Chaitanya Narisetty
Michael Hentschel
Yosuke Kashiwagi
Shinji Watanabe
127
3
0
25 Jan 2022
Recent Advances in End-to-End Automatic Speech Recognition
APSIPA Transactions on Signal and Information Processing (TASIP), 2021
Jinyu Li
VLM
403
425
0
02 Nov 2021
A Melody-Unsupervision Model for Singing Voice Synthesis
Soonbeom Choi
Juhan Nam
135
15
0
13 Oct 2021
Deformable TDNN with adaptive receptive fields for speech recognition
Interspeech (Interspeech), 2021
Keyu An
Yi Zhang
Zhijian Ou
82
5
0
30 Apr 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition
Zhichao Wang
Wenwen Yang
Pan Zhou
Wei Chen
RALM
142
18
0
08 Apr 2021
Mutually-Constrained Monotonic Multihead Attention for Online ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jae-gyun Song
Hajin Shim
Eunho Yang
63
0
0
26 Mar 2021
Parallelizing Legendre Memory Unit Training
International Conference on Machine Learning (ICML), 2021
Narsimha Chilkuri
C. Eliasmith
195
44
0
22 Feb 2021
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition
Intelligent Systems with Applications (ISA), 2021
Priyabrata Karmakar
S. Teng
Guojun Lu
126
35
0
14 Feb 2021
WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit
Interspeech (Interspeech), 2021
Zhuoyuan Yao
Di Wu
Xiong Wang
Binbin Zhang
Fan Yu
Chao Yang
Zhendong Peng
Xiaoyu Chen
Lei Xie
X. Lei
343
307
0
02 Feb 2021
Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications
ETRI Journal (ETRI J.), 2021
Y. Oh
Kiyoung Park
Jeongue Park
OffRL
294
6
0
14 Jan 2021
Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling
Jiahui Yu
Wei Han
Anmol Gulati
Chung-Cheng Chiu
Yue Liu
Tara N. Sainath
Yonghui Wu
Ruoming Pang
327
19
0
12 Oct 2020
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
T. Nguyen
S. Stueker
A. Waibel
BDL
329
41
0
07 Oct 2020
Large-scale Transfer Learning for Low-resource Spoken Language Understanding
Interspeech (Interspeech), 2020
X. Jia
Jianzong Wang
Zhiyong Zhang
Ning Cheng
Jing Xiao
158
17
0
13 Aug 2020
Transformer with Bidirectional Decoder for Speech Recognition
Interspeech (Interspeech), 2020
Xi Chen
Songyang Zhang
Dandan Song
P. Ouyang
Shouyi Yin
131
15
0
11 Aug 2020
Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection
Danni Liu
Gerasimos Spanakis
Jan Niehues
127
60
0
22 May 2020
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition
Linhao Dong
Cheng Yi
Jianzong Wang
Shiyu Zhou
Shuang Xu
X. Jia
Bo Xu
130
18
0
20 May 2020
A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Dongwei Jiang
Wubo Li
Ruixiong Zhang
Miao Cao
Ne Luo
Yang Han
Wei Zou
Xiangang Li
SSL
201
31
0
20 May 2020