Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
15 January 2020
Haoran Miao
Gaofeng Cheng
Changfeng Gao
Pengyuan Zhang
Yonghong Yan

Papers citing "Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture"

31 / 31 papers shown
Spiralformer: Low Latency Encoder for Streaming Speech Recognition with Circular Layer Skipping and Early Exiting
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
92
0
0
01 Oct 2025
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Junhyeok Kim
Min Soo Kim
Jiwan Chung
Jungbin Cho
Jisoo Kim
Sungwoong Kim
Gyeongbo Sim
Youngjae Yu
EgoV
145
3
0
17 Feb 2025
Research on an improved Conformer end-to-end Speech Recognition Model with R-Drop Structure
Weidong Ji
Shijie Zan
Guohui Zhou
Xu Wang
SyDa
174
1
0
14 Jun 2023
Streaming Speech-to-Confusion Network Speech Recognition
Interspeech (Interspeech), 2023
Denis Filimonov
Prabhat Pandey
Ariya Rastrow
Ankur Gandhe
A. Stolcke
HAI
178
0
0
02 Jun 2023
Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation
Interspeech (Interspeech), 2023
Hanbyul Kim
S. Seo
Lukas Lee
Seolki Baek
105
3
0
02 Jun 2023
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yuguang Yang
Yu Pan
Jingjing Yin
Jiangyu Han
Lei Ma
Heng Lu
111
14
0
15 Mar 2023
UFO2: A unified pre-training framework for online and offline speech recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Li Fu
Siqi Li
Qingtao Li
L. Deng
Fangzhu Li
Lu Fan
Meng Chen
Xiaodong He
OffRL
273
8
0
26 Oct 2022
Attention Enhanced Citrinet for Speech Recognition
Interspeech (Interspeech), 2022
Xianchao Wu
182
1
0
01 Sep 2022
Deep Sparse Conformer for Speech Recognition
Interspeech (Interspeech), 2022
Xianchao Wu
110
2
0
01 Sep 2022
Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Jicheng Zhang
Yizhou Peng
Haihua Xu
Yi He
Chng Eng Siong
Hao-Ming Huang
AuLLM
227
7
0
09 Jul 2022
Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
Interspeech (Interspeech), 2022
Zehan Li
Haoran Miao
Keqi Deng
Gaofeng Cheng
Sanli Tian
Ta Li
Yonghong Yan
KELM
179
5
0
06 Jul 2022
Boosting Cross-Domain Speech Recognition with Self-Supervision
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Hanjing Zhu
Gaofeng Cheng
Yongfeng Zhang
Wenxin Hou
Pengyuan Zhang
Yonghong Yan
326
22
0
20 Jun 2022
CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR
Interspeech (Interspeech), 2022
Keyu An
Huahuan Zheng
Zhijian Ou
Hongyu Xiang
Ke Ding
Guanglu Wan
AI4TS
153
21
0
31 Mar 2022
WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit
Interspeech (Interspeech), 2022
Binbin Zhang
Di Wu
Zhendong Peng
Xingcheng Song
Zhuoyuan Yao
Hang Lv
Linfu Xie
Chao Yang
Fuping Pan
Jianwei Niu
VLM
256
127
0
29 Mar 2022
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
E. Tsunoo
Chaitanya Narisetty
Michael Hentschel
Yosuke Kashiwagi
Shinji Watanabe
127
3
0
25 Jan 2022
Recent Advances in End-to-End Automatic Speech Recognition
APSIPA Transactions on Signal and Information Processing (TASIP), 2021
Jinyu Li
VLM
403
425
0
02 Nov 2021
A Melody-Unsupervision Model for Singing Voice Synthesis
Soonbeom Choi
Juhan Nam
135
15
0
13 Oct 2021
Deformable TDNN with adaptive receptive fields for speech recognition
Interspeech (Interspeech), 2021
Keyu An
Yi Zhang
Zhijian Ou
82
5
0
30 Apr 2021
WNARS: WFST based Non-autoregressive Streaming End-to-End Speech Recognition
Zhichao Wang
Wenwen Yang
Pan Zhou
Wei Chen
RALM
142
18
0
08 Apr 2021
Mutually-Constrained Monotonic Multihead Attention for Online ASR
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jae-gyun Song
Hajin Shim
Eunho Yang
63
0
0
26 Mar 2021
Parallelizing Legendre Memory Unit Training
International Conference on Machine Learning (ICML), 2021
Narsimha Chilkuri
C. Eliasmith
195
44
0
22 Feb 2021
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition
Intelligent Systems with Applications (ISA), 2021
Priyabrata Karmakar
S. Teng
Guojun Lu
126
35
0
14 Feb 2021
WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit
Interspeech (Interspeech), 2021
Zhuoyuan Yao
Di Wu
Xiong Wang
Binbin Zhang
Fan Yu
Chao Yang
Zhendong Peng
Xiaoyu Chen
Lei Xie
X. Lei
343
307
0
02 Feb 2021
Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications
ETRI Journal (ETRI J.), 2021
Y. Oh
Kiyoung Park
Jeongue Park
OffRL
294
6
0
14 Jan 2021
Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling
Jiahui Yu
Wei Han
Anmol Gulati
Chung-Cheng Chiu
Yue Liu
Tara N. Sainath
Yonghui Wu
Ruoming Pang
327
19
0
12 Oct 2020
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
T. Nguyen
S. Stueker
A. Waibel
BDL
329
41
0
07 Oct 2020
Large-scale Transfer Learning for Low-resource Spoken Language Understanding
Interspeech (Interspeech), 2020
X. Jia
Jianzong Wang
Zhiyong Zhang
Ning Cheng
Jing Xiao
158
17
0
13 Aug 2020
Transformer with Bidirectional Decoder for Speech Recognition
Interspeech (Interspeech), 2020
Xi Chen
Songyang Zhang
Dandan Song
P. Ouyang
Shouyi Yin
131
15
0
11 Aug 2020
Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection
Danni Liu
Gerasimos Spanakis
Jan Niehues
127
60
0
22 May 2020
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition
Linhao Dong
Cheng Yi
Jianzong Wang
Shiyu Zhou
Shuang Xu
X. Jia
Bo Xu
130
18
0
20 May 2020
A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Dongwei Jiang
Wubo Li
Ruixiong Zhang
Miao Cao
Ne Luo
Yang Han
Wei Zou
Xiangang Li
SSL
201
31
0
20 May 2020