ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.07903
  4. Cited By
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech
  Recognition

Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition

16 May 2020
Zhengkun Tian
Jiangyan Yi
Jianhua Tao
Ye Bai
Shuai Zhang
Zhengqi Wen
ArXiv (abs)PDFHTML

Papers citing "Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition"

42 / 42 papers shown
H-PRM: A Pluggable Hotword Pre-Retrieval Module for Various Speech Recognition Systems
H-PRM: A Pluggable Hotword Pre-Retrieval Module for Various Speech Recognition Systems
Huangyu Dai
Lingtao Mao
Ben Chen
Zihan Wang
Zihan Liang
Ying Han
Chenyi Lei
Han Li
KELM
153
0
0
22 Aug 2025
Spiking Transformer with Spatial-Temporal Attention
Spiking Transformer with Spatial-Temporal AttentionComputer Vision and Pattern Recognition (CVPR), 2024
Donghyun Lee
Yuhang Li
Youngeun Kim
Shiting Xiao
Priyadarshini Panda
507
24
0
29 Sep 2024
Paraformer-v2: An improved non-autoregressive transformer for
  noise-robust speech recognition
Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition
Keyu An
Zerui Li
Zhifu Gao
Shiliang Zhang
300
6
0
26 Sep 2024
DANIEL: A fast Document Attention Network for Information Extraction and
  Labelling of handwritten documents
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents
Thomas Constum
Pierrick Tranouez
Thierry Paquet
305
11
0
12 Jul 2024
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Can Ma
DiffM
507
11
0
19 Dec 2023
U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword
  Bias
U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword BiasAutomatic Speech Recognition & Understanding (ASRU), 2023
Aoting Zhang
Pan Zhou
Kaixun Huang
Yong Zou
Ming Liu
Lei Xie
314
9
0
15 Dec 2023
Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech
  Recognition
Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2023
Kaixun Huang
Aoting Zhang
Binbin Zhang
Tianyi Xu
Xingchen Song
Lei Xie
219
5
0
07 Oct 2023
Semi-Autoregressive Streaming ASR With Label Context
Semi-Autoregressive Streaming ASR With Label ContextIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Siddhant Arora
G. Saon
Shinji Watanabe
Brian Kingsbury
AI4TS
322
12
0
19 Sep 2023
TST: Time-Sparse Transducer for Automatic Speech Recognition
TST: Time-Sparse Transducer for Automatic Speech RecognitionCAAI International Conference on Artificial Intelligence (ICCAI), 2023
Xiaohui Zhang
Mangui Liang
Zhengkun Tian
Jiangyan Yi
Jianhua Tao
188
0
0
17 Jul 2023
Spike-driven Transformer
Spike-driven TransformerNeural Information Processing Systems (NeurIPS), 2023
Man Yao
Jiakui Hu
Zhaokun Zhou
Liuliang Yuan
Yonghong Tian
Boxing Xu
Guoqi Li
304
263
0
04 Jul 2023
A Lexical-aware Non-autoregressive Transformer-based ASR Model
A Lexical-aware Non-autoregressive Transformer-based ASR ModelInterspeech (Interspeech), 2023
Chong Lin
Kuan-Yu Chen
AI4TS
143
3
0
18 May 2023
A CTC Alignment-based Non-autoregressive Transformer for End-to-end
  Automatic Speech Recognition
A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Ruchao Fan
Wei Chu
Peng Chang
Abeer Alwan
257
20
0
15 Apr 2023
Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for
  Mandarin Speech Recognition
Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition
Kai Liu
Hailiang Xiong
Gangqiang Yang
Zhengfeng Du
Yewen Cao
D. Shah
295
0
0
23 Mar 2023
Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy
Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross EntropyIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xulong Zhang
Haobin Tang
Jianzong Wang
Ning Cheng
Jian Luo
Jing Xiao
256
4
0
14 Mar 2023
Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying
  Peak-First Regularization
Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First RegularizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Zhengkun Tian
Hongyu Xiang
Min Li
Fei Lin
Ke Ding
Guanglu Wan
200
7
0
07 Nov 2022
Linguistic-Enhanced Transformer with CTC Embedding for Speech
  Recognition
Linguistic-Enhanced Transformer with CTC Embedding for Speech RecognitionInternational Conference on Mobile Ad-hoc and Sensor Networks (MSN), 2022
Xulong Zhang
Jianzong Wang
Ning Cheng
Mengyuan Zhao
Zhiyong Zhang
Jing Xiao
171
1
0
25 Oct 2022
Revisiting Checkpoint Averaging for Neural Machine Translation
Revisiting Checkpoint Averaging for Neural Machine Translation
Yingbo Gao
Christian Herold
Zijian Yang
Hermann Ney
MoMe
391
13
0
21 Oct 2022
Acoustic-aware Non-autoregressive Spell Correction with Mask Sample
  Decoding
Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding
Ruchao Fan
Guoli Ye
Yashesh Gaur
Jinyu Li
275
4
0
16 Oct 2022
Paraformer: Fast and Accurate Parallel Transformer for
  Non-autoregressive End-to-End Speech Recognition
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech RecognitionInterspeech (Interspeech), 2022
Zhifu Gao
Shiliang Zhang
Ian Mcloughlin
Zhijie Yan
321
197
0
16 Jun 2022
InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASRInterspeech (Interspeech), 2022
Yumi Nakagome
Tatsuya Komatsu
Yusuke Fujita
Shuta Ichimura
Yusuke Kida
326
7
0
01 Apr 2022
Improving non-autoregressive end-to-end speech recognition with
  pre-trained acoustic and language models
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Keqi Deng
Zehui Yang
Shinji Watanabe
Yosuke Higuchi
Gaofeng Cheng
Pengyuan Zhang
250
31
0
25 Jan 2022
Recent Advances in End-to-End Automatic Speech Recognition
Recent Advances in End-to-End Automatic Speech RecognitionAPSIPA Transactions on Signal and Information Processing (TASIP), 2021
Jinyu Li
VLM
573
448
0
02 Nov 2021
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text
  Generation
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text GenerationAutomatic Speech Recognition & Understanding (ASRU), 2021
Yosuke Higuchi
Nanxin Chen
Yuya Fujita
Hirofumi Inaguma
Tatsuya Komatsu
Jaesong Lee
Jumon Nozaki
Tianzi Wang
Shinji Watanabe
199
51
0
11 Oct 2021
Non-autoregressive Transformer with Unified Bidirectional Decoder for
  Automatic Speech Recognition
Non-autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition
Chuan-Fei Zhang
Wenshu Fan
Tianren Zhang
Songlu Chen
Feng Chen
Xu-Cheng Yin
199
11
0
14 Sep 2021
PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text
  Recognition
PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text RecognitionACM Multimedia (ACM MM), 2021
Zhi Qiao
Can Ma
Jin Wei
Wei Wang
Yuanqing Zhang
Ning Jiang
Hongbin Wang
Weiping Wang
337
84
0
09 Sep 2021
Decoupling recognition and transcription in Mandarin ASR
Decoupling recognition and transcription in Mandarin ASR
Jiahong Yuan
Xingyu Cai
Dongji Gao
Renjie Zheng
Liang Huang
Kenneth Church
254
14
0
02 Aug 2021
Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
Streaming End-to-End ASR based on Blockwise Non-Autoregressive ModelsInterspeech (Interspeech), 2021
Tianzi Wang
Yuya Fujita
Xuankai Chang
Shinji Watanabe
281
18
0
20 Jul 2021
Conformer-based End-to-end Speech Recognition With Rotary Position
  Embedding
Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
283
10
0
13 Jul 2021
An Improved Single Step Non-autoregressive Transformer for Automatic
  Speech Recognition
An Improved Single Step Non-autoregressive Transformer for Automatic Speech RecognitionInterspeech (Interspeech), 2021
Ruchao Fan
Wei Chu
Peng Chang
Jing Xiao
Abeer Alwan
354
20
0
18 Jun 2021
Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and
  Conditional Speaker Chain
Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain
Pengcheng Guo
Xuankai Chang
Shinji Watanabe
Lei Xie
233
22
0
16 Jun 2021
Efficient conformer-based speech recognition with linear attention
Efficient conformer-based speech recognition with linear attentionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
248
28
0
14 Apr 2021
Boundary and Context Aware Training for CIF-based Non-Autoregressive
  End-to-end ASR
Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASRAutomatic Speech Recognition & Understanding (ASRU), 2021
Fan Yu
Haoneng Luo
Pengcheng Guo
Yuhao Liang
Zhuoyuan Yao
Lei Xie
Yingying Gao
Leijing Hou
Shilei Zhang
158
14
0
10 Apr 2021
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech
  Recognition
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech RecognitionIEEE Signal Processing Letters (IEEE SPL), 2021
Zhengkun Tian
Jiangyan Yi
Jianhua Tao
Ye Bai
Shuai Zhang
Zhengqi Wen
Zhengqi Wen
151
24
0
04 Apr 2021
Transformer-based end-to-end speech recognition with residual Gaussian-based self-attentionInterspeech (Interspeech), 2021
Chen Liang
Menglong Xu
Xiao-Lei Zhang
265
9
0
29 Mar 2021
Intermediate Loss Regularization for CTC-based Speech Recognition
Intermediate Loss Regularization for CTC-based Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Jaesong Lee
Shinji Watanabe
319
161
0
05 Feb 2021
Joint Entity and Relation Extraction with Set Prediction Networks
Joint Entity and Relation Extraction with Set Prediction Networks
Dianbo Sui
Yubo Chen
Kang Liu
Jun Zhao
Xiangrong Zeng
Shengping Liu
302
196
0
03 Nov 2020
Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input
Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder InputIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Xingcheng Song
Zhiyong Wu
Yiheng Huang
Chao Weng
Jane Polak Scowcroft
Helen Meng
285
42
0
28 Oct 2020
Bridging the Modality Gap for Speech-to-Text Translation
Bridging the Modality Gap for Speech-to-Text Translation
Yuchen Liu
Junnan Zhu
Jiajun Zhang
Chengqing Zong
261
76
0
28 Oct 2020
CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer
  for Speech Recognition
CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Ruchao Fan
Wei Chu
Peng Chang
Jing Xiao
204
43
0
28 Oct 2020
Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Improved Mask-CTC for Non-Autoregressive End-to-End ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Yosuke Higuchi
Hirofumi Inaguma
Shinji Watanabe
Tetsuji Ogawa
Tetsunori Kobayashi
444
68
0
26 Oct 2020
Orthros: Non-autoregressive End-to-end Speech Translation with
  Dual-decoder
Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Hirofumi Inaguma
Yosuke Higuchi
Kevin Duh
Tatsuya Kawahara
Shinji Watanabe
423
26
0
25 Oct 2020
Transformer-based End-to-End Speech Recognition with Local Dense
  Synthesizer Attention
Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Menglong Xu
Shengqiang Li
Xiao-Lei Zhang
295
37
0
23 Oct 2020
1
Page 1 of 1