Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition

16 May 2020

Jiangyan Yi

Papers citing "Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition"

42 / 42 papers shown

H-PRM: A Pluggable Hotword Pre-Retrieval Module for Various Speech Recognition Systems

153

22 Aug 2025

Spiking Transformer with Spatial-Temporal AttentionComputer Vision and Pattern Recognition (CVPR), 2024

507

29 Sep 2024

Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition

Keyu An

Zerui Li

Zhifu Gao

Shiliang Zhang

300

26 Sep 2024

DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents

Thomas Constum

Pierrick Tranouez

Thierry Paquet

305

12 Jul 2024

IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition

507

19 Dec 2023

U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword BiasAutomatic Speech Recognition & Understanding (ASRU), 2023

Lei Xie

314

15 Dec 2023

Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2023

Binbin Zhang

Lei Xie

219

07 Oct 2023

Semi-Autoregressive Streaming ASR With Label ContextIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Shinji Watanabe

322

19 Sep 2023

TST: Time-Sparse Transducer for Automatic Speech RecognitionCAAI International Conference on Artificial Intelligence (ICCAI), 2023

Jiangyan Yi

188

17 Jul 2023

Spike-driven TransformerNeural Information Processing Systems (NeurIPS), 2023

Yonghong Tian

304

263

04 Jul 2023

A Lexical-aware Non-autoregressive Transformer-based ASR ModelInterspeech (Interspeech), 2023

Chong Lin

Kuan-Yu Chen

AI4TS

143

18 May 2023

A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

257

15 Apr 2023

Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition

295

23 Mar 2023

Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross EntropyIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

256

14 Mar 2023

Peak-First CTC: Reducing the Peak Latency of CTC Models by Applying Peak-First RegularizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

200

07 Nov 2022

Linguistic-Enhanced Transformer with CTC Embedding for Speech RecognitionInternational Conference on Mobile Ad-hoc and Sensor Networks (MSN), 2022

171

25 Oct 2022

Revisiting Checkpoint Averaging for Neural Machine Translation

391

21 Oct 2022

Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding

275

16 Oct 2022

Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech RecognitionInterspeech (Interspeech), 2022

321

197

16 Jun 2022

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASRInterspeech (Interspeech), 2022

326

01 Apr 2022

Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Pengyuan Zhang

250

25 Jan 2022

Recent Advances in End-to-End Automatic Speech RecognitionAPSIPA Transactions on Signal and Information Processing (TASIP), 2021

Jinyu Li

VLM

573

448

02 Nov 2021

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text GenerationAutomatic Speech Recognition & Understanding (ASRU), 2021

Tianzi Wang

199

11 Oct 2021

Non-autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition

199

14 Sep 2021

PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text RecognitionACM Multimedia (ACM MM), 2021

337

09 Sep 2021

Decoupling recognition and transcription in Mandarin ASR

254

02 Aug 2021

Streaming End-to-End ASR based on Blockwise Non-Autoregressive ModelsInterspeech (Interspeech), 2021

Tianzi Wang

Yuya Fujita

Xuankai Chang

Shinji Watanabe

281

20 Jul 2021

Conformer-based End-to-end Speech Recognition With Rotary Position Embedding

Shengqiang Li

Menglong Xu

Xiao-Lei Zhang

283

13 Jul 2021

An Improved Single Step Non-autoregressive Transformer for Automatic Speech RecognitionInterspeech (Interspeech), 2021

354

18 Jun 2021

Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain

Pengcheng Guo

Xuankai Chang

Shinji Watanabe

Lei Xie

233

16 Jun 2021

Efficient conformer-based speech recognition with linear attentionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021

Shengqiang Li

Menglong Xu

Xiao-Lei Zhang

248

14 Apr 2021

Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASRAutomatic Speech Recognition & Understanding (ASRU), 2021

Lei Xie

158

10 Apr 2021

TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech RecognitionIEEE Signal Processing Letters (IEEE SPL), 2021

Jiangyan Yi

151

04 Apr 2021

Transformer-based end-to-end speech recognition with residual Gaussian-based self-attentionInterspeech (Interspeech), 2021

Chen Liang

Menglong Xu

Xiao-Lei Zhang

265

29 Mar 2021

Intermediate Loss Regularization for CTC-based Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Jaesong Lee

Shinji Watanabe

319

161

05 Feb 2021

Joint Entity and Relation Extraction with Set Prediction Networks

Dianbo Sui

Yubo Chen

Kang Liu

Jun Zhao

Xiangrong Zeng

Shengping Liu

302

196

03 Nov 2020

Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder InputIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

Zhiyong Wu

285

28 Oct 2020

Bridging the Modality Gap for Speech-to-Text Translation

261

28 Oct 2020

CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

204

28 Oct 2020

Improved Mask-CTC for Non-Autoregressive End-to-End ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

444

26 Oct 2020

Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoderIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

423

25 Oct 2020

Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention

Menglong Xu

Shengqiang Li

Xiao-Lei Zhang

295

23 Oct 2020