ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.13037
  4. Cited By
Self-Attention Transducers for End-to-End Speech Recognition

Self-Attention Transducers for End-to-End Speech Recognition

28 September 2019
Zhengkun Tian
Jiangyan Yi
J. Tao
Ye Bai
Zhengqi Wen
    AI4TS
ArXivPDFHTML

Papers citing "Self-Attention Transducers for End-to-End Speech Recognition"

49 / 49 papers shown
Title
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers
Adam Stooke
Rohit Prabhavalkar
K. Sim
P. M. Mengibar
39
0
0
06 Feb 2025
A Decade of Deep Learning: A Survey on The Magnificent Seven
A Decade of Deep Learning: A Survey on The Magnificent Seven
Dilshod Azizov
Muhammad Arslan Manzoor
Velibor Bojkovic
Yingxu Wang
Zhilin Wang
...
Liang Li
Siwei Liu
Yu Zhong
Wei Liu
Shangsong Liang
OOD
AI4TS
MedIm
120
0
0
13 Dec 2024
XCB: an effective contextual biasing approach to bias cross-lingual
  phrases in speech recognition
XCB: an effective contextual biasing approach to bias cross-lingual phrases in speech recognition
Xucheng Wan
Naijun Zheng
Kai Liu
Huan Zhou
27
0
0
20 Aug 2024
TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration
  Transducer
TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration Transducer
Yu Xi
Hao Li
Baochen Yang
Haoyu Li
Hai-kun Xu
Kai Yu
35
1
0
20 Mar 2024
Improving End-to-End Speech Processing by Efficient Text Data
  Utilization with Latent Synthesis
Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis
Jianqiao Lu
Wenyong Huang
Nianzu Zheng
Xingshan Zeng
Y. Yeung
Xiao Chen
SyDa
24
1
0
09 Oct 2023
Token-Level Serialized Output Training for Joint Streaming ASR and ST
  Leveraging Textual Alignments
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments
Sara Papi
Peidong Wan
Junkun Chen
Jian Xue
Jinyu Li
Yashesh Gaur
26
8
0
07 Jul 2023
GNCformer Enhanced Self-attention for Automatic Speech Recognition
GNCformer Enhanced Self-attention for Automatic Speech Recognition
Jiashi Li
Z. Duan
S. Li
X. Yu
G. Yang
13
1
0
22 May 2023
A Lexical-aware Non-autoregressive Transformer-based ASR Model
A Lexical-aware Non-autoregressive Transformer-based ASR Model
Chong Lin
Kuan-Yu Chen
AI4TS
20
1
0
18 May 2023
Efficient Sequence Transduction by Jointly Predicting Tokens and
  Durations
Efficient Sequence Transduction by Jointly Predicting Tokens and Durations
Hainan Xu
Fei Jia
Somshubra Majumdar
Hengguan Huang
Shinji Watanabe
Boris Ginsburg
27
17
0
13 Apr 2023
Sim-T: Simplify the Transformer Network by Multiplexing Technique for
  Speech Recognition
Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition
Guangyong Wei
Zhikui Duan
Shiren Li
Guangguang Yang
Xinmei Yu
Junhua Li
22
4
0
11 Apr 2023
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in
  Speech Recognition
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition
Saumya Yashmohini Sahai
Jing Liu
Thejaswi Muniyappa
Kanthashree Mysore Sathyendra
Anastasios Alexandridis
...
Ross McGowan
Ariya Rastrow
Feng-Ju Chang
Athanasios Mouchtaris
Siegfried Kunzmann
36
5
0
03 Apr 2023
SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic
  Speech Processing
SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech Processing
Weidong Chen
Xiaofen Xing
Xiangmin Xu
Jianxin Pang
Lan Du
30
38
0
27 Feb 2023
Adaptive Sparse and Monotonic Attention for Transformer-based Automatic
  Speech Recognition
Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition
Chendong Zhao
Jianzong Wang
Wentao Wei
Xiaoyang Qu
Haoqian Wang
Jing Xiao
36
2
0
30 Sep 2022
Transformer-based Streaming ASR with Cumulative Attention
Transformer-based Streaming ASR with Cumulative Attention
Mohan Li
Shucong Zhang
Catalin Zorila
R. Doddipatla
19
9
0
11 Mar 2022
On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End
  Mandarin Chinese ASR
On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR
Zhao Yang
Dianwen Ng
Xiao Fu
Liping Han
Wei Xi
Ruimeng Wang
Rui Jiang
Jizhong Zhao
40
2
0
26 Jan 2022
Run-and-back stitch search: novel block synchronous decoding for
  streaming encoder-decoder ASR
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR
E. Tsunoo
Chaitanya Narisetty
Michael Hentschel
Yosuke Kashiwagi
Shinji Watanabe
6
2
0
25 Jan 2022
A Study of Transducer based End-to-End ASR with ESPnet: Architecture,
  Auxiliary Loss and Decoding Strategies
A Study of Transducer based End-to-End ASR with ESPnet: Architecture, Auxiliary Loss and Decoding Strategies
Florian Boyer
Yusuke Shinohara
Takaaki Ishii
H. Inaguma
Shinji Watanabe
29
34
0
14 Jan 2022
Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural
  Language Question
Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Yuanfeng Song
Raymond Chi-Wing Wong
Xuefang Zhao
Di Jiang
31
13
0
04 Jan 2022
Context-Aware Transformer Transducer for Speech Recognition
Context-Aware Transformer Transducer for Speech Recognition
Feng-Ju Chang
Jing Liu
Martin H. Radfar
Athanasios Mouchtaris
M. Omologo
Ariya Rastrow
Siegfried Kunzmann
15
79
0
05 Nov 2021
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning
  for Low-Resource Speech Recognition
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition
Guolin Zheng
Yubei Xiao
Ke Gong
Pan Zhou
Xiaodan Liang
Liang Lin
24
26
0
19 Sep 2021
Multi-Channel Transformer Transducer for Speech Recognition
Multi-Channel Transformer Transducer for Speech Recognition
Feng-Ju Chang
Martin H. Radfar
Athanasios Mouchtaris
M. Omologo
18
19
0
30 Aug 2021
Amortized Neural Networks for Low-Latency Speech Recognition
Amortized Neural Networks for Low-Latency Speech Recognition
J. Macoskey
Grant P. Strimel
Jinru Su
Ariya Rastrow
9
18
0
03 Aug 2021
Conformer-based End-to-end Speech Recognition With Rotary Position
  Embedding
Conformer-based End-to-end Speech Recognition With Rotary Position Embedding
Shengqiang Li
Menglong Xu
Xiao-Lei Zhang
13
9
0
13 Jul 2021
Non-autoregressive Transformer-based End-to-end ASR using BERT
Non-autoregressive Transformer-based End-to-end ASR using BERT
Fu-Hao Yu
Kuan-Yu Chen
25
22
0
10 Apr 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by
  Applying Fast-Skip Regularization
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization
Zhengkun Tian
Jiangyan Yi
Ye Bai
J. Tao
Shuai Zhang
Zhengqi Wen
23
16
0
07 Apr 2021
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech
  Recognition
TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech Recognition
Zhengkun Tian
Jiangyan Yi
J. Tao
Ye Bai
Shuai Zhang
Zhengqi Wen
Xuefei Liu
9
19
0
04 Apr 2021
Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention
Chen Liang
Menglong Xu
Xiao-Lei Zhang
23
8
0
29 Mar 2021
Unidirectional Memory-Self-Attention Transducer for Online Speech
  Recognition
Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition
Jian Luo
Jianzong Wang
Ning Cheng
Jing Xiao
RALM
10
6
0
23 Feb 2021
Fast End-to-End Speech Recognition via Non-Autoregressive Models and
  Cross-Modal Knowledge Transferring from BERT
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Ye Bai
Jiangyan Yi
J. Tao
Zhengkun Tian
Zhengqi Wen
Shuai Zhang
RALM
33
50
0
15 Feb 2021
Thank you for Attention: A survey on Attention-based Artificial Neural
  Networks for Automatic Speech Recognition
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition
Priyabrata Karmakar
S. Teng
Guojun Lu
19
25
0
14 Feb 2021
Gated Recurrent Fusion with Joint Training Framework for Robust
  End-to-End Speech Recognition
Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition
Cunhang Fan
Jiangyan Yi
J. Tao
Zhengkun Tian
Bin Liu
Zhengqi Wen
6
66
0
09 Nov 2020
Improving RNN transducer with normalized jointer network
Improving RNN transducer with normalized jointer network
Mingkun Huang
Jun Zhang
Meng Cai
Yang Zhang
Jiali Yao
Yongbin You
Yi He
Zejun Ma
9
7
0
03 Nov 2020
Transformer-based End-to-End Speech Recognition with Local Dense
  Synthesizer Attention
Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Menglong Xu
Shengqiang Li
Xiao-Lei Zhang
24
31
0
23 Oct 2020
Self-Attention Generative Adversarial Network for Speech Enhancement
Self-Attention Generative Adversarial Network for Speech Enhancement
Huy P Phan
Huy Le Nguyen
Oliver Y. Chén
P. Koch
Ngoc Q. K. Duong
Ian Mcloughlin
Alfred Mertins
GAN
38
26
0
18 Oct 2020
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable
  End-to-End Speech Recognition
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition
Wenyong Huang
Wenchao Hu
Y. Yeung
Xiao Chen
9
50
0
13 Aug 2020
Transformer with Bidirectional Decoder for Speech Recognition
Transformer with Bidirectional Decoder for Speech Recognition
Xi Chen
Songyang Zhang
Dandan Song
P. Ouyang
Shouyi Yin
18
13
0
11 Aug 2020
Streaming Transformer ASR with Blockwise Synchronous Beam Search
Streaming Transformer ASR with Blockwise Synchronous Beam Search
E. Tsunoo
Yosuke Kashiwagi
Shinji Watanabe
6
11
0
25 Jun 2020
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for
  Transformer-based LVCSR
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-based LVCSR
Xinyuan Zhou
Grandee Lee
Emre Yilmaz
Yanhua Long
Jiaen Liang
Haizhou Li
11
7
0
18 Jun 2020
Simplified Self-Attention for Transformer-based End-to-End Speech
  Recognition
Simplified Self-Attention for Transformer-based End-to-End Speech Recognition
Haoneng Luo
Shiliang Zhang
Ming Lei
Lei Xie
27
33
0
21 May 2020
SAN-M: Memory Equipped Self-Attention for End-to-End Speech Recognition
SAN-M: Memory Equipped Self-Attention for End-to-End Speech Recognition
Zhifu Gao
Shiliang Zhang
Ming Lei
Ian Mcloughlin
19
35
0
21 May 2020
Exploring Transformers for Large-Scale Speech Recognition
Exploring Transformers for Large-Scale Speech Recognition
Liang Lu
Changliang Liu
Jinyu Li
Jiawei Liu
6
40
0
19 May 2020
Attention-based Transducer for Online Speech Recognition
Attention-based Transducer for Online Speech Recognition
Bin Wang
Yan Yin
Hui-Ching Lin
18
4
0
18 May 2020
Exploration of Audio Quality Assessment and Anomaly Localisation Using
  Attention Models
Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models
Qiang Huang
Thomas Hain
13
1
0
16 May 2020
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech
  Recognition
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition
Zhengkun Tian
Jiangyan Yi
J. Tao
Ye Bai
Shuai Zhang
Zhengqi Wen
8
54
0
16 May 2020
Research on Modeling Units of Transformer Transducer for Mandarin Speech
  Recognition
Research on Modeling Units of Transformer Transducer for Mandarin Speech Recognition
Li Fu
Xiaoxiao Li
Libo Zi
6
5
0
26 Apr 2020
Towards a Competitive End-to-End Speech Recognition for CHiME-6 Dinner
  Party Transcription
Towards a Competitive End-to-End Speech Recognition for CHiME-6 Dinner Party Transcription
A. Andrusenko
A. Laptev
Ivan Medennikov
17
16
0
22 Apr 2020
Rnn-transducer with language bias for end-to-end Mandarin-English
  code-switching speech recognition
Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition
Shuai Zhang
Jiangyan Yi
Zhengkun Tian
J. Tao
Ye Bai
9
25
0
19 Feb 2020
Synchronous Transformers for End-to-End Speech Recognition
Synchronous Transformers for End-to-End Speech Recognition
Zhengkun Tian
Jiangyan Yi
Ye Bai
J. Tao
Shuai Zhang
Zhengqi Wen
19
72
0
06 Dec 2019
A Transformer with Interleaved Self-attention and Convolution for Hybrid
  Acoustic Models
A Transformer with Interleaved Self-attention and Convolution for Hybrid Acoustic Models
Liang Lu
11
4
0
23 Oct 2019
1