ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.10113
  4. Cited By
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End
  Models for Speech Recognition
v1v2 (latest)

A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition

20 May 2020
Linhao Dong
Cheng Yi
Jianzong Wang
Shiyu Zhou
Shuang Xu
X. Jia
Bo Xu
ArXiv (abs)PDFHTML

Papers citing "A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition"

12 / 12 papers shown
Chunk Based Speech Pre-training with High Resolution Finite Scalar Quantization
Chunk Based Speech Pre-training with High Resolution Finite Scalar Quantization
Yun Tang
Cindy Tseng
146
0
0
19 Sep 2025
Transducer Consistency Regularization for Speech to Text Applications
Transducer Consistency Regularization for Speech to Text ApplicationsSpoken Language Technology Workshop (SLT), 2024
Cindy Tseng
Yun Tang
Vijendra Raj Apsingekar
344
0
0
09 Oct 2024
Lightweight Transducer Based on Frame-Level Criterion
Lightweight Transducer Based on Frame-Level CriterionInterspeech (Interspeech), 2024
Genshun Wan
Mengzhi Wang
Tingzhi Mao
Hang Chen
Z. Ye
351
1
0
05 Sep 2024
Incremental Blockwise Beam Search for Simultaneous Speech Translation
  with Controllable Quality-Latency Tradeoff
Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency TradeoffInterspeech (Interspeech), 2023
Peter Polák
Brian Yan
Shinji Watanabe
A. Waibel
Ondrej Bojar
229
10
0
20 Sep 2023
CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech
  Recognition
CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Tian-Hao Zhang
Dinghao Zhou
Guiping Zhong
Jiaming Zhou
Baoxiang Li
321
7
0
26 Jul 2023
Integration of Frame- and Label-synchronous Beam Search for Streaming
  Encoder-decoder Speech Recognition
Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech RecognitionInterspeech (Interspeech), 2023
E. Tsunoo
Hayato Futami
Yosuke Kashiwagi
Siddhant Arora
Shinji Watanabe
228
4
0
24 Jul 2023
Exploring Continuous Integrate-and-Fire for Adaptive Simultaneous Speech
  Translation
Exploring Continuous Integrate-and-Fire for Adaptive Simultaneous Speech TranslationInterspeech (Interspeech), 2022
Chih-Chiang Chang
Hung-yi Lee
288
14
0
22 Mar 2022
Improving non-autoregressive end-to-end speech recognition with
  pre-trained acoustic and language models
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language modelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Keqi Deng
Zehui Yang
Shinji Watanabe
Yosuke Higuchi
Gaofeng Cheng
Pengyuan Zhang
241
30
0
25 Jan 2022
CarneliNet: Neural Mixture Model for Automatic Speech Recognition
CarneliNet: Neural Mixture Model for Automatic Speech Recognition
A. Kalinov
Somshubra Majumdar
Jagadeesh Balam
Boris Ginsburg
MoE
128
3
0
22 Jul 2021
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented RecordingInterspeech (Interspeech), 2021
Hirofumi Inaguma
Tatsuya Kawahara
271
2
0
15 Jul 2021
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for
  Low-resource Speech Recognition
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech RecognitionIEEE Signal Processing Letters (IEEE SPL), 2021
Cheng Yi
Shiyu Zhou
Bo Xu
242
44
0
17 Jan 2021
AV Taris: Online Audio-Visual Speech Recognition
AV Taris: Online Audio-Visual Speech Recognition
George Sterpu
N. Harte
197
1
0
14 Dec 2020
1
Page 1 of 1