v1v2 (latest)

TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

1 November 2022

Xingcheng Song

Di Wu

Zhiyong Wu

Binbin Zhang

ArXiv (abs)PDF HTML Github (5086★)

Papers citing "TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty"

6 / 6 papers shown

Spiralformer: Low Latency Encoder for Streaming Speech Recognition with Circular Layer Skipping and Early Exiting

134

01 Oct 2025

SegAug: CTC-Aligned Segmented Augmentation For Robust RNN-Transducer Based Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

275

20 Feb 2025

Mamba for Streaming ASR Combined with Unimodal AggregationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Ying Fang

Xiaofei Li

Mamba

279

30 Sep 2024

Towards Automatic Data Augmentation for Disordered Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Zengrui Jin

Tianzi Wang

Jiajun Deng

Shujie Hu

242

14 Dec 2023

Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play FrameworkInternational Conference on Learning Representations (ICLR), 2023

...

367

04 Jul 2023

ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMsInterspeech (Interspeech), 2023