v1v2 (latest)

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

10 April 2020

Papers citing "Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR"

33 / 33 papers shown

Streaming Sequence Transduction through Dynamic Compression

620

02 Feb 2024

Unified Segment-to-Segment Framework for Simultaneous Sequence GenerationNeural Information Processing Systems (NeurIPS), 2023

Shaolei Zhang

Yang Feng

344

27 Oct 2023

CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Jiaming Zhou

324

26 Jul 2023

Globally Normalising the Transducer for Streaming Speech Recognition

Rogier van Dalen

203

20 Jul 2023

Self-regularised Minimum Latency Training for Streaming Transformer-based Speech RecognitionInterspeech (Interspeech), 2022

Mohan Li

R. Doddipatla

Catalin Zorila

342

24 Apr 2023

Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech RecognitionInterspeech (Interspeech), 2022

Yusuke Shinohara

Shinji Watanabe

AI4TS

275

04 Nov 2022

Delay-penalized transducer for low-latency streaming ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Wei Kang

Zengwei Yao

Fangjun Kuang

Liyong Guo

Xiaoyu Yang

Long lin

Piotr Żelasko

Daniel Povey

321

31 Oct 2022

Large-Scale Streaming End-to-End Speech Translation with Neural TransducersInterspeech (Interspeech), 2022

337

11 Apr 2022

Dynamic Latency for CTC-Based Streaming Automatic Speech Recognition With Emformer

234

29 Mar 2022

Transformer-based Streaming ASR with Cumulative AttentionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Mohan Li

Shucong Zhang

Catalin Zorila

R. Doddipatla

274

11 Mar 2022

Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

209

25 Jan 2022

Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition

K. Kumatani

R. Gmyr

Andres Felipe Cruz Salinas

369

10 Dec 2021

Recent Advances in End-to-End Automatic Speech RecognitionAPSIPA Transactions on Signal and Information Processing (TASIP), 2021

Jinyu Li

VLM

548

444

02 Nov 2021

An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASRAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021

162

20 Oct 2021

VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented RecordingInterspeech (Interspeech), 2021

Hirofumi Inaguma

Tatsuya Kawahara

311

15 Jul 2021

StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR

Hirofumi Inaguma

Tatsuya Kawahara

218

01 Jul 2021

Reducing Streaming ASR Model Delay with Self AlignmentInterspeech (Interspeech), 2021

166

06 May 2021

Dissecting User-Perceived Latency of On-Device E2E Speech RecognitionInterspeech (Interspeech), 2021

...

Ozlem Kalinli

298

06 Apr 2021

Mutually-Constrained Monotonic Multihead Attention for Online ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Jae-gyun Song

Hajin Shim

Eunho Yang

136

26 Mar 2021

Alignment Knowledge Distillation for Online Streaming Attention-based Speech RecognitionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

Hirofumi Inaguma

Tatsuya Kawahara

414

28 Feb 2021

Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech RecognitionIntelligent Systems with Applications (ISA), 2021

Priyabrata Karmakar

S. Teng

Guojun Lu

184

14 Feb 2021

A Better and Faster End-to-End Model for Streaming ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

...

444

133

21 Nov 2020

Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR

...

Jun Liu

187

09 Nov 2020

Improving RNN Transducer Based ASR with Auxiliary Tasks

364

05 Nov 2020

Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive TrainingFindings (Findings), 2020

251

20 Oct 2020

Parallel Rescoring with Transformer for Streaming On-Device Speech RecognitionInterspeech (Interspeech), 2020

233

30 Aug 2020

Large-scale Transfer Learning for Low-resource Spoken Language UnderstandingInterspeech (Interspeech), 2020

216

13 Aug 2020

Online Automatic Speech Recognition with Listen, Attend and Spell ModelIEEE Signal Processing Letters (IEEE SPL), 2020

168

12 Aug 2020

Streaming Transformer ASR with Blockwise Synchronous Beam Search

E. Tsunoo

Yosuke Kashiwagi

Shinji Watanabe

401

25 Jun 2020

On the Comparison of Popular End-to-End Models for Large Scale Speech RecognitionInterspeech (Interspeech), 2020

375

142

28 May 2020

Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection

Danni Liu

Gerasimos Spanakis

Jan Niehues

201

22 May 2020

Enhancing Monotonic Multihead Attention for Streaming ASR

Hirofumi Inaguma

Masato Mimura

Tatsuya Kawahara

430

19 May 2020

CTC-synchronous Training for Monotonic Attention Model

Hirofumi Inaguma

Masato Mimura

Tatsuya Kawahara

229

10 May 2020