Espresso: A Fast End-to-end Neural Speech Recognition Toolkit

18 September 2019

Hainan Xu

Lei Xie

Sanjeev Khudanpur

Papers citing "Espresso: A Fast End-to-end Neural Speech Recognition Toolkit"

21 / 21 papers shown

Title
Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition Hainan Xu Zhehuai Chen Fei Jia Boris Ginsburg 41 0 0 04 Apr 2024
FAT-HuBERT: Front-end Adaptive Training of Hidden-unit BERT for Distortion-Invariant Robust Speech Recognition Dongning Yang Wei Wang Yanmin Qian 13 3 0 29 Nov 2023
Quran Recitation Recognition using End-to-End Deep Learning Ahmad Al Harere Khloud Al Jallad 38 6 0 10 May 2023
Efficient Sequence Transduction by Jointly Predicting Tokens and Durations Hainan Xu Fei Jia Somshubra Majumdar Hengguan Huang Shinji Watanabe Boris Ginsburg 27 17 0 13 Apr 2023
Training Integer-Only Deep Recurrent Neural Networks V. Nia Eyyub Sari Vanessa Courville M. Asgharian MQ 53 2 0 22 Dec 2022
Probing Statistical Representations For End-To-End ASR A. Ollerenshaw Md. Asif Jalal Thomas Hain 27 2 0 03 Nov 2022
Relaxed Attention for Transformer Models Timo Lohrenz Björn Möller Zhengyang Li Tim Fingscheidt KELM 29 11 0 20 Sep 2022
Improving Low-Resource Speech Recognition with Pretrained Speech Models: Continued Pretraining vs. Semi-Supervised Training Mitchell DeHaven J. Billa VLM AI4TS 15 8 0 01 Jul 2022
BEA-Base: A Benchmark for ASR of Spontaneous Hungarian P. Mihajlik A. Balog T. E. Gráczi A. Kohári Balázs Tarján K. Mády 25 8 0 01 Feb 2022
Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction Heming Wang Yao Qian Xiaofei Wang Yiming Wang Chengyi Wang Shujie Liu Takuya Yoshioka Jinyu Li DeLiang Wang 21 29 0 28 Oct 2021
TorchAudio: Building Blocks for Audio and Speech Processing Yao-Yuan Yang Moto Hira Zhaoheng Ni Anjali Chourdia Artyom Astafurov ... Sean Narenthiran Shinji Watanabe Soumith Chintala Vincent Quenneville-Bélair Yangyang Shi 31 165 0 28 Oct 2021
Lhotse: a speech data representation library for the modern deep learning ecosystem Willem Hagemann Daniel Povey Jan "Yenda" Trmal Sanjeev Khudanpur AuLLM AI4TS 33 32 0 25 Oct 2021
Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition Yiming Wang Jinyu Li Heming Wang Yao Qian Chengyi Wang Yu Wu 38 48 0 11 Oct 2021
iRNN: Integer-only Recurrent Neural Network Eyyub Sari Vanessa Courville V. Nia MQ 56 4 0 20 Sep 2021
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model Apoorv Vyas S. Madikeri H. Bourlard 19 15 0 06 Apr 2021
End-to-End Speech Recognition and Disfluency Removal Paria Jamshid Lou Mark Johnson 19 32 0 22 Sep 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge Ashish Arora Desh Raj Aswin Shanmugam Subramanian Ke Li Bar Ben Yair Matthew Maciejewski Piotr Żelasko Leibny Paola García-Perera Shinji Watanabe Sanjeev Khudanpur 39 9 0 14 Jun 2020
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR Yiwen Shao Yiming Wang Daniel Povey Sanjeev Khudanpur AI4TS 16 37 0 20 May 2020
OpenNMT: Open-Source Toolkit for Neural Machine Translation Guillaume Klein Yoon Kim Yuntian Deng Jean Senellart Alexander M. Rush 273 1,896 0 10 Jan 2017
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation Yonghui Wu M. Schuster Z. Chen Quoc V. Le Mohammad Norouzi ... Alex Rudnick Oriol Vinyals G. Corrado Macduff Hughes J. Dean AIMat 716 6,746 0 26 Sep 2016
Effective Approaches to Attention-based Neural Machine Translation Thang Luong Hieu H. Pham Christopher D. Manning 218 7,926 0 17 Aug 2015