Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

8 December 2015

Jingdong Chen

Linxi Fan

Sharan Narang

Yi Wang

Papers citing "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"

50 / 1,096 papers shown

Impact of Artificial Intelligence on Businesses: from Research, Innovation, Market Deployment to Future Shifts in Business Models

03 May 2019

Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text

M. Baskar

Shinji Watanabe

Ramón Fernández Astudillo

Takaaki Hori

L. Burget

J. Černocký

183

30 Apr 2019

Forget the Learning Rate, Decay Loss

Jiakai Wei

101

27 Apr 2019

Realizing Petabyte Scale Acoustic Modeling

114

24 Apr 2019

MinCall - MinION end2end convolutional deep learning basecaller

N. Miculinic

Marko Ratkovic

M. Šikić

22 Apr 2019

An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions

Aswin Shanmugam Subramanian

166

19 Apr 2019

Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation

Gakuto Kurata

Kartik Audhkhasi

156

17 Apr 2019

A Multi-Task Learning Framework for Overcoming the Catastrophic Forgetting in Automatic Speech Recognition

131

17 Apr 2019

Hard Sample Mining for the Improved Retraining of Automatic Speech Recognition

114

17 Apr 2019

Predicting Time-to-Failure of Plasma Etching Equipment using Machine Learning

16 Apr 2019

SpeechYOLO: Detection and Localization of Speech Objects

135

14 Apr 2019

wav2vec: Unsupervised Pre-training for Speech Recognition

370

721

11 Apr 2019

From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings

124

10 Apr 2019

Distributed Deep Learning Strategies For Automatic Speech Recognition

Wei Zhang

134

10 Apr 2019

Who Needs Words? Lexicon-Free Speech Recognition

Tatiana Likhomanenko

Gabriel Synnaeve

R. Collobert

228

09 Apr 2019

Speech Model Pre-training for End-to-End Spoken Language Understanding

Mirco Ravanelli

196

376

07 Apr 2019

On The Power of Curriculum Learning in Training Deep Networks

Guy Hacohen

D. Weinshall

ODL

259

515

07 Apr 2019

Jasper: An End-to-End Convolutional Neural Acoustic Model

Boris Ginsburg

251

277

05 Apr 2019

Lessons from Building Acoustic Models with a Million Hours of Speech

S. Parthasarathi

N. Strom

178

02 Apr 2019

Learning Shared Encoding Representation for End-to-End Speech Recognition Models

T. Nguyen

Sebastian Stüker

A. Waibel

138

31 Mar 2019

On Arrhythmia Detection by Deep Learning and Multidimensional Representation

30 Mar 2019

Attention-Augmented End-to-End Multi-Task Learning for Emotion Prediction from Speech

Zixing Zhang

Bingwen Wu

Bjoern Schuller

141

29 Mar 2019

Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data

Bjoern Schuller

130

29 Mar 2019

k-Same-Siamese-GAN: k-Same Algorithm with Generative Adversarial Network for Facial Image De-identification with Hyperparameter Tuning and Mixed Precision Training

108

27 Mar 2019

Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition

Shiliang Zhang

Ming Lei

Zhijie Yan

113

27 Mar 2019

Practical Hidden Voice Attacks against Speech and Speaker Recognition SystemsNetwork and Distributed System Security Symposium (NDSS), 2019

152

177

18 Mar 2019

A Research Agenda: Dynamic Models to Defend Against Correlated Attacks

Ian Goodfellow

AAML OOD

161

14 Mar 2019

End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model

102

12 Mar 2019

KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube VideosConference on Empirical Methods in Natural Language Processing (EMNLP), 2018

113

01 Mar 2019

Incorporating End-to-End Speech Recognition Models for Sentiment AnalysisIEEE International Conference on Robotics and Automation (ICRA), 2019

154

28 Feb 2019

No Padding Please: Efficient Neural Handwriting RecognitionIEEE International Conference on Document Analysis and Recognition (ICDAR), 2019

Gideon Maillette de Buy Wenniger

Lambert Schomaker

Andy Way

131

28 Feb 2019

Interaction-aware Kalman Neural Networks for Trajectory Prediction

297

28 Feb 2019

An Empirical Study of Large-Batch Stochastic Gradient Descent with Structured Covariance Noise

Jimmy Ba

329

21 Feb 2019

Audio-Linguistic Embeddings for Spoken Sentences

Albert Haque

Michelle Guo

Prateek Verma

Li Fei-Fei

117

20 Feb 2019

STRIP: A Defence Against Trojan Attacks on Deep Neural Networks

Shiping Chen

323

923

18 Feb 2019

A Fully Differentiable Beam Search Decoder

R. Collobert

Awni Y. Hannun

Gabriel Synnaeve

157

16 Feb 2019

Using multi-task learning to improve the performance of acoustic-to-word and conventional hybrid models

T. Nguyen

Sebastian Stüker

A. Waibel

177

02 Feb 2019

Robust Inference via Generative Classifiers for Handling Noisy LabelsInternational Conference on Machine Learning (ICML), 2019

283

154

31 Jan 2019

Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM

105

30 Jan 2019

FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture

137

28 Jan 2019

Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition

247

123

22 Jan 2019

Robust Watermarking of Neural Network with Exponential Weighting

Ryota Namba

Jun Sakuma

AAML

160

151

18 Jan 2019

Towards Using Context-Dependent Symbols in CTC Without State-Tying Decision Trees

162

14 Jan 2019

Exploring spectro-temporal features in end-to-end convolutional neural networks

Sean Robertson

Gerald Penn

Yingxue Wang

137

01 Jan 2019

Towards a Theoretical Understanding of Hashing-Based Neural Nets

Yibo Lin

Zhao Song

Lin F. Yang

152

26 Dec 2018

A Multiversion Programming Inspired Approach to Detecting Audio Adversarial Examples

110

26 Dec 2018

Pansori: ASR Corpus Generation from Open Online Video Contents

Yoona Choi

Bowon Lee

23 Dec 2018

Deep learning incorporating biologically-inspired neural dynamics

129

17 Dec 2018

Fully Convolutional Speech Recognition

200

17 Dec 2018

To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition

Yossi Adi

216

09 Dec 2018