ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1512.02595
  4. Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
ArXiv (abs)PDFHTML

Papers citing "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"

50 / 1,096 papers shown
Title
Building competitive direct acoustics-to-word models for English
  conversational speech recognition
Building competitive direct acoustics-to-word models for English conversational speech recognition
Kartik Audhkhasi
Brian Kingsbury
Bhuvana Ramabhadran
G. Saon
M. Picheny
129
153
0
08 Dec 2017
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica
  in End-to-End Models
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models
Tara N. Sainath
Rohit Prabhavalkar
Shankar Kumar
Seungjin Lee
Anjuli Kannan
...
Patrick Nguyen
Yue Liu
Yonghui Wu
Zhiwen Chen
Chung-Cheng Chiu
111
54
0
05 Dec 2017
Learning Fast and Slow: PROPEDEUTICA for Real-time Malware Detection
Learning Fast and Slow: PROPEDEUTICA for Real-time Malware Detection
Ruimin Sun
Xiaoyong Yuan
Pan He
Qile Zhu
Aokun Chen
André Grégio
Daniela Oliveira
Xiaolin Li
AAML
134
12
0
04 Dec 2017
SERKET: An Architecture for Connecting Stochastic Models to Realize a
  Large-Scale Cognitive Model
SERKET: An Architecture for Connecting Stochastic Models to Realize a Large-Scale Cognitive Model
Tomoaki Nakamura
Takayuki Nagai
T. Taniguchi
3DV
239
47
0
04 Dec 2017
Deep Learning Scaling is Predictable, Empirically
Deep Learning Scaling is Predictable, Empirically
Joel Hestness
Sharan Narang
Newsha Ardalani
G. Diamos
Heewoo Jun
Hassan Kianinejad
Md. Mostofa Ali Patwary
Yang Yang
Yanqi Zhou
438
876
0
01 Dec 2017
Highrisk Prediction from Electronic Medical Records via Deep Attention
  Networks
Highrisk Prediction from Electronic Medical Records via Deep Attention Networks
You Jin Kim
Yun-Geun Lee
Jeong-Whun Kim
Jin Joo Park
Borim Ryu
Jung-Woo Ha
108
21
0
30 Nov 2017
Exploiting Nontrivial Connectivity for Automatic Speech Recognition
Exploiting Nontrivial Connectivity for Automatic Speech Recognition
Marius Paraschiv
Lasse Borgholt
T. M. S. Tax
Marco Singh
Lars Maaløe
88
0
0
28 Nov 2017
Acoustic-To-Word Model Without OOV
Acoustic-To-Word Model Without OOV
Jinyu Li
Guoli Ye
Rui Zhao
J. Droppo
Jiawei Liu
140
38
0
28 Nov 2017
Multilingual Adaptation of RNN Based ASR Systems
Multilingual Adaptation of RNN Based ASR Systems
Markus Müller
Sebastian Stüker
A. Waibel
158
18
0
13 Nov 2017
Phonemic and Graphemic Multilingual CTC Based Speech Recognition
Phonemic and Graphemic Multilingual CTC Based Speech Recognition
Markus Müller
Sebastian Stüker
A. Waibel
103
12
0
13 Nov 2017
Dynamic Analysis of Executables to Detect and Characterize Malware
Dynamic Analysis of Executables to Detect and Characterize Malware
Michael R. Smith
J. Ingram
Christopher C. Lamb
T. Draelos
J. Doak
J. Aimone
C. James
125
15
0
10 Nov 2017
Block-Sparse Recurrent Neural Networks
Block-Sparse Recurrent Neural Networks
Sharan Narang
Eric Undersander
G. Diamos
150
143
0
08 Nov 2017
Unbounded cache model for online language modeling with open vocabulary
Unbounded cache model for online language modeling with open vocabulary
Edouard Grave
Moustapha Cissé
Armand Joulin
KELMCLL
139
70
0
07 Nov 2017
Improved training for online end-to-end speech recognition systems
Improved training for online end-to-end speech recognition systems
Suyoun Kim
M. Seltzer
Jinyu Li
Rui Zhao
126
45
0
06 Nov 2017
Robust Speech Recognition Using Generative Adversarial Networks
Robust Speech Recognition Using Generative Adversarial Networks
Anuroop Sriram
Heewoo Jun
Yashesh Gaur
S. Satheesh
102
50
0
05 Nov 2017
Convolutional Drift Networks for Video Classification
Convolutional Drift Networks for Video Classification
Dillon Graham
Seyed Hamed Fatemi Langroudi
Christopher Kanan
Dhireesha Kudithipudi
104
11
0
03 Nov 2017
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting
  Input and Output Sparsity
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting Input and Output Sparsity
Jingyang Zhu
Jingbo Jiang
Xizi Chen
Chi-Ying Tsui
121
36
0
03 Nov 2017
Efficient Inferencing of Compressed Deep Neural Networks
Efficient Inferencing of Compressed Deep Neural Networks
Dharma Teja Vooturi
Saurabh Goyal
Anamitra R. Choudhury
Yogish Sabharwal
Ashish Verma
81
6
0
01 Nov 2017
Countering Adversarial Images using Input Transformations
Countering Adversarial Images using Input Transformations
Chuan Guo
Mayank Rana
Moustapha Cissé
Laurens van der Maaten
AAML
573
1,522
0
31 Oct 2017
A Study of All-Convolutional Encoders for Connectionist Temporal
  Classification
A Study of All-Convolutional Encoders for Connectionist Temporal ClassificationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2017
Kalpesh Krishna
Liang Lu
Kevin Gimpel
Karen Livescu
135
11
0
28 Oct 2017
On the Long-Term Memory of Deep Recurrent Networks
On the Long-Term Memory of Deep Recurrent NetworksInternational Conference on Learning Representations (ICLR), 2017
Yoav Levine
Or Sharir
Alon Ziv
Amnon Shashua
136
25
0
25 Oct 2017
mixup: Beyond Empirical Risk Minimization
mixup: Beyond Empirical Risk MinimizationInternational Conference on Learning Representations (ICLR), 2017
Hongyi Zhang
Moustapha Cissé
Yann N. Dauphin
David Lopez-Paz
NoLa
686
11,025
0
25 Oct 2017
Trace norm regularization and faster inference for embedded speech
  recognition RNNs
Trace norm regularization and faster inference for embedded speech recognition RNNs
Markus Kliegl
Siddharth Goyal
Kexin Zhao
Kavya Srinet
Mohammad Shoeybi
167
8
0
25 Oct 2017
ActivityNet Challenge 2017 Summary
ActivityNet Challenge 2017 Summary
Guohao Li
Juan Carlos Niebles
Cees G. M. Snoek
Fabian Caba Heilbron
Humam Alwassel
Ranjay Krishna
Victor Escorcia
Kenji Hata
S. Buch
170
49
0
22 Oct 2017
Deep Triphone Embedding Improves Phoneme Recognition
Deep Triphone Embedding Improves Phoneme Recognition
Mohit Yadav
V. Tyagi
92
2
0
22 Oct 2017
Real-time Convolutional Neural Networks for Emotion and Gender
  Classification
Real-time Convolutional Neural Networks for Emotion and Gender Classification
Octavio Arriaga
Matias Valdenegro-Toro
Paul G. Plöger
3DH
131
304
0
20 Oct 2017
Convolutional Attention-based Seq2Seq Neural Network for End-to-End ASR
Convolutional Attention-based Seq2Seq Neural Network for End-to-End ASR
D. Lim
107
2
0
12 Oct 2017
Mixed Precision Training
Mixed Precision Training
Paulius Micikevicius
Sharan Narang
Jonah Alben
G. Diamos
Erich Elsen
...
Boris Ginsburg
Michael Houston
Oleksii Kuchaiev
Ganesh Venkatesh
Hao Wu
456
2,107
0
10 Oct 2017
Spinal cord gray matter segmentation using deep dilated convolutions
Spinal cord gray matter segmentation using deep dilated convolutions
C. Perone
Evan Calabrese
Julien Cohen-Adad
MedIm
106
122
0
02 Oct 2017
Improving speech recognition by revising gated recurrent units
Improving speech recognition by revising gated recurrent units
Mirco Ravanelli
Philemon Brakel
M. Omologo
Yoshua Bengio
123
55
0
29 Sep 2017
Attention-based Wav2Text with Feature Transfer Learning
Attention-based Wav2Text with Feature Transfer Learning
Andros Tjandra
S. Sakti
Satoshi Nakamura
71
20
0
22 Sep 2017
Accelerating SGD for Distributed Deep-Learning Using Approximated
  Hessian Matrix
Accelerating SGD for Distributed Deep-Learning Using Approximated Hessian Matrix
Sébastien M. R. Arnold
Chunming Wang
52
0
0
15 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory
Learning Intrinsic Sparse Structures within Long Short-Term Memory
W. Wen
Yuxiong He
Samyam Rajbhandari
Minjia Zhang
Wenhan Wang
Fang Liu
Bin Hu
Yiran Chen
Xue Yang
MQ
444
142
0
15 Sep 2017
ImageNet Training in Minutes
ImageNet Training in Minutes
Yang You
Zhao-jie Zhang
Cho-Jui Hsieh
J. Demmel
Kurt Keutzer
VLMLRM
274
60
0
14 Sep 2017
Analyzing Hidden Representations in End-to-End Automatic Speech
  Recognition Systems
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems
Yonatan Belinkov
James R. Glass
104
90
0
13 Sep 2017
Parallelizing Linear Recurrent Neural Nets Over Sequence Length
Parallelizing Linear Recurrent Neural Nets Over Sequence Length
Eric Martin
Chris Cundy
230
143
0
12 Sep 2017
Cold Fusion: Training Seq2Seq Models Together with Language Models
Cold Fusion: Training Seq2Seq Models Together with Language Models
Anuroop Sriram
Heewoo Jun
S. Satheesh
Adam Coates
VLM
209
298
0
21 Aug 2017
Deep Learning at 15PF: Supervised and Semi-Supervised Classification for
  Scientific Data
Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data
Thorsten Kurth
Jian Zhang
N. Satish
Alexia Jolicoeur-Martineau
Evan Racah
...
J. Deslippe
Mikhail Shiryaev
Srinivas Sridharan
P. Prabhat
Pradeep Dubey
158
84
0
17 Aug 2017
Scaling Deep Learning on GPU and Knights Landing clusters
Scaling Deep Learning on GPU and Knights Landing clustersInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2017
Yang You
A. Buluç
J. Demmel
GNN
119
80
0
09 Aug 2017
Bayesian Sparsification of Recurrent Neural Networks
Bayesian Sparsification of Recurrent Neural Networks
E. Lobacheva
Nadezhda Chirkova
Dmitry Vetrov
UQCVBDL
178
16
0
31 Jul 2017
Exploring Neural Transducers for End-to-End Speech Recognition
Exploring Neural Transducers for End-to-End Speech Recognition
Eric Battenberg
Jitong Chen
R. Child
Adam Coates
Yashesh Gaur Yi Li
...
Hairong Liu
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
AI4TS
153
232
0
24 Jul 2017
Attention-Based End-to-End Speech Recognition on Voice Search
Attention-Based End-to-End Speech Recognition on Voice Search
Changhao Shan
Junbo Zhang
Yujun Wang
Lei Xie
170
7
0
22 Jul 2017
Single-Channel Multi-talker Speech Recognition with Permutation
  Invariant Training
Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training
Y. Qian
Xuankai Chang
Dong Yu
166
83
0
19 Jul 2017
Houdini: Fooling Deep Structured Prediction Models
Houdini: Fooling Deep Structured Prediction Models
Moustapha Cissé
Yossi Adi
Natalia Neverova
Joseph Keshet
AAML
189
276
0
17 Jul 2017
Automatic Construction of Real-World Datasets for 3D Object Localization
  using Two Cameras
Automatic Construction of Real-World Datasets for 3D Object Localization using Two Cameras
Joris Guérin
O. Gibaru
E. Nyiri
Stéphane Thiery
238
2
0
10 Jul 2017
Cardiologist-Level Arrhythmia Detection with Convolutional Neural
  Networks
Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks
Pranav Rajpurkar
Awni Y. Hannun
Masoumeh Haghpanahi
Codie Bourn
A. Ng
213
804
0
06 Jul 2017
Improving LSTM-CTC based ASR performance in domains with limited
  training data
Improving LSTM-CTC based ASR performance in domains with limited training data
J. Billa
130
11
0
03 Jul 2017
Dual Supervised Learning
Dual Supervised Learning
Ziheng Lu
Tao Qin
Wei-neng Chen
Jiang Bian
Nenghai Yu
Tie-Yan Liu
SSL
194
143
0
03 Jul 2017
Named Entity Recognition with stack residual LSTM and trainable bias
  decoding
Named Entity Recognition with stack residual LSTM and trainable bias decoding
Q. Tran
Andrew D. MacKinlay
Antonio Jimeno Yepes
148
57
0
23 Jun 2017
A Wavenet for Speech Denoising
A Wavenet for Speech Denoising
Dario Rethage
Jordi Pons
Xavier Serra
295
466
0
22 Jun 2017
Previous
123...19202122
Next