ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1512.02595
  4. Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
ArXiv (abs)PDFHTML

Papers citing "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"

50 / 1,096 papers shown
Title
VAIN: Attentional Multi-agent Predictive Modeling
VAIN: Attentional Multi-agent Predictive Modeling
Yedid Hoshen
GNN
207
245
0
19 Jun 2017
Advances in Joint CTC-Attention based End-to-End Speech Recognition with
  a Deep CNN Encoder and RNN-LM
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LMInterspeech (Interspeech), 2017
Takaaki Hori
Shinji Watanabe
Yu Zhang
William Chan
145
304
0
08 Jun 2017
Deep Learning: A Bayesian Perspective
Deep Learning: A Bayesian Perspective
Nicholas G. Polson
Vadim Sokolov
BDL
353
125
0
01 Jun 2017
Transfer Learning for Speech Recognition on a Budget
Transfer Learning for Speech Recognition on a Budget
Julius Kunze
Louis Kirsch
Ilia Kurenkov
A. Krug
Jens Johannsmeier
Sebastian Stober
125
143
0
01 Jun 2017
Deep Learning for Environmentally Robust Speech Recognition: An Overview
  of Recent Developments
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent DevelopmentsACM Transactions on Intelligent Systems and Technology (TIST), 2017
Zixing Zhang
Jürgen T. Geiger
Jouni Pohjalainen
A. Mousa
Wenyu Jin
Björn Schuller
238
328
0
30 May 2017
Semi-Supervised Model Training for Unbounded Conversational Speech
  Recognition
Semi-Supervised Model Training for Unbounded Conversational Speech Recognition
Shane Walker
M. Pedersen
Iroro Orife
J. Flaks
74
12
0
26 May 2017
Principled Hybrids of Generative and Discriminative Domain Adaptation
Principled Hybrids of Generative and Discriminative Domain Adaptation
Haiying Zhao
Zhenyao Zhu
Junjie Hu
Adam Coates
Geoffrey J. Gordon
AI4CE
158
5
0
25 May 2017
Adaptive Detrending to Accelerate Convolutional Gated Recurrent Unit
  Training for Contextual Video Recognition
Adaptive Detrending to Accelerate Convolutional Gated Recurrent Unit Training for Contextual Video Recognition
Minju Jung
Haanvid Lee
Jun Tani
AI4TS
157
42
0
24 May 2017
Train longer, generalize better: closing the generalization gap in large
  batch training of neural networks
Train longer, generalize better: closing the generalization gap in large batch training of neural networks
Elad Hoffer
Itay Hubara
Daniel Soudry
ODL
411
844
0
24 May 2017
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
A. Parashar
Minsoo Rhu
Anurag Mukkara
A. Puglielli
Rangharajan Venkatesan
Brucek Khailany
J. Emer
S. Keckler
W. Dally
213
1,199
0
23 May 2017
Compressing Recurrent Neural Network with Tensor Train
Compressing Recurrent Neural Network with Tensor Train
Andros Tjandra
S. Sakti
Satoshi Nakamura
160
115
0
23 May 2017
Reducing Bias in Production Speech Models
Reducing Bias in Production Speech Models
Eric Battenberg
R. Child
Adam Coates
Christopher Fougner
Yashesh Gaur
...
Vinay Rao
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
129
10
0
11 May 2017
Deep Speaker: an End-to-End Neural Speaker Embedding System
Deep Speaker: an End-to-End Neural Speaker Embedding System
Chao Li
Xiaokong Ma
B. Jiang
Xiangang Li
Xuewei Zhang
Xiao-Chang Liu
Ying Cao
Ajay Kannan
Zhenyao Zhu
176
517
0
05 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep
  Neural Networks
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
223
187
0
03 May 2017
Speech-Based Visual Question Answering
Speech-Based Visual Question Answering
Ted Zhang
Dengxin Dai
Tinne Tuytelaars
Marie-Francine Moens
Luc Van Gool
178
25
0
01 May 2017
Parseval Networks: Improving Robustness to Adversarial Examples
Parseval Networks: Improving Robustness to Adversarial Examples
Moustapha Cissé
Piotr Bojanowski
Edouard Grave
Yann N. Dauphin
Nicolas Usunier
AAML
392
843
0
28 Apr 2017
A Study of Deep Learning Robustness Against Computation Failures
A Study of Deep Learning Robustness Against Computation Failures
Jean-Charles Vialatte
François Leduc-Primeau
98
14
0
18 Apr 2017
Exploring Sparsity in Recurrent Neural Networks
Exploring Sparsity in Recurrent Neural Networks
Sharan Narang
Erich Elsen
G. Diamos
Shubho Sengupta
163
323
0
17 Apr 2017
Bayesian Recurrent Neural Networks
Bayesian Recurrent Neural Networks
Meire Fortunato
Charles Blundell
Oriol Vinyals
BDL
333
200
0
10 Apr 2017
Simplified End-to-End MMI Training and Voting for ASR
Simplified End-to-End MMI Training and Voting for ASR
L. Fritz
D. Burshtein
98
3
0
30 Mar 2017
Towards thinner convolutional neural networks through Gradually Global
  Pruning
Towards thinner convolutional neural networks through Gradually Global Pruning
Z. Wang
Ce Zhu
Zhiqiang Xia
Qi Guo
Yipeng Liu
CVBM
98
4
0
29 Mar 2017
Multi-Scale Dense Networks for Resource Efficient Image Classification
Multi-Scale Dense Networks for Resource Efficient Image Classification
Gao Huang
Danlu Chen
Tianhong Li
Felix Wu
Laurens van der Maaten
Kilian Q. Weinberger
VLM
230
145
0
29 Mar 2017
Recognizing Multi-talker Speech with Permutation Invariant Training
Recognizing Multi-talker Speech with Permutation Invariant Training
Dong Yu
Xuankai Chang
Y. Qian
219
99
0
22 Mar 2017
Dance Dance Convolution
Dance Dance Convolution
Chris Donahue
Zachary Chase Lipton
Julian McAuley
161
38
0
20 Mar 2017
Convolutional Recurrent Neural Networks for Small-Footprint Keyword
  Spotting
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting
Sercan O. Arik
Markus Kliegl
R. Child
Joel Hestness
Andrew Gibiansky
Christopher Fougner
R. Prenger
Adam Coates
269
189
0
15 Mar 2017
Multichannel End-to-end Speech Recognition
Multichannel End-to-end Speech Recognition
Tsubasa Ochiai
Shinji Watanabe
Takaaki Hori
J. Hershey
124
92
0
14 Mar 2017
Task-based End-to-end Model Learning in Stochastic Optimization
Task-based End-to-end Model Learning in Stochastic Optimization
P. Donti
Brandon Amos
J. Zico Kolter
235
25
0
13 Mar 2017
Real-Time Machine Learning: The Missing Pieces
Real-Time Machine Learning: The Missing Pieces
Robert Nishihara
Philipp Moritz
Stephanie Wang
Alexey Tumanov
William Paul
Johann Schleier-Smith
Richard Liaw
Mehrdad Niknami
Sai Li
Ion Stoica
OffRL
197
64
0
11 Mar 2017
Functions that Emerge through End-to-End Reinforcement Learning - The
  Direction for Artificial General Intelligence -
Functions that Emerge through End-to-End Reinforcement Learning - The Direction for Artificial General Intelligence -
K. Shibata
70
7
0
07 Mar 2017
Exponential Moving Average Model in Parallel Speech Recognition Training
Exponential Moving Average Model in Parallel Speech Recognition Training
Xudong Tian
Jun Zhang
Zejun Ma
Yi He
Juan Wei
98
4
0
03 Mar 2017
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence
  Labelling
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
Hairong Liu
Zhenyao Zhu
Xiangang Li
S. Satheesh
VLM
209
56
0
01 Mar 2017
Improving the Neural GPU Architecture for Algorithm Learning
Improving the Neural GPU Architecture for Algorithm Learning
Kārlis Freivalds
Renars Liepins
262
43
0
28 Feb 2017
Fixed-point optimization of deep neural networks with adaptive step size
  retraining
Fixed-point optimization of deep neural networks with adaptive step size retrainingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2017
Sungho Shin
Yoonho Boo
Wonyong Sung
MQ
194
36
0
27 Feb 2017
Deep Voice: Real-time Neural Text-to-Speech
Deep Voice: Real-time Neural Text-to-SpeechInternational Conference on Machine Learning (ICML), 2017
Sercan O. Arik
Mike Chrzanowski
Adam Coates
G. Diamos
Andrew Gibiansky
...
John Miller
Andrew Ng
Jonathan Raiman
Shubho Sengupta
Mohammad Shoeybi
237
642
0
25 Feb 2017
Residual Convolutional CTC Networks for Automatic Speech Recognition
Residual Convolutional CTC Networks for Automatic Speech Recognition
Yisen Wang
Xuejiao Deng
Songbai Pu
Zhiheng Huang
169
88
0
24 Feb 2017
Sequence Modeling via Segmentations
Sequence Modeling via SegmentationsInternational Conference on Machine Learning (ICML), 2017
Chong-Jun Wang
Yining Wang
Po-Sen Huang
Abdel-rahman Mohamed
Dengyong Zhou
Li Deng
525
46
0
24 Feb 2017
Convolutional Recurrent Neural Networks for Polyphonic Sound Event
  Detection
Convolutional Recurrent Neural Networks for Polyphonic Sound Event DetectionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2017
Emre Çakir
Giambattista Parascandolo
Toni Heittola
H. Huttunen
Maria Sandsten
ObjD
183
581
0
21 Feb 2017
On Detecting Adversarial Perturbations
On Detecting Adversarial PerturbationsInternational Conference on Learning Representations (ICLR), 2017
J. H. Metzen
Tim Genewein
Volker Fischer
Bastian Bischoff
AAML
288
999
0
14 Feb 2017
DNN adaptation by automatic quality estimation of ASR hypotheses
DNN adaptation by automatic quality estimation of ASR hypothesesComputer Speech and Language (CSL), 2017
D. Falavigna
M. Matassoni
S. Jalalvand
Matteo Negri
Marco Turchi
81
14
0
06 Feb 2017
Predicting Auction Price of Vehicle License Plate with Deep Recurrent
  Neural Network
Predicting Auction Price of Vehicle License Plate with Deep Recurrent Neural NetworkExpert systems with applications (ESWA), 2017
Vinci Chow
278
21
0
30 Jan 2017
Outrageously Large Neural Networks: The Sparsely-Gated
  Mixture-of-Experts Layer
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts LayerInternational Conference on Learning Representations (ICLR), 2017
Noam M. Shazeer
Azalia Mirhoseini
Krzysztof Maziarz
Andy Davis
Quoc V. Le
Geoffrey E. Hinton
J. Dean
MoE
564
3,585
0
23 Jan 2017
Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and
  Lipreading
Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading
Chunlin Tian
Weijun Ji
47
7
0
16 Jan 2017
Kernel Approximation Methods for Speech Recognition
Kernel Approximation Methods for Speech RecognitionJournal of machine learning research (JMLR), 2017
Avner May
A. Garakani
Zhiyun Lu
Dong Guo
Kuan Liu
...
Michael Collins
Daniel J. Hsu
Brian Kingsbury
M. Picheny
Fei Sha
159
46
0
13 Jan 2017
Exploring the Design Space of Deep Convolutional Neural Networks at
  Large Scale
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
111
19
0
20 Dec 2016
Delta Networks for Optimized Recurrent Network Computation
Delta Networks for Optimized Recurrent Network ComputationInternational Conference on Machine Learning (ICML), 2016
Daniel Neil
Junhaeng Lee
T. Delbruck
Shih-Chii Liu
172
70
0
16 Dec 2016
Active Learning for Speech Recognition: the Power of Gradients
Active Learning for Speech Recognition: the Power of Gradients
Jiaji Huang
R. Child
Vinay Rao
Hairong Liu
S. Satheesh
Adam Coates
VLM
148
67
0
10 Dec 2016
Towards better decoding and language model integration in sequence to
  sequence models
Towards better decoding and language model integration in sequence to sequence models
J. Chorowski
Navdeep Jaitly
205
380
0
08 Dec 2016
Trained Ternary Quantization
Trained Ternary Quantization
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
MQ
284
1,065
0
04 Dec 2016
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Song Han
Junlong Kang
Huizi Mao
Yiming Hu
Xin Li
...
Hong Luo
Song Yao
Yu Wang
Huazhong Yang
W. Dally
147
641
0
01 Dec 2016
Capacity and Trainability in Recurrent Neural Networks
Capacity and Trainability in Recurrent Neural Networks
Jasmine Collins
Jascha Narain Sohl-Dickstein
David Sussillo
353
213
0
29 Nov 2016
Previous
123...202122
Next