ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1512.02595
  4. Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
ArXiv (abs)PDFHTML

Papers citing "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"

50 / 1,096 papers shown
Title
Elastic Gossip: Distributing Neural Network Training Using Gossip-like
  Protocols
Elastic Gossip: Distributing Neural Network Training Using Gossip-like Protocols
Siddharth Pramod
FedML
71
2
0
06 Dec 2018
Explainable and Explicit Visual Reasoning over Scene Graphs
Explainable and Explicit Visual Reasoning over Scene Graphs
Jiaxin Shi
Hanwang Zhang
Juan-Zi Li
OCL
418
250
0
05 Dec 2018
Evaluating Bayesian Deep Learning Methods for Semantic Segmentation
Evaluating Bayesian Deep Learning Methods for Semantic Segmentation
Jishnu Mukhoti
Y. Gal
UQCVBDL
245
245
0
30 Nov 2018
On the Inductive Bias of Word-Character-Level Multi-Task Learning for
  Speech Recognition
On the Inductive Bias of Word-Character-Level Multi-Task Learning for Speech Recognition
Jan Kremer
Lasse Borgholt
Lars Maaløe
126
6
0
28 Nov 2018
A Gray Box Interpretable Visual Debugging Approach for Deep Sequence
  Learning Model
A Gray Box Interpretable Visual Debugging Approach for Deep Sequence Learning Model
Md. Mofijul Islam
Amar Debnath
Tahsin Al Sayeed
Jyotirmay Nag Setu
Md Mahmudur Rahman
Md. Sadman Sakib
M. Razzaque
Md. Mosaddek Khan
Swakkhar Shatabda
HAIAAMLVLM
85
0
0
20 Nov 2018
FALCON: A Fourier Transform Based Approach for Fast and Secure
  Convolutional Neural Network Predictions
FALCON: A Fourier Transform Based Approach for Fast and Secure Convolutional Neural Network PredictionsComputer Vision and Pattern Recognition (CVPR), 2018
Shaohua Li
Kaiping Xue
Chenkai Ding
Xindi Gao
David S. L. Wei
Tao Wan
F. Wu
142
80
0
20 Nov 2018
Stochastic Adaptive Neural Architecture Search for Keyword Spotting
Stochastic Adaptive Neural Architecture Search for Keyword SpottingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018
Tom Véniat
Olivier Schwander
Ludovic Denoyer
AI4TS
112
27
0
16 Nov 2018
Beam Search Decoding using Manner of Articulation Detection Knowledge
  Derived from Connectionist Temporal Classification
Beam Search Decoding using Manner of Articulation Detection Knowledge Derived from Connectionist Temporal Classification
P. Rangan
K. S. Rao
75
0
0
16 Nov 2018
Learning to Predict the Cosmological Structure Formation
Learning to Predict the Cosmological Structure FormationProceedings of the National Academy of Sciences of the United States of America (PNAS), 2018
Siyu He
Yin Li
Yu Feng
S. Ho
Siamak Ravanbakhsh
Wei Chen
Barnabás Póczós
274
181
0
15 Nov 2018
Automatic Grammar Augmentation for Robust Voice Command Recognition
Automatic Grammar Augmentation for Robust Voice Command Recognition
Yang Yang
Anusha Lalitha
Jinwon Lee
Chris Lott
92
3
0
14 Nov 2018
Modular Networks: Learning to Decompose Neural Computation
Modular Networks: Learning to Decompose Neural Computation
Louis Kirsch
Julius Kunze
David Barber
156
122
0
13 Nov 2018
An Online Attention-based Model for Speech Recognition
An Online Attention-based Model for Speech Recognition
Ruchao Fan
Pan Zhou
Wei Chen
Jia Jia
Gang Liu
148
48
0
13 Nov 2018
Sequence-Level Knowledge Distillation for Model Compression of
  Attention-based Sequence-to-Sequence Speech Recognition
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Raden Muáz Muním
Nakamasa Inoue
Koichi Shinoda
115
26
0
12 Nov 2018
A Convergence Theory for Deep Learning via Over-Parameterization
A Convergence Theory for Deep Learning via Over-ParameterizationInternational Conference on Machine Learning (ICML), 2018
Zeyuan Allen-Zhu
Yuanzhi Li
Zhao Song
AI4CEODL
976
1,550
0
09 Nov 2018
RNNFast: An Accelerator for Recurrent Neural Networks Using Domain Wall
  Memory
RNNFast: An Accelerator for Recurrent Neural Networks Using Domain Wall MemoryACM Journal on Emerging Technologies in Computing Systems (ACM JETC), 2018
Mohammad Hossein Samavatian
Anys Bacha
Li Zhou
R. Teodorescu
118
7
0
07 Nov 2018
Analysis of Multilingual Sequence-to-Sequence speech recognition systems
Analysis of Multilingual Sequence-to-Sequence speech recognition systemsInterspeech (Interspeech), 2018
Jiayang Liu
M. Baskar
Weiming Zhang
Takaaki Hori
Sanjeev Khudanpur
Jan ''Honza'' Cernocký
176
18
0
07 Nov 2018
CAAD 2018: Iterative Ensemble Adversarial Attack
CAAD 2018: Iterative Ensemble Adversarial Attack
Jiayang Liu
Weiming Zhang
Nenghai Yu
AAML
123
4
0
07 Nov 2018
CNN-based MultiChannel End-to-End Speech Recognition for everyday home
  environments
CNN-based MultiChannel End-to-End Speech Recognition for everyday home environmentsEuropean Signal Processing Conference (EUSIPCO), 2018
Hyungjun Lim
Younggwan Kim
Takaaki Hori
Myunghun Jung
Hoirin Kim
144
12
0
07 Nov 2018
Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for
  Speech Recognition
Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech RecognitionIEEE Signal Processing Letters (IEEE SPL), 2018
Jonathan Lee
Michael Laskey
Bo-Kyeong Kim
A. Aswani
Soo-Young Lee
132
18
0
06 Nov 2018
When CTC Training Meets Acoustic Landmarks
When CTC Training Meets Acoustic Landmarks
Di He
Xuesong Yang
Johan Rohdin
Yi Liang
Themos Stafylakis
Deming Chen
99
11
0
05 Nov 2018
The Marchex 2018 English Conversational Telephone Speech Recognition
  System
The Marchex 2018 English Conversational Telephone Speech Recognition System
Xiaofeng Liu
Zhenhua Guo
J. You
B. Kumar
179
1
0
05 Nov 2018
Manner of Articulation Detection using Connectionist Temporal
  Classification to Improve Automatic Speech Recognition Performance
Manner of Articulation Detection using Connectionist Temporal Classification to Improve Automatic Speech Recognition Performance
R. Pradeep
K. S. Rao
56
2
0
05 Nov 2018
Dynamic Representations Toward Efficient Inference on Deep Neural
  Networks by Decision Gates
Dynamic Representations Toward Efficient Inference on Deep Neural Networks by Decision Gates
Mohammad Saeed Shafiee
M. Shafiee
A. Wong
AI4CE
131
5
0
05 Nov 2018
Adversarial Training of End-to-end Speech Recognition Using a
  Criticizing Language Model
Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model
Alexander H. Liu
Hung-yi Lee
Lin-Shan Lee
AuLLM
118
47
0
02 Nov 2018
Cycle-consistency training for end-to-end speech recognition
Cycle-consistency training for end-to-end speech recognition
Takaaki Hori
Ramón Fernández Astudillo
Tomoki Hayashi
Yu Zhang
Shinji Watanabe
Jonathan Le Roux
177
89
0
02 Nov 2018
Training Neural Speech Recognition Systems with Synthetic Speech
  Augmentation
Training Neural Speech Recognition Systems with Synthetic Speech Augmentation
Jason Chun Lok Li
R. Gadde
Boris Ginsburg
Vitaly Lavrukhin
137
58
0
02 Nov 2018
Democratizing Production-Scale Distributed Deep Learning
Democratizing Production-Scale Distributed Deep Learning
Minghuang Ma
Hadi Pouransari
Daniel Chao
Saurabh N. Adya
S. Serrano
Yi Qin
Dan Gimnicher
Dominic Walsh
MoE
295
6
0
31 Oct 2018
Towards End-to-End Code-Switching Speech Recognition
Towards End-to-End Code-Switching Speech Recognition
Ne Luo
Dongwei Jiang
Shuaijiang Zhao
Caixia Gong
Wei Zou
Xiangang Li
171
47
0
31 Oct 2018
Towards End-to-end Automatic Code-Switching Speech Recognition
Towards End-to-end Automatic Code-Switching Speech Recognition
Genta Indra Winata
Andrea Madotto
Chien-Sheng Wu
Pascale Fung
97
12
0
30 Oct 2018
On the Convergence Rate of Training Recurrent Neural Networks
On the Convergence Rate of Training Recurrent Neural Networks
Zeyuan Allen-Zhu
Yuanzhi Li
Zhao Song
560
198
0
29 Oct 2018
An improved hybrid CTC-Attention model for speech recognition
An improved hybrid CTC-Attention model for speech recognition
Zhe Yuan
Zhuoran Lyu
Jiwei Li
Xi Zhou
126
11
0
29 Oct 2018
Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech
  Recognition
Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition
Xinpei Zhou
Jiwei Li
Xi Zhou
73
4
0
29 Oct 2018
Robust Audio Adversarial Example for a Physical Attack
Robust Audio Adversarial Example for a Physical Attack
Hiromu Yakura
Jun Sakuma
AAML
149
201
0
28 Oct 2018
A novel pyramidal-FSMN architecture with lattice-free MMI for speech
  recognition
A novel pyramidal-FSMN architecture with lattice-free MMI for speech recognition
Xuerui Yang
Jiwei Li
Xi Zhou
162
15
0
26 Oct 2018
Language Modeling at Scale
Language Modeling at Scale
Md. Mostofa Ali Patwary
Milind Chabbi
Heewoo Jun
Jiaji Huang
G. Diamos
Kenneth Church
ALM
93
5
0
23 Oct 2018
To Compress, or Not to Compress: Characterizing Deep Learning Model
  Compression for Embedded Inference
To Compress, or Not to Compress: Characterizing Deep Learning Model Compression for Embedded Inference
Qing Qin
Jie Ren
Jia-Le Yu
Ling Gao
Hai Wang
Jie Zheng
Yansong Feng
Jianbin Fang
Zheng Wang
158
28
0
21 Oct 2018
Learning Models with Uniform Performance via Distributionally Robust
  Optimization
Learning Models with Uniform Performance via Distributionally Robust Optimization
John C. Duchi
Hongseok Namkoong
OOD
487
466
0
20 Oct 2018
SCALE-Sim: Systolic CNN Accelerator Simulator
SCALE-Sim: Systolic CNN Accelerator Simulator
A. Samajdar
Yuhao Zhu
P. Whatmough
Matthew Mattina
Tushar Krishna
169
136
0
16 Oct 2018
Mixture of Expert/Imitator Networks: Scalable Semi-supervised Learning
  Framework
Mixture of Expert/Imitator Networks: Scalable Semi-supervised Learning Framework
Shun Kiyono
Jun Suzuki
Kentaro Inui
138
9
0
13 Oct 2018
Multimodal Speech Emotion Recognition Using Audio and Text
Multimodal Speech Emotion Recognition Using Audio and Text
Seunghyun Yoon
Seokhyun Byun
Kyomin Jung
189
334
0
10 Oct 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation
  Approach Using Neural Networks
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
134
87
0
08 Oct 2018
Security Analysis of Deep Neural Networks Operating in the Presence of
  Cache Side-Channel Attacks
Security Analysis of Deep Neural Networks Operating in the Presence of Cache Side-Channel Attacks
Sanghyun Hong
Michael Davinroy
Yigitcan Kaya
S. Locke
Ian Rackow
Kevin Kulda
Dana Dachman-Soled
Tudor Dumitras
MIACV
153
94
0
08 Oct 2018
Deep Learning Approaches for Understanding Simple Speech Commands
Deep Learning Approaches for Understanding Simple Speech Commands
R. Solovyev
Maxim Vakhrushev
Alexander Radionov
Vladimir Aliev
Alexey A. Shvets
VLM
158
34
0
04 Oct 2018
Throughput Optimizations for FPGA-based Deep Neural Network Inference
Throughput Optimizations for FPGA-based Deep Neural Network InferenceMicroprocessors and microsystems (MM), 2018
Thorbjörn Posewsky
Daniel Ziener
80
26
0
28 Sep 2018
Learning Recurrent Binary/Ternary Weights
Learning Recurrent Binary/Ternary WeightsInternational Conference on Learning Representations (ICLR), 2018
A. Ardakani
Zhengyun Ji
S. C. Smithson
B. Meyer
W. Gross
MQ
246
28
0
28 Sep 2018
Towards Efficient and Secure Delivery of Data for Training and Inference
  with Privacy-Preserving
Towards Efficient and Secure Delivery of Data for Training and Inference with Privacy-Preserving
Juncheng Shen
Juzheng Liu
Yiran Chen
Hai Helen Li
FedML
285
1
0
20 Sep 2018
End-to-end Audiovisual Speech Activity Detection with Bimodal Recurrent
  Neural Models
End-to-end Audiovisual Speech Activity Detection with Bimodal Recurrent Neural Models
Fei Tao
John H. L. Hansen
136
35
0
12 Sep 2018
PhaseLink: A Deep Learning Approach to Seismic Phase Association
PhaseLink: A Deep Learning Approach to Seismic Phase Association
Zachary E. Ross
Yisong Yue
Men‐Andrin Meier
E. Hauksson
T. Heaton
166
174
0
08 Sep 2018
MixUp as Locally Linear Out-Of-Manifold Regularization
MixUp as Locally Linear Out-Of-Manifold Regularization
Hongyu Guo
Yongyi Mao
Richong Zhang
330
340
0
07 Sep 2018
Single-Microphone Speech Enhancement and Separation Using Deep Learning
Single-Microphone Speech Enhancement and Separation Using Deep Learning
Morten Kolbaek
159
7
0
31 Aug 2018
Previous
123...161718...202122
Next