Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
arXiv: 1512.02595

Papers citing "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"

50 / 1,096 papers shown
Impact of Artificial Intelligence on Businesses: from Research, Innovation, Market Deployment to Future Shifts in Business Models
N. Soni
E. Sharma
Narotam Singh
A. Kapoor
82
74
0
03 May 2019
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text
M. Baskar
Shinji Watanabe
Ramón Fernández Astudillo
Takaaki Hori
L. Burget
J. Černocký
183
40
0
30 Apr 2019
Forget the Learning Rate, Decay Loss
Jiakai Wei
101
9
0
27 Apr 2019
Realizing Petabyte Scale Acoustic Modeling
S. Parthasarathi
Nitin Sivakrishnan
Pranav Ladkat
N. Strom
114
11
0
24 Apr 2019
MinCall - MinION end2end convolutional deep learning basecaller
N. Miculinic
Marko Ratkovic
M. Šikić
66
12
0
22 Apr 2019
An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions
Aswin Shanmugam Subramanian
Xiaofei Wang
Shinji Watanabe
T. Taniguchi
Dung T. Tran
Yuya Fujita
166
20
0
19 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
156
48
0
17 Apr 2019
A Multi-Task Learning Framework for Overcoming the Catastrophic Forgetting in Automatic Speech Recognition
Jiabin Xue
Jiqing Han
Tieran Zheng
Yantao Du
Jiaxing Guo
CLL
131
9
0
17 Apr 2019
Hard Sample Mining for the Improved Retraining of Automatic Speech Recognition
Jiabin Xue
Jiqing Han
Tieran Zheng
Jiaxing Guo
Boyong Wu
114
10
0
17 Apr 2019
Predicting Time-to-Failure of Plasma Etching Equipment using Machine Learning
Anahid N. Jalali
Clemens Heistracher
Alexander Schindler
Bernhard Haslhofer
Tanja Nemeth
Robert Glawar
W. Sihn
Peter De Boer
56
21
0
16 Apr 2019
SpeechYOLO: Detection and Localization of Speech Objects
Yael Segal
T. Fuchs
Joseph Keshet
ObjD
135
18
0
14 Apr 2019
wav2vec: Unsupervised Pre-training for Speech Recognition
Steffen Schneider
Alexei Baevski
R. Collobert
Michael Auli
SSL
370
721
0
11 Apr 2019
From Semi-supervised to Almost-unsupervised Speech Recognition with Very-low Resource by Jointly Learning Phonetic Structures from Audio and Text Embeddings
Yi-Chen Chen
Sung-Feng Huang
Hung-yi Lee
Lin-Shan Lee
SSL
124
0
0
10 Apr 2019
Distributed Deep Learning Strategies For Automatic Speech Recognition
Wei Zhang
Xiaodong Cui
Ulrich Finkler
Brian Kingsbury
G. Saon
David S. Kung
M. Picheny
134
30
0
10 Apr 2019
Who Needs Words? Lexicon-Free Speech Recognition
Tatiana Likhomanenko
Gabriel Synnaeve
R. Collobert
228
27
0
09 Apr 2019
Speech Model Pre-training for End-to-End Spoken Language Understanding
Loren Lugosch
Mirco Ravanelli
Patrick Ignoto
Vikrant Singh Tomar
Yoshua Bengio
SyDa AuLLM
196
376
0
07 Apr 2019
On The Power of Curriculum Learning in Training Deep Networks
Guy Hacohen
D. Weinshall
ODL
259
515
0
07 Apr 2019
Jasper: An End-to-End Convolutional Neural Acoustic Model
Jason Chun Lok Li
Vitaly Lavrukhin
Boris Ginsburg
Ryan Leary
Oleksii Kuchaiev
Jonathan M. Cohen
Huyen Nguyen
R. Gadde
DRL VLM AuLLM
251
277
0
05 Apr 2019
Lessons from Building Acoustic Models with a Million Hours of Speech
S. Parthasarathi
N. Strom
178
89
0
02 Apr 2019
Learning Shared Encoding Representation for End-to-End Speech Recognition Models
T. Nguyen
Sebastian Stüker
A. Waibel
138
2
0
31 Mar 2019
On Arrhythmia Detection by Deep Learning and Multidimensional Representation
K. S. Rajput
Sandi Wibowo
Chen Hao
M. Majmudar
AI4TS
76
17
0
30 Mar 2019
Attention-Augmented End-to-End Multi-Task Learning for Emotion Prediction from Speech
Zixing Zhang
Bingwen Wu
Bjoern Schuller
141
86
0
29 Mar 2019
Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data
Zixing Zhang
Jing Han
Kun Qian
C. Janott
Yanan Guo
Bjoern Schuller
130
46
0
29 Mar 2019
k-Same-Siamese-GAN: k-Same Algorithm with Generative Adversarial Network for Facial Image De-identification with Hyperparameter Tuning and Mixed Precision Training
Yi-Lun Pan
Min-Jhih Haung
Kuo-Teng Ding
Ja-Ling Wu
J. Jang
PICV
108
21
0
27 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
113
18
0
27 Mar 2019
Practical Hidden Voice Attacks against Speech and Speaker Recognition Systems
Network and Distributed System Security Symposium (NDSS), 2019
H. Abdullah
Washington Garcia
Christian Peeters
Patrick Traynor
Kevin R. B. Butler
Joseph N. Wilson
AAML
152
177
0
18 Mar 2019
A Research Agenda: Dynamic Models to Defend Against Correlated Attacks
Ian Goodfellow
AAML OOD
161
33
0
14 Mar 2019
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Yangyang Shi
M. Hwang
X. Lei
AI4TS
102
15
0
12 Mar 2019
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Egor Lakomkin
S. Magg
C. Weber
S. Wermter
113
20
0
01 Mar 2019
Incorporating End-to-End Speech Recognition Models for Sentiment Analysis
IEEE International Conference on Robotics and Automation (ICRA), 2019
Egor Lakomkin
M. Zamani
C. Weber
S. Magg
S. Wermter
154
24
0
28 Feb 2019
No Padding Please: Efficient Neural Handwriting Recognition
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2019
Gideon Maillette de Buy Wenniger
Lambert Schomaker
Andy Way
131
20
0
28 Feb 2019
Interaction-aware Kalman Neural Networks for Trajectory Prediction
Ce Ju
Liang Luo
Cheng Long
Xiaoyu Zhang
D. Chang
297
61
0
28 Feb 2019
An Empirical Study of Large-Batch Stochastic Gradient Descent with Structured Covariance Noise
Yeming Wen
Kevin Luk
Maxime Gazeau
Guodong Zhang
Harris Chan
Jimmy Ba
ODL
329
24
0
21 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences
Albert Haque
Michelle Guo
Prateek Verma
Li Fei-Fei
117
52
0
20 Feb 2019
STRIP: A Defence Against Trojan Attacks on Deep Neural Networks
Yansong Gao
Chang Xu
Derui Wang
Shiping Chen
Damith C. Ranasinghe
Surya Nepal
AAML
323
923
0
18 Feb 2019
A Fully Differentiable Beam Search Decoder
R. Collobert
Awni Y. Hannun
Gabriel Synnaeve
157
43
0
16 Feb 2019
Using multi-task learning to improve the performance of acoustic-to-word and conventional hybrid models
T. Nguyen
Sebastian Stüker
A. Waibel
177
1
0
02 Feb 2019
Robust Inference via Generative Classifiers for Handling Noisy Labels
International Conference on Machine Learning (ICML), 2019
Kimin Lee
Sukmin Yun
Kibok Lee
Honglak Lee
Yue Liu
Jinwoo Shin
NoLa
283
154
0
31 Jan 2019
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM
Hongxu Yin
Guoyang Chen
Yingmin Li
Shuai Che
Weifeng Zhang
N. Jha
105
10
0
30 Jan 2019
FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture
Yu Ji
Youyang Zhang
Xinfeng Xie
Shuangchen Li
Peiqi Wang
Xing Hu
Youhui Zhang
Yuan Xie
137
57
0
28 Jan 2019
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Julian Salazar
Katrin Kirchhoff
Zhiheng Huang
AI4TS
247
123
0
22 Jan 2019
Robust Watermarking of Neural Network with Exponential Weighting
Ryota Namba
Jun Sakuma
AAML
160
151
0
18 Jan 2019
Towards Using Context-Dependent Symbols in CTC Without State-Tying Decision Trees
J. Chorowski
A. Lancucki
Bartosz Kostka
Michal Zapotoczny
162
5
0
14 Jan 2019
Exploring spectro-temporal features in end-to-end convolutional neural networks
Sean Robertson
Gerald Penn
Yingxue Wang
137
4
0
01 Jan 2019
Towards a Theoretical Understanding of Hashing-Based Neural Nets
Yibo Lin
Zhao Song
Lin F. Yang
152
7
0
26 Dec 2018
A Multiversion Programming Inspired Approach to Detecting Audio Adversarial Examples
Qiang Zeng
Jianhai Su
Chenglong Fu
Golam Kayas
Lannan Luo
AAML
110
51
0
26 Dec 2018
Pansori: ASR Corpus Generation from Open Online Video Contents
Yoona Choi
Bowon Lee
75
6
0
23 Dec 2018
Deep learning incorporating biologically-inspired neural dynamics
Stanisław Woźniak
A. Pantazi
Thomas Bohnstingl
E. Eleftheriou
129
9
0
17 Dec 2018
Fully Convolutional Speech Recognition
Neil Zeghidour
Qiantong Xu
Vitaliy Liptchinsky
Nicolas Usunier
Gabriel Synnaeve
R. Collobert
200
95
0
17 Dec 2018
To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition
Yossi Adi
Neil Zeghidour
R. Collobert
Nicolas Usunier
Vitaliy Liptchinsky
Gabriel Synnaeve
216
41
0
09 Dec 2018