ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1512.02595
  4. Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
ArXiv (abs)PDFHTML

Papers citing "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"

50 / 1,096 papers shown
Title
Training ASR models by Generation of Contextual Information
Training ASR models by Generation of Contextual InformationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Kritika Singh
Dmytro Okhonko
Jun Liu
Yongqiang Wang
Frank Zhang
...
Sergey Edunov
Fuchun Peng
Yatharth Saraf
Geoffrey Zweig
Abdel-rahman Mohamed
136
7
0
27 Oct 2019
Meta Learning for End-to-End Low-Resource Speech Recognition
Meta Learning for End-to-End Low-Resource Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Jui-Yang Hsu
Yuan-Jui Chen
Hung-yi Lee
108
114
0
26 Oct 2019
A holistic approach to polyphonic music transcription with neural
  networks
A holistic approach to polyphonic music transcription with neural networksInternational Society for Music Information Retrieval Conference (ISMIR), 2019
Miguel A. Román
A. Pertusa
Jorge Calvo-Zaragoza
108
32
0
26 Oct 2019
Towards Online End-to-end Transformer Automatic Speech Recognition
Towards Online End-to-end Transformer Automatic Speech Recognition
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
152
32
0
25 Oct 2019
Recognizing long-form speech using streaming end-to-end models
Recognizing long-form speech using streaming end-to-end modelsAutomatic Speech Recognition & Understanding (ASRU), 2019
A. Narayanan
Rohit Prabhavalkar
Chung-Cheng Chiu
David Rybach
Tara N. Sainath
Trevor Strohman
160
135
0
24 Oct 2019
Low-frequency Compensated Synthetic Impulse Responses for Improved
  Far-field Speech Recognition
Low-frequency Compensated Synthetic Impulse Responses for Improved Far-field Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Zhenyu Tang
Hsien-Yu Meng
Tianyi Zhou
148
12
0
23 Oct 2019
Complex Transformer: A Framework for Modeling Complex-Valued Sequence
Complex Transformer: A Framework for Modeling Complex-Valued SequenceIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Muqiao Yang
Martin Q. Ma
Dongyu Li
Yifan Hao
Ruslan Salakhutdinov
ViT
220
52
0
22 Oct 2019
No-regret Non-convex Online Meta-Learning
No-regret Non-convex Online Meta-LearningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Zhenxun Zhuang
Yunlong Wang
Kezi Yu
Songtao Lu
CLLOffRL
226
15
0
22 Oct 2019
ELSA: A Throughput-Optimized Design of an LSTM Accelerator for
  Energy-Constrained Devices
ELSA: A Throughput-Optimized Design of an LSTM Accelerator for Energy-Constrained DevicesACM Transactions on Embedded Computing Systems (ACM TECS), 2019
E. Azari
S. Vrudhula
155
5
0
19 Oct 2019
Label-efficient audio classification through multitask learning and
  self-supervision
Label-efficient audio classification through multitask learning and self-supervision
Tyler Lee
Ting Gong
Suchismita Padhy
Andrew Rouditchenko
A. Ndirango
SSLVLM
121
7
0
19 Oct 2019
Do Explanations Reflect Decisions? A Machine-centric Strategy to
  Quantify the Performance of Explainability Algorithms
Do Explanations Reflect Decisions? A Machine-centric Strategy to Quantify the Performance of Explainability Algorithms
Z. Q. Lin
M. Shafiee
S. Bochkarev
Michael St. Jules
Xiao Yu Wang
A. Wong
FAtt
158
85
0
16 Oct 2019
Transformer ASR with Contextual Block Processing
Transformer ASR with Contextual Block ProcessingAutomatic Speech Recognition & Understanding (ASRU), 2019
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
224
69
0
16 Oct 2019
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2019
Xuankai Chang
Wangyou Zhang
Y. Qian
Jonathan Le Roux
Shinji Watanabe
166
124
0
15 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
vq-wav2vec: Self-Supervised Learning of Discrete Speech RepresentationsInternational Conference on Learning Representations (ICLR), 2019
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
549
713
0
12 Oct 2019
Orchestrating the Development Lifecycle of Machine Learning-Based IoT
  Applications: A Taxonomy and Survey
Orchestrating the Development Lifecycle of Machine Learning-Based IoT Applications: A Taxonomy and Survey
Bin Qian
Jie Su
Z. Wen
D. N. Jha
Yinhao Li
...
Albert Y. Zomaya
Omer F. Rana
Lizhe Wang
Maciej Koutny
R. Ranjan
233
4
0
11 Oct 2019
Hear "No Evil", See "Kenansville": Efficient and Transferable Black-Box
  Attacks on Speech Recognition and Voice Identification Systems
Hear "No Evil", See "Kenansville": Efficient and Transferable Black-Box Attacks on Speech Recognition and Voice Identification Systems
H. Abdullah
Muhammad Sajidur Rahman
Washington Garcia
Logan Blue
Kevin Warren
Anurag Swarnim Yadav
T. Shrimpton
Patrick Traynor
AAML
141
95
0
11 Oct 2019
Contract Statements Knowledge Service for Chatbots
Contract Statements Knowledge Service for ChatbotsIEEE International Conference on Systems, Man and Cybernetics (SMC), 2019
Boris Ruf
Matteo Sammarco
Marcin Detyniecki
AILaw
61
1
0
10 Oct 2019
One-To-Many Multilingual End-to-end Speech Translation
One-To-Many Multilingual End-to-end Speech TranslationAutomatic Speech Recognition & Understanding (ASRU), 2019
Mattia Antonino Di Gangi
Matteo Negri
Marco Turchi
164
52
0
08 Oct 2019
Sequence embeddings help to identify fraudulent cases in healthcare
  insurance
Sequence embeddings help to identify fraudulent cases in healthcare insurance
I. Fursov
A. Zaytsev
R. Khasyanov
Martin Spindler
Evgeny Burnaev
MedIm
164
8
0
07 Oct 2019
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid
  Speech Recognition
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2019
Duc Le
Xiaohui Zhang
Weiyi Zheng
C. Fügen
Geoffrey Zweig
M. Seltzer
167
65
0
02 Oct 2019
Recent Advances in End-to-End Spoken Language Understanding
Recent Advances in End-to-End Spoken Language UnderstandingInternational Conference on Statistical Language and Speech Processing (ICSLSP), 2019
N. Tomashenko
Antoine Caubrière
Yannick Esteve
Antoine Laurent
Emmanuel Morin
135
29
0
29 Sep 2019
Self-Attention Transducers for End-to-End Speech Recognition
Self-Attention Transducers for End-to-End Speech RecognitionInterspeech (Interspeech), 2019
Zhengkun Tian
Jiangyan Yi
Jianhua Tao
Ye Bai
Zhengqi Wen
AI4TS
144
75
0
28 Sep 2019
Improving RNN Transducer Modeling for End-to-End Speech Recognition
Improving RNN Transducer Modeling for End-to-End Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2019
Jinyu Li
Rui Zhao
Hu Hu
Jiawei Liu
170
176
0
26 Sep 2019
Scaling data-driven robotics with reward sketching and batch
  reinforcement learning
Scaling data-driven robotics with reward sketching and batch reinforcement learning
Serkan Cabi
Sergio Gomez Colmenarejo
Alexander Novikov
Ksenia Konyushkova
Scott E. Reed
...
David Barker
Jonathan Scholz
Misha Denil
Nando de Freitas
Ziyun Wang
OffRL
238
30
0
26 Sep 2019
DARTS: Dialectal Arabic Transcription System
DARTS: Dialectal Arabic Transcription System
Sameer Khurana
Ahmed M. Ali
James R. Glass
136
11
0
26 Sep 2019
High Fidelity Speech Synthesis with Adversarial Networks
High Fidelity Speech Synthesis with Adversarial NetworksInternational Conference on Learning Representations (ICLR), 2019
Mikolaj Binkowski
Jeff Donahue
Sander Dieleman
Aidan Clark
Erich Elsen
Norman Casagrande
Luis C. Cobo
Karen Simonyan
549
260
0
25 Sep 2019
Improving OOV Detection and Resolution with External Language Models in
  Acoustic-to-Word ASR
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASRSpoken Language Technology Workshop (SLT), 2018
Hirofumi Inaguma
Masato Mimura
S. Sakai
Tatsuya Kawahara
78
5
0
22 Sep 2019
Using Statistics to Automate Stochastic Optimization
Using Statistics to Automate Stochastic OptimizationNeural Information Processing Systems (NeurIPS), 2019
Hunter Lang
Pengchuan Zhang
Lin Xiao
128
24
0
21 Sep 2019
Distributed Parameter Estimation in Randomized One-hidden-layer Neural
  Networks
Distributed Parameter Estimation in Randomized One-hidden-layer Neural NetworksAmerican Control Conference (ACC), 2019
Yinsong Wang
Shahin Shahrampour
FedML
184
1
0
20 Sep 2019
A Simple yet Effective Baseline for Robust Deep Learning with Noisy
  Labels
A Simple yet Effective Baseline for Robust Deep Learning with Noisy Labels
Yucen Luo
Jun Zhu
Tomas Pfister
NoLa
192
7
0
20 Sep 2019
Training Robust Deep Neural Networks via Adversarial Noise Propagation
Training Robust Deep Neural Networks via Adversarial Noise PropagationIEEE Transactions on Image Processing (TIP), 2019
Aishan Liu
Xianglong Liu
Chongzhi Zhang
Hang Yu
Qiang Liu
Dacheng Tao
AAML
110
132
0
19 Sep 2019
A Comparison of Hybrid and End-to-End Models for Syllable Recognition
A Comparison of Hybrid and End-to-End Models for Syllable RecognitionInternational Conference on Text, Speech and Dialogue (TSD), 2019
Sebastian P. Bayerl
Korbinian Riedhammer
91
2
0
19 Sep 2019
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Espresso: A Fast End-to-end Neural Speech Recognition ToolkitAutomatic Speech Recognition & Understanding (ASRU), 2019
Yiming Wang
Tongfei Chen
Hainan Xu
Shuoyang Ding
Hang Lv
Yiwen Shao
Nanyun Peng
Lei Xie
Shinji Watanabe
Sanjeev Khudanpur
VLM
177
75
0
18 Sep 2019
Benchmarking the Performance and Energy Efficiency of AI Accelerators
  for AI Training
Benchmarking the Performance and Energy Efficiency of AI Accelerators for AI Training
Yuxin Wang
Qiang-qiang Wang
Shaoshuai Shi
Xin He
Zhenheng Tang
Kaiyong Zhao
Xiaowen Chu
352
4
0
15 Sep 2019
Multilingual Graphemic Hybrid ASR with Massive Data Augmentation
Multilingual Graphemic Hybrid ASR with Massive Data AugmentationWorkshop on Spoken Language Technologies for Under-resourced Languages (SLTU), 2019
Chunxi Liu
Qiaochu Zhang
Xiaohui Zhang
Kritika Singh
Yatharth Saraf
Geoffrey Zweig
233
31
0
14 Sep 2019
Human-Machine Collaborative Design for Accelerated Design of Compact
  Deep Neural Networks for Autonomous Driving
Human-Machine Collaborative Design for Accelerated Design of Compact Deep Neural Networks for Autonomous Driving
M. Shafiee
M. Nentwig
Y. Kassahun
Francis Li
S. Bochkarev
Akif Kamal
D. Dolson
Secil Altintas
Arif Virani
A. Wong
146
3
0
12 Sep 2019
A Survey of Techniques All Classifiers Can Learn from Deep Networks:
  Models, Optimizations, and Regularization
A Survey of Techniques All Classifiers Can Learn from Deep Networks: Models, Optimizations, and Regularization
Alireza Ghods
D. Cook
127
1
0
10 Sep 2019
Preech: A System for Privacy-Preserving Speech Transcription
Preech: A System for Privacy-Preserving Speech TranscriptionUSENIX Security Symposium (USENIX Security), 2019
Shimaa Ahmed
Amrita Roy Chowdhury
Kassem Fawaz
P. Ramanathan
371
50
0
09 Sep 2019
Training Deep Neural Networks Using Posit Number System
Training Deep Neural Networks Using Posit Number SystemACM Symposium on Cloud Computing (SoCC), 2019
Jinming Lu
Siyuan Lu
Zhisheng Wang
Chao Fang
Jun Lin
Zhongfeng Wang
Li Du
MQ
109
16
0
06 Sep 2019
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible
  Neural Processing Units
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing UnitsInternational Symposium on High-Performance Computer Architecture (HPCA), 2019
Yujeong Choi
Minsoo Rhu
133
151
0
06 Sep 2019
Learning without feedback: Fixed random learning signals allow for
  feedforward training of deep neural networks
Learning without feedback: Fixed random learning signals allow for feedforward training of deep neural networks
Charlotte Frenkel
M. Lefebvre
D. Bol
207
23
0
03 Sep 2019
Beyond Human-Level Accuracy: Computational Challenges in Deep Learning
Beyond Human-Level Accuracy: Computational Challenges in Deep LearningACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPoPP), 2019
Joel Hestness
Newsha Ardalani
G. Diamos
110
79
0
03 Sep 2019
Scale Calibrated Training: Improving Generalization of Deep Networks via
  Scale-Specific Normalization
Scale Calibrated Training: Improving Generalization of Deep Networks via Scale-Specific Normalization
Zhuoran Yu
Aojun Zhou
Yukun Ma
Yudian Li
Xiaohan Zhang
Ping Luo
101
3
0
31 Aug 2019
Approximating Stacked and Bidirectional Recurrent Architectures with the
  Delayed Recurrent Neural Network
Approximating Stacked and Bidirectional Recurrent Architectures with the Delayed Recurrent Neural Network
Javier S. Turek
Shailee Jain
Vy A. Vo
Mihai Capota
Alexander G. Huth
Theodore L. Willke
144
3
0
30 Aug 2019
Learning to Transfer Learn: Reinforcement Learning-Based Selection for
  Adaptive Transfer Learning
Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning
Linchao Zhu
Sercan O. Arik
Yezhou Yang
Tomas Pfister
193
5
0
29 Aug 2019
Environment Sound Classification using Multiple Feature Channels and Attention based Deep Convolutional Neural Network
Jivitesh Sharma
Ole-Christoffer Granmo
M. G. Olsen
372
17
0
28 Aug 2019
TabNet: Attentive Interpretable Tabular Learning
TabNet: Attentive Interpretable Tabular LearningAAAI Conference on Artificial Intelligence (AAAI), 2019
Sercan O. Arik
Tomas Pfister
LMTD
740
1,776
0
20 Aug 2019
Unpaired Image-to-Speech Synthesis with Multimodal Information
  Bottleneck
Unpaired Image-to-Speech Synthesis with Multimodal Information BottleneckIEEE International Conference on Computer Vision (ICCV), 2019
Shuang Ma
Daniel J. McDuff
Yale Song
174
28
0
19 Aug 2019
Survey on Deep Neural Networks in Speech and Vision Systems
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DVVLMAI4TS
349
224
0
16 Aug 2019
AIBench: An Industry Standard Internet Service AI Benchmark Suite
AIBench: An Industry Standard Internet Service AI Benchmark Suite
Wanling Gao
Fei Tang
Lei Wang
Jianfeng Zhan
Chunxin Lan
...
Yatao Li
Junchao Shao
Zhenyu Wang
Xiaoyu Wang
Hainan Ye
163
48
0
13 Aug 2019
Previous
123...131415...202122
Next