ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1512.02595
  4. Cited By
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

8 December 2015
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
Bryan Catanzaro
Jingdong Chen
Mike Chrzanowski
Adam Coates
G. Diamos
Erich Elsen
Jesse Engel
Linxi Fan
Christopher Fougner
T. Han
Awni Y. Hannun
Billy Jun
P. LeGresley
Libby Lin
Sharan Narang
A. Ng
Sherjil Ozair
R. Prenger
Jonathan Raiman
S. Satheesh
David Seetapun
Shubho Sengupta
Yi Wang
Zhiqian Wang
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
ArXiv (abs)PDFHTML

Papers citing "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin"

50 / 1,096 papers shown
Title
FrugalML: How to Use ML Prediction APIs More Accurately and Cheaply
FrugalML: How to Use ML Prediction APIs More Accurately and CheaplyNeural Information Processing Systems (NeurIPS), 2020
Lingjiao Chen
Matei A. Zaharia
James Zou
133
45
0
12 Jun 2020
A Practical Sparse Approximation for Real Time Recurrent Learning
A Practical Sparse Approximation for Real Time Recurrent Learning
Jacob Menick
Erich Elsen
Utku Evci
Simon Osindero
Karen Simonyan
Alex Graves
173
33
0
12 Jun 2020
Self-organization of multi-layer spiking neural networks
Self-organization of multi-layer spiking neural networks
G. Raghavan
Cong Lin
Matt Thomson
AI4CE
88
8
0
12 Jun 2020
Dataset Condensation with Gradient Matching
Dataset Condensation with Gradient MatchingInternational Conference on Learning Representations (ICLR), 2020
Bo Zhao
Konda Reddy Mopuri
Hakan Bilen
DD
593
619
0
10 Jun 2020
Modelling of daily reference evapotranspiration using deep neural
  network in different climates
Modelling of daily reference evapotranspiration using deep neural network in different climates
Atilla Özgür
S. Yamaç
205
7
0
02 Jun 2020
Insertion-Based Modeling for End-to-End Automatic Speech Recognition
Insertion-Based Modeling for End-to-End Automatic Speech RecognitionInterspeech (Interspeech), 2020
Yuya Fujita
Shinji Watanabe
Motoi Omachi
Xuankai Chan
215
33
0
27 May 2020
Misalignment Resilient Diffractive Optical Networks
Misalignment Resilient Diffractive Optical Networks
Deniz Mengu
Yifan Zhao
N. Yardimci
Y. Rivenson
Mona Jarrahi
Aydogan Ozcan
179
116
0
23 May 2020
End-to-end Named Entity Recognition from English Speech
End-to-end Named Entity Recognition from English Speech
Hemant Yadav
Sreyan Ghosh
Yi Yu
R. Shah
126
66
0
22 May 2020
Formant Tracking Using Dilated Convolutional Networks Through Dense
  Connection with Gating Mechanism
Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism
Wang Dai
Jinsong Zhang
Yingming Gao
Wei Wei
Dengfeng Ke
Binghuai Lin
Yanlu Xie
163
4
0
21 May 2020
Automated Question Answer medical model based on Deep Learning
  Technology
Automated Question Answer medical model based on Deep Learning Technology
Abdelrahman Abdallah
M. Kasem
Mohamed Hamada
Shaymaa Sdeek
LM&MAAI4MHMedIm
94
29
0
21 May 2020
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for
  End-to-End ASR
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR
Yiwen Shao
Yiming Wang
Daniel Povey
Sanjeev Khudanpur
AI4TS
142
39
0
20 May 2020
Deep learning approaches for neural decoding: from CNNs to LSTMs and
  spikes to fMRI
Deep learning approaches for neural decoding: from CNNs to LSTMs and spikes to fMRI
J. Livezey
Joshua I. Glaser
AI4CE
201
11
0
19 May 2020
A systematic comparison of grapheme-based vs. phoneme-based label units
  for encoder-decoder-attention models
A systematic comparison of grapheme-based vs. phoneme-based label units for encoder-decoder-attention models
Mohammad Zeineldeen
Albert Zeyer
Wei Zhou
T. Ng
Ralf Schluter
Hermann Ney
208
2
0
19 May 2020
Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
Frank Zhang
Yongqiang Wang
Xiaohui Zhang
Chunxi Liu
Yatharth Saraf
Geoffrey Zweig
274
20
0
19 May 2020
Robust Training of Vector Quantized Bottleneck Models
Robust Training of Vector Quantized Bottleneck Models
A. Lancucki
J. Chorowski
Guillaume Sanchez
R. Marxer
Nanxin Chen
Hans J. G. A. Dolfing
Sameer Khurana
Tanel Alumäe
Antoine Laurent
169
73
0
18 May 2020
Multi-modal Automated Speech Scoring using Attention Fusion
Multi-modal Automated Speech Scoring using Attention Fusion
Manraj Singh Grover
Yaman Kumar Singla
Sumit Sarin
Payman Vafaee
Mika Hama
R. Shah
152
13
0
17 May 2020
Large scale weakly and semi-supervised learning for low-resource video
  ASR
Large scale weakly and semi-supervised learning for low-resource video ASR
Kritika Singh
Vimal Manohar
Alex Xiao
Sergey Edunov
Ross B. Girshick
Vitaliy Liptchinsky
Christian Fuegen
Yatharth Saraf
Geoffrey Zweig
Abdel-rahman Mohamed
144
10
0
16 May 2020
A Novel Fusion of Attention and Sequence to Sequence Autoencoders to
  Predict Sleepiness From Speech
A Novel Fusion of Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech
Shahin Amiriparian
Pawel Winokurow
Vincent Karas
Sandra Ottl
Maurice Gerczuk
Björn W. Schuller
141
7
0
15 May 2020
FaceFilter: Audio-visual speech separation using still images
FaceFilter: Audio-visual speech separation using still images
Soo-Whan Chung
Soyeon Choe
Joon Son Chung
Hong-Goo Kang
CVBM
158
74
0
14 May 2020
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for
  Personalized Recommendations
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations
Ranggi Hwang
Taehun Kim
Youngeun Kwon
Minsoo Rhu
135
114
0
12 May 2020
deepSELF: An Open Source Deep Self End-to-End Learning Framework
deepSELF: An Open Source Deep Self End-to-End Learning Framework
Tomoya Koike
Kun Qian
Björn W. Schuller
Yoshiharu Yamamoto
SLRHAI
90
3
0
11 May 2020
Incremental Learning for End-to-End Automatic Speech Recognition
Incremental Learning for End-to-End Automatic Speech Recognition
Li Fu
Xiaoxiao Li
Libo Zi
Zhengchen Zhang
Youzheng Wu
Xiaodong He
Bowen Zhou
CLL
317
25
0
11 May 2020
RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and
  Solutions
RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Chung-Cheng Chiu
A. Narayanan
Wei Han
Rohit Prabhavalkar
Yu Zhang
...
Ruoming Pang
Tara N. Sainath
Patrick Nguyen
Liangliang Cao
Yonghui Wu
356
44
0
07 May 2020
AIBench Scenario: Scenario-distilling AI Benchmarking
AIBench Scenario: Scenario-distilling AI BenchmarkingInternational Conference on Parallel Architectures and Compilation Techniques (PACT), 2020
Wanling Gao
Fei Tang
Jianfeng Zhan
Xu Wen
Lei Wang
Zheng Cao
Chuanxin Lan
Chunjie Luo
Xiaoli Liu
Zihan Jiang
238
14
0
06 May 2020
Monitoring COVID-19 social distancing with person detection and tracking
  via fine-tuned YOLO v3 and Deepsort techniques
Monitoring COVID-19 social distancing with person detection and tracking via fine-tuned YOLO v3 and Deepsort techniques
Narinder Singh Punn
S. K. Sonbhadra
Sonali Agarwal
Gaurav Rai
251
250
0
04 May 2020
MultiQT: Multimodal Learning for Real-Time Question Tracking in Speech
MultiQT: Multimodal Learning for Real-Time Question Tracking in SpeechAnnual Meeting of the Association for Computational Linguistics (ACL), 2020
Jakob Drachmann Havtorn
Jan Latko
Joakim Edin
Lasse Borgholt
Lars Maaløe
Lorenzo Belgrano
Nicolai Frost Jakobsen
R. Sdun
Zeljko Agic
135
3
0
02 May 2020
AIBench Training: Balanced Industry-Standard AI Training Benchmarking
AIBench Training: Balanced Industry-Standard AI Training Benchmarking
Fei Tang
Wanling Gao
Jianfeng Zhan
Chuanxin Lan
Xu Wen
...
Yatao Li
Junchao Shao
Zhenyu Wang
Xiaoyu Wang
Hainan Ye
163
3
0
30 Apr 2020
Quantized Adam with Error Feedback
Quantized Adam with Error FeedbackACM Transactions on Intelligent Systems and Technology (ACM TIST), 2020
Congliang Chen
Li Shen
Haozhi Huang
Wei Liu
ODLMQ
127
38
0
29 Apr 2020
Caramel: Accelerating Decentralized Distributed Deep Learning with
  Computation Scheduling
Caramel: Accelerating Decentralized Distributed Deep Learning with Computation Scheduling
Sayed Hadi Hashemi
Sangeetha Abdu Jyothi
Brighten Godfrey
R. Campbell
120
2
0
29 Apr 2020
A Summary of the First Workshop on Language Technology for Language
  Documentation and Revitalization
A Summary of the First Workshop on Language Technology for Language Documentation and RevitalizationWorkshop on Spoken Language Technologies for Under-resourced Languages (SLTU), 2020
Graham Neubig
Shruti Rijhwani
Alexis Palmer
Jordan MacKenzie
Hilaria Cruz
...
Yiyuan Li
S. Zink
Mengzhou Xia
Roshan S. Sharma
Patrick Littell
104
8
0
27 Apr 2020
Research on Modeling Units of Transformer Transducer for Mandarin Speech
  Recognition
Research on Modeling Units of Transformer Transducer for Mandarin Speech Recognition
Li Fu
Xiaoxiao Li
Libo Zi
149
5
0
26 Apr 2020
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic
  Speech Recognition of Contact Centers
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact CentersInterspeech (Interspeech), 2020
Jung-Woo Ha
KiHyun Nam
Jin Gu Kang
Sang-Woo Lee
Sohee Yang
...
Hyun Ah Kim
Kyoungtae Doh
C. Lee
Nako Sung
Sunghun Kim
152
31
0
20 Apr 2020
COVID-19 Time-series Prediction by Joint Dictionary Learning and Online
  NMF
COVID-19 Time-series Prediction by Joint Dictionary Learning and Online NMF
Hanbaek Lyu
Christopher Strohmeier
G. Menz
Deanna Needell
133
12
0
20 Apr 2020
Efficient Synthesis of Compact Deep Neural Networks
Efficient Synthesis of Compact Deep Neural NetworksDesign Automation Conference (DAC), 2020
Wenhan Xia
Hongxu Yin
N. Jha
142
3
0
18 Apr 2020
Direct Speech-to-image Translation
Direct Speech-to-image TranslationIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2020
Jiguo Li
Xinfeng Zhang
Chuanmin Jia
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
149
29
0
07 Apr 2020
Improving Perceptual Quality of Drum Transcription with the Expanded
  Groove MIDI Dataset
Improving Perceptual Quality of Drum Transcription with the Expanded Groove MIDI Dataset
Lee F. Callender
Curtis Hawthorne
Jesse Engel
225
26
0
01 Apr 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
Serialized Output Training for End-to-End Overlapped Speech RecognitionInterspeech (Interspeech), 2020
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
193
141
0
28 Mar 2020
Enabling Efficient and Flexible FPGA Virtualization for Deep Learning in
  the Cloud
Enabling Efficient and Flexible FPGA Virtualization for Deep Learning in the CloudIEEE Symposium on Field-Programmable Custom Computing Machines (FCCM), 2020
Shulin Zeng
Guohao Dai
Hanbo Sun
Kai Zhong
Guangjun Ge
Kaiyuan Guo
Yu Wang
Huazhong Yang
109
17
0
26 Mar 2020
Depth Enables Long-Term Memory for Recurrent Neural Networks
Depth Enables Long-Term Memory for Recurrent Neural Networks
A. Ziv
84
0
0
23 Mar 2020
Training for Speech Recognition on Coprocessors
Training for Speech Recognition on Coprocessors
Sebastian Baunsgaard
S. Wrede
Pınar Tözün
96
6
0
22 Mar 2020
Communication-Efficient Distributed Deep Learning: A Comprehensive
  Survey
Communication-Efficient Distributed Deep Learning: A Comprehensive Survey
Zhenheng Tang
Shaoshuai Shi
Wei Wang
Yue Liu
Xiaowen Chu
222
54
0
10 Mar 2020
Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection
Good Subnetworks Provably Exist: Pruning via Greedy Forward SelectionInternational Conference on Machine Learning (ICML), 2020
Mao Ye
Chengyue Gong
Lizhen Nie
Denny Zhou
Adam R. Klivans
Qiang Liu
338
120
0
03 Mar 2020
Untangling in Invariant Speech Recognition
Untangling in Invariant Speech RecognitionNeural Information Processing Systems (NeurIPS), 2020
Cory Stephenson
J. Feather
Suchismita Padhy
Oguz H. Elibol
Hanlin Tang
Josh H. McDermott
SueYeon Chung
SSL
197
33
0
03 Mar 2020
Deep Learning in Memristive Nanowire Networks
Deep Learning in Memristive Nanowire Networks
Jack D. Kendall
Ross D. Pantone
J. Nino
100
3
0
03 Mar 2020
Improving Uyghur ASR systems with decoders using morpheme-based language
  models
Improving Uyghur ASR systems with decoders using morpheme-based language modelsIEEE Joint International Information Technology and Artificial Intelligence Conference (JITAI), 2020
Zi-Jin Qiu
Wei Jiang
Turghunjan Mamut
AI4CE
180
4
0
03 Mar 2020
Convo: What does conversational programming need? An exploration of
  machine learning interface design
Convo: What does conversational programming need? An exploration of machine learning interface designIEEE Symposium on Visual Languages / Human-Centric Computing Languages and Environments (VL/HCC), 2020
Jessica Van Brummelen
Kevin Weng
Phoebe Lin
C. Yeo
122
20
0
03 Mar 2020
Natural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A Survey
A. Torfi
Rouzbeh A. Shirvani
Yaser Keneshloo
Nader Tavvaf
Edward A. Fox
AI4CEVLM
573
248
0
02 Mar 2020
Towards Automatic Face-to-Face Translation
Towards Automatic Face-to-Face TranslationACM Multimedia (ACM MM), 2019
Prajwal K R
Rudrabha Mukhopadhyay
Jerin Philip
Abhishek Jha
Vinay P. Namboodiri
C. V. Jawahar
CVBM
216
201
0
01 Mar 2020
Graphcore C2 Card performance for image-based deep learning application:
  A Report
Graphcore C2 Card performance for image-based deep learning application: A Report
Ilyes Kacher
Maxime Portaz
Hicham Randrianarivo
Sylvain Peyronnet
GNNBDLVLM
247
13
0
26 Feb 2020
A.I. based Embedded Speech to Text Using Deepspeech
A.I. based Embedded Speech to Text Using Deepspeech
Muhammad Hafidh Firmansyah
Anand Paul
D. Bhattacharya
Gul Malik Urfa
76
6
0
25 Feb 2020
Previous
123...111213...202122
Next