Deep Speech: Scaling up end-to-end speech recognition

17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 768 papers shown
Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition
IEEE Signal Processing Magazine (IEEE Signal Process. Mag.), 2020
Xiaodong Cui
Wei Zhang
Ulrich Finkler
G. Saon
M. Picheny
David S. Kung
121
19
0
24 Feb 2020
Semi-Supervised Speech Recognition via Local Prior Matching
Wei-Ning Hsu
Ann Lee
Gabriel Synnaeve
Awni Y. Hannun
SSL
236
31
0
24 Feb 2020
Uncertainty Estimation in Autoregressive Structured Prediction
A. Malinin
Mark Gales
UQLM
241
9
0
18 Feb 2020
Identifying Audio Adversarial Examples via Anomalous Pattern Detection
Victor Akinwande
C. Cintas
Skyler Speakman
Srihari Sridharan
AAML
185
18
0
13 Feb 2020
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2020
Hao Zhou
Wen-gang Zhou
Yun Zhou
Houqiang Li
NoLa
137
235
0
08 Feb 2020
RPN: A Residual Pooling Network for Efficient Federated Learning
European Conference on Artificial Intelligence (ECAI), 2020
Anbu Huang
Yuanyuan Chen
Yang Liu
Tianjian Chen
Qiang Yang
FedML
199
12
0
23 Jan 2020
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard
Interspeech (Interspeech), 2020
Zoltán Tüske
G. Saon
Kartik Audhkhasi
Brian Kingsbury
BDL
172
70
0
20 Jan 2020
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization Approach for Deep Neural Networks
Journal of Artificial Intelligence Research (JAIR), 2020
Md Shahriar Iqbal
Jianhai Su
Lars Kotthoff
Pooyan Jamshidi
182
3
0
18 Jan 2020
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends
S. Latif
R. Rana
Sara Khalifa
Raja Jurdak
Junaid Qadir
Björn W. Schuller
AI4TS
346
87
0
02 Jan 2020
RC-DARTS: Resource Constrained Differentiable Architecture Search
Xiaojie Jin
Jiang Wang
Joshua Slocum
Ming-Hsuan Yang
Shengyang Dai
Shuicheng Yan
Jiashi Feng
137
34
0
30 Dec 2019
An Analysis of the Expressiveness of Deep Neural Network Architectures Based on Their Lipschitz Constants
Siqi Zhou
Angela P. Schoellig
67
13
0
24 Dec 2019
Incorporating Unlabeled Data into Distributionally Robust Learning
Journal of machine learning research (JMLR), 2019
Charlie Frogner
Sebastian Claici
Edward Chien
Justin Solomon
OOD
159
27
0
16 Dec 2019
Common Voice: A Massively-Multilingual Speech Corpus
International Conference on Language Resources and Evaluation (LREC), 2019
Rosana Ardila
Megan Branson
Kelly Davis
Michael Henretty
M. Kohler
Josh Meyer
Reuben Morais
Lindsay Saunders
Francis M. Tyers
Gregor Weber
VLM
343
2,070
0
13 Dec 2019
Neural Voice Puppetry: Audio-driven Facial Reenactment
European Conference on Computer Vision (ECCV), 2019
Justus Thies
Mohamed A. Elgharib
A. Tewari
Christian Theobalt
Matthias Nießner
VGen3DH
294
423
0
11 Dec 2019
SpecAugment on Large Scale Datasets
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Daniel S. Park
Yu Zhang
Chung-Cheng Chiu
Youzheng Chen
Yue Liu
William Chan
Quoc V. Le
Yonghui Wu
172
154
0
11 Dec 2019
Effective Data Augmentation Approaches to End-to-End Task-Oriented Dialogue
International Conference on Asian Language Processing (IALP), 2019
Jun Quan
Deyi Xiong
114
18
0
05 Dec 2019
Scratch that! An Evolution-based Adversarial Attack against Neural Networks
Malhar Jere
Loris Rossi
Briland Hitaj
Gabriela F. Cretu-Ciocarlie
Giacomo Boracchi
F. Koushanfar
AAML
190
18
0
05 Dec 2019
Topic-aware chatbot using Recurrent Neural Networks and Nonnegative Matrix Factorization
Yuchen Guo
Nicholas Hanoian
Zhexiao Lin
Nicholas Liskij
Hanbaek Lyu
...
Jiahao Qu
Henry Sojico
Yuliang Wang
Zhe Xiong
Zhenhong Zou
229
2
0
01 Dec 2019
Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition
Interspeech (Interspeech), 2019
Chao Weng
Chengzhu Yu
Jia Cui
Chunlei Zhang
Dong Yu
272
40
0
28 Nov 2019
Stage-based Hyper-parameter Optimization for Deep Learning
Ahnjae Shin
Dongjin Shin
Sungwoo Cho
Do Yoon Kim
Eunji Jeong
Gyeong-In Yu
Byung-Gon Chun
80
4
0
24 Nov 2019
Universal adversarial examples in speech command classification
Jon Vadillo
Roberto Santana
AAML
200
30
0
22 Nov 2019
DermGAN: Synthetic Generation of Clinical Skin Images with Pathology
Amirata Ghorbani
Vivek Natarajan
David Coz
Yuan Liu
GANMedIm
171
110
0
20 Nov 2019
Generate (non-software) Bugs to Fool Classifiers
AAAI Conference on Artificial Intelligence (AAAI), 2019
Hiromu Yakura
Youhei Akimoto
Jun Sakuma
AAML
82
10
0
20 Nov 2019
A novel method for identifying the deep neural network model with the Serial Number
Xiangrui Xu
Yaqin Li
Cao Yuan
AAML
85
8
0
19 Nov 2019
Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models
Siddharth Dalmia
Abdel-rahman Mohamed
M. Lewis
Florian Metze
Luke Zettlemoyer
163
12
0
09 Nov 2019
Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems
IEEE Symposium on Security and Privacy (IEEE S&P), 2019
Guangke Chen
Sen Chen
Lingling Fan
Xiaoning Du
Zhe Zhao
Fu Song
Yang Liu
AAML
262
225
0
03 Nov 2019
Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?
AAAI Conference on Artificial Intelligence (AAAI), 2019
Bhavya Ghai
Buvana Ramanan
Klaus Mueller
92
1
0
29 Oct 2019
Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
T. Nguyen
S. Stueker
Jan Niehues
A. Waibel
235
104
0
29 Oct 2019
Meta Learning for End-to-End Low-Resource Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
Jui-Yang Hsu
Yuan-Jui Chen
Hung-yi Lee
116
114
0
26 Oct 2019
Recognizing long-form speech using streaming end-to-end models
Automatic Speech Recognition & Understanding (ASRU), 2019
A. Narayanan
Rohit Prabhavalkar
Chung-Cheng Chiu
David Rybach
Tara N. Sainath
Trevor Strohman
172
135
0
24 Oct 2019
AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks
Sherif Abdulatif
Karim Armanious
Karim Guirguis
Jayasankar T. Sajeev
Bin Yang
GAN
140
0
0
21 Oct 2019
End-to-End Speech Recognition: A review for the French Language
Florian Boyer
Jean-Luc Rouas
AI4TS
153
10
0
18 Oct 2019
Hear "No Evil", See "Kenansville": Efficient and Transferable Black-Box Attacks on Speech Recognition and Voice Identification Systems
H. Abdullah
Muhammad Sajidur Rahman
Washington Garcia
Logan Blue
Kevin Warren
Anurag Swarnim Yadav
T. Shrimpton
Patrick Traynor
AAML
141
95
0
11 Oct 2019
Animating Face using Disentangled Audio Representations
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2019
Gaurav Mittal
Baoyuan Wang
CVBM
182
46
0
02 Oct 2019
Addressing Failure Prediction by Learning Model Confidence
Neural Information Processing Systems (NeurIPS), 2019
Charles Corbière
Nicolas Thome
Avner Bar-Hen
Matthieu Cord
P. Pérez
283
331
0
01 Oct 2019
RandAugment: Practical automated data augmentation with a reduced search space
E. D. Cubuk
Barret Zoph
Jonathon Shlens
Quoc V. Le
MQ
1.0K
3,948
0
30 Sep 2019
A Comparison of Hybrid and End-to-End Models for Syllable Recognition
International Conference on Text, Speech and Dialogue (TSD), 2019
Sebastian P. Bayerl
Korbinian Riedhammer
91
2
0
19 Sep 2019
Adversarial Attacks and Defenses in Images, Graphs and Text: A Review
International Journal of Automation and Computing (IJAC), 2019
Han Xu
Yao Ma
Haochen Liu
Debayan Deb
Hui Liu
Shucheng Zhou
Anil K. Jain
AAML
331
728
0
17 Sep 2019
Preech: A System for Privacy-Preserving Speech Transcription
USENIX Security Symposium (USENIX Security), 2019
Shimaa Ahmed
Amrita Roy Chowdhury
Kassem Fawaz
P. Ramanathan
371
50
0
09 Sep 2019
A Quantum Search Decoder for Natural Language Processing
Quantum Machine Intelligence (QMI), 2019
Johannes Bausch
Sathyawageeswar Subramanian
Stephen Piddock
203
17
0
09 Sep 2019
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units
International Symposium on High-Performance Computer Architecture (HPCA), 2019
Yujeong Choi
Minsoo Rhu
137
153
0
06 Sep 2019
Harnessing the Power of Deep Learning Methods in Healthcare: Neonatal Pain Assessment from Crying Sound
Md Sirajus Salekin
Ghada Zamzami
Rahul Paul
Dmitry Goldgof
R. Kasturi
T. Ho
Yu Sun
112
11
0
05 Sep 2019
Brain2Char: A Deep Architecture for Decoding Text from Brain Recordings
Journal of Neural Engineering (J. Neural Eng.), 2019
Pengfei Sun
Gopala K. Anumanchipalli
E. Chang
73
71
0
03 Sep 2019
Beyond Human-Level Accuracy: Computational Challenges in Deep Learning
ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPoPP), 2019
Joel Hestness
Newsha Ardalani
G. Diamos
110
79
0
03 Sep 2019
Metric Learning for Adversarial Robustness
Neural Information Processing Systems (NeurIPS), 2019
Chengzhi Mao
Ziyuan Zhong
Junfeng Yang
Carl Vondrick
Baishakhi Ray
OOD
320
199
0
03 Sep 2019
Smaller Models, Better Generalization
Mayank Sharma
Suraj Tripathi
Abhimanyu Dubey
Jayadeva Jayadeva
Sai Guruju
Nihal Goalla
83
1
0
29 Aug 2019
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
Interspeech (Interspeech), 2019
Pavel Denisov
Ngoc Thang Vu
129
27
0
13 Aug 2019
Universal Adversarial Audio Perturbations
Sajjad Abdoli
L. G. Hafemann
Jérôme Rony
Ismail Ben Ayed
P. Cardinal
Alessandro Lameiras Koerich
AAML
318
58
0
08 Aug 2019
Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition Systems
Asia-Pacific Computer Systems Architecture Conference (APCSAC), 2019
Lea Schonherr
Thorsten Eisenhofer
Steffen Zeiler
Thorsten Holz
D. Kolossa
AAML
356
70
0
05 Aug 2019
Machine Learning at the Network Edge: A Survey
ACM Computing Surveys (ACM CSUR), 2019
M. G. Sarwar Murshed
Chris Murphy
Daqing Hou
Nazar Khan
Ganesh Ananthanarayanan
Faraz Hussain
643
461
0
31 Jul 2019