ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1412.5567
  4. Cited By
Deep Speech: Scaling up end-to-end speech recognition
v1v2 (latest)

Deep Speech: Scaling up end-to-end speech recognition

17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
ArXiv (abs)PDFHTML

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 768 papers shown
Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE)
  for Speech Feature Enhancement
Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature EnhancementApplied Acoustics (Appl. Acoust.), 2019
Alzahra Badi
Sangwook Park
D. Han
Hanseok Ko
116
7
0
26 Jul 2019
A system of different layers of abstraction for artificial intelligence
A system of different layers of abstraction for artificial intelligence
Alexander Serb
T. Prodromakis
AI4CE
70
7
0
22 Jul 2019
A semi-holographic hyperdimensional representation system for
  hardware-friendly cognitive computing
A semi-holographic hyperdimensional representation system for hardware-friendly cognitive computing
Alexandrou Serb
I. Kobyzev
Jiaqi Wang
T. Prodromakis
160
4
0
12 Jul 2019
Fine-grained robust prosody transfer for single-speaker neural
  text-to-speech
Fine-grained robust prosody transfer for single-speaker neural text-to-speechInterspeech (Interspeech), 2019
V. Klimkov
S. Ronanki
Jonas Rohnke
Thomas Drugman
AI4TS
178
85
0
04 Jul 2019
Towards Interpretable Deep Extreme Multi-label Learning
Towards Interpretable Deep Extreme Multi-label LearningIEEE International Conference on Information Reuse and Integration (IRI), 2019
Yihuang Kang
I-Ling Cheng
W. Mao
Bowen Kuo
Pei-Ju Lee
100
0
0
03 Jul 2019
Themis: Fair and Efficient GPU Cluster Scheduling
Themis: Fair and Efficient GPU Cluster SchedulingSymposium on Networked Systems Design and Implementation (NSDI), 2019
Kshiteej S. Mahajan
Arjun Balasubramanian
Arjun Singhvi
Shivaram Venkataraman
Aditya Akella
Amar Phanishayee
Shuchi Chawla
161
218
0
02 Jul 2019
Gated Embeddings in End-to-End Speech Recognition for
  Conversational-Context Fusion
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context FusionAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Suyoun Kim
Siddharth Dalmia
Florian Metze
172
24
0
27 Jun 2019
Unsupervised Phoneme and Word Discovery from Multiple Speakers using
  Double Articulation Analyzer and Neural Network with Parametric Bias
Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric BiasFrontiers in Robotics and AI (Front. Robot. AI), 2019
Ryo Nakashima
Ryo Ozaki
T. Taniguchi
157
6
0
21 Jun 2019
On the Robustness of the Backdoor-based Watermarking in Deep Neural
  Networks
On the Robustness of the Backdoor-based Watermarking in Deep Neural Networks
Masoumeh Shafieinejad
Jiaqi Wang
Nils Lukas
Xinda Li
Florian Kerschbaum
AAML
136
8
0
18 Jun 2019
Curriculum-based transfer learning for an effective end-to-end spoken
  language understanding and domain portability
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portabilityInterspeech (Interspeech), 2019
Antoine Caubrière
N. Tomashenko
Antoine Laurent
Emmanuel Morin
Nathalie Camelin
Yannick Esteve
124
57
0
18 Jun 2019
Deep Xi as a Front-End for Robust Automatic Speech Recognition
Deep Xi as a Front-End for Robust Automatic Speech Recognition
Aaron Nicolson
K. Paliwal
120
12
0
18 Jun 2019
Perceptual Based Adversarial Audio Attacks
Perceptual Based Adversarial Audio Attacks
Joseph Szurley
J. Zico Kolter
AAML
109
25
0
14 Jun 2019
Selfie: Self-supervised Pretraining for Image Embedding
Selfie: Self-supervised Pretraining for Image Embedding
Trieu H. Trinh
Minh-Thang Luong
Quoc V. Le
SSL
290
116
0
07 Jun 2019
The Architectural Implications of Facebook's DNN-based Personalized
  Recommendation
The Architectural Implications of Facebook's DNN-based Personalized RecommendationInternational Symposium on High-Performance Computer Architecture (HPCA), 2019
Udit Gupta
Carole-Jean Wu
Xiaodong Wang
Maxim Naumov
Brandon Reagen
...
Andrey Malevich
Dheevatsa Mudigere
M. Smelyanskiy
Liang Xiong
Xuan Zhang
GNN
317
315
0
06 Jun 2019
Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty
  and Adversarial Robustness
Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial RobustnessNeural Information Processing Systems (NeurIPS), 2019
A. Malinin
Mark Gales
UQCVAAML
261
202
0
31 May 2019
Speaker Anonymization Using X-vector and Neural Waveform Models
Speaker Anonymization Using X-vector and Neural Waveform ModelsSpeech Synthesis Workshop (SSW), 2019
Fuming Fang
Xin Wang
Junichi Yamagishi
Isao Echizen
Massimiliano Todisco
Nicholas W. D. Evans
J. Bonastre
154
157
0
30 May 2019
Mixed Precision Training With 8-bit Floating Point
Mixed Precision Training With 8-bit Floating Point
Naveen Mellempudi
Sudarshan Srinivasan
Dipankar Das
Bharat Kaul
MQ
174
73
0
29 May 2019
Local Label Propagation for Large-Scale Semi-Supervised Learning
Local Label Propagation for Large-Scale Semi-Supervised Learning
Chengxu Zhuang
Xuehao Ding
Divyanshu Murli
Daniel L. K. Yamins
SSL
108
13
0
28 May 2019
NTP : A Neural Network Topology Profiler
NTP : A Neural Network Topology Profiler
Raghavendra Bhat
Pravin Chandran
Juby Jose
Viswanath Dibbur
Prakash Sirra Ajith
125
2
0
22 May 2019
Acoustic-to-Word Models with Conversational Context Information
Acoustic-to-Word Models with Conversational Context InformationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2019
Suyoun Kim
Florian Metze
157
7
0
21 May 2019
Universal Adversarial Perturbations for Speech Recognition Systems
Universal Adversarial Perturbations for Speech Recognition SystemsInterspeech (Interspeech), 2019
Paarth Neekhara
Shehzeen Samarah Hussain
Prakhar Pandey
Shlomo Dubnov
Julian McAuley
F. Koushanfar
AAML
142
128
0
09 May 2019
Capture, Learning, and Synthesis of 3D Speaking Styles
Capture, Learning, and Synthesis of 3D Speaking StylesComputer Vision and Pattern Recognition (CVPR), 2019
Daniel Cudeiro
Timo Bolkart
Cassidy Laidlaw
Anurag Ranjan
Michael J. Black
CVBM3DH
285
399
0
08 May 2019
Transparent pronunciation scoring using articulatorily weighted phoneme
  edit distance
Transparent pronunciation scoring using articulatorily weighted phoneme edit distanceInterspeech (Interspeech), 2019
Reima Karhila
Anna-Riikka Smolander
Sari Ylinen
M. Kurimo
46
15
0
07 May 2019
Ensemble Distribution Distillation
Ensemble Distribution DistillationInternational Conference on Learning Representations (ICLR), 2019
A. Malinin
Bruno Mlodozeniec
Mark Gales
UQCV
504
262
0
30 Apr 2019
Unsupervised Data Augmentation for Consistency Training
Unsupervised Data Augmentation for Consistency TrainingNeural Information Processing Systems (NeurIPS), 2019
Qizhe Xie
Zihang Dai
Eduard H. Hovy
Minh-Thang Luong
Quoc V. Le
793
2,537
0
29 Apr 2019
Transformers with convolutional context for ASR
Transformers with convolutional context for ASR
Abdel-rahman Mohamed
Dmytro Okhonko
Luke Zettlemoyer
192
172
0
26 Apr 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech
  Recognition
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
518
3,832
0
18 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and
  Knowledge Distillation
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
153
48
0
17 Apr 2019
Adversarial Audio: A New Information Hiding Method and Backdoor for
  DNN-based Speech Recognition Models
Adversarial Audio: A New Information Hiding Method and Backdoor for DNN-based Speech Recognition Models
Yehao Kong
Jiliang Zhang
85
30
0
08 Apr 2019
Measuring scheduling efficiency of RNNs for NLP applications
Measuring scheduling efficiency of RNNs for NLP applications
Urmish Thakker
Ganesh S. Dasika
Jesse G. Beu
Matthew Mattina
105
14
0
05 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable
  Convolutions
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
183
105
0
04 Apr 2019
RAPID: Early Classification of Explosive Transients using Deep Learning
RAPID: Early Classification of Explosive Transients using Deep Learning
D. Muthukrishna
G. Narayan
K. Mandel
R. Biswas
R. Hložek
160
122
0
29 Mar 2019
Local Aggregation for Unsupervised Learning of Visual Embeddings
Local Aggregation for Unsupervised Learning of Visual Embeddings
Chengxu Zhuang
Alex Zhai
Daniel L. K. Yamins
SSL
256
460
0
29 Mar 2019
Grammatical Error Correction and Style Transfer via Zero-shot
  Monolingual Translation
Grammatical Error Correction and Style Transfer via Zero-shot Monolingual Translation
Elizaveta Korotkova
Agnes Luhtaru
Maksym Del
Krista Liin
Daiga Deksne
Mark Fishel
148
12
0
27 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End
  Speech Recognition
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
113
18
0
27 Mar 2019
Practical Hidden Voice Attacks against Speech and Speaker Recognition
  Systems
Practical Hidden Voice Attacks against Speech and Speaker Recognition SystemsNetwork and Distributed System Security Symposium (NDSS), 2019
H. Abdullah
Washington Garcia
Christian Peeters
Patrick Traynor
Kevin R. B. Butler
Joseph N. Wilson
AAML
152
177
0
18 Mar 2019
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Yangyang Shi
M. Hwang
X. Lei
AI4TS
102
15
0
12 Mar 2019
Source codes in human communication
Source codes in human communication
Michael Ramscar
49
12
0
08 Mar 2019
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition
  from YouTube Videos
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube VideosConference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Egor Lakomkin
S. Magg
C. Weber
S. Wermter
113
20
0
01 Mar 2019
Incorporating End-to-End Speech Recognition Models for Sentiment
  Analysis
Incorporating End-to-End Speech Recognition Models for Sentiment AnalysisIEEE International Conference on Robotics and Automation (ICRA), 2019
Egor Lakomkin
M. Zamani
C. Weber
S. Magg
S. Wermter
154
24
0
28 Feb 2019
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting
An Optimized Recurrent Unit for Ultra-Low-Power Keyword SpottingProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2019
Justice Amoh
K. Odame
173
19
0
13 Feb 2019
Salus: Fine-Grained GPU Sharing Primitives for Deep Learning
  Applications
Salus: Fine-Grained GPU Sharing Primitives for Deep Learning ApplicationsConference on Machine Learning and Systems (MLSys), 2019
Peifeng Yu
Mosharaf Chowdhury
162
79
0
12 Feb 2019
Hardware-Guided Symbiotic Training for Compact, Accurate, yet
  Execution-Efficient LSTM
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM
Hongxu Yin
Guoyang Chen
Yingmin Li
Shuai Che
Weifeng Zhang
N. Jha
105
10
0
30 Jan 2019
Weighted-Sampling Audio Adversarial Example Attack
Weighted-Sampling Audio Adversarial Example Attack
Xiaolei Liu
Xiaosong Zhang
Kun Wan
Qingxin Zhu
Yufei Ding
DiffMAAML
220
39
0
26 Jan 2019
SirenAttack: Generating Adversarial Audio for End-to-End Acoustic Systems
Tianyu Du
S. Ji
Jinfeng Li
Qinchen Gu
Ting Wang
Jiliang Li
AAML
247
146
0
23 Jan 2019
Self-Attention Networks for Connectionist Temporal Classification in
  Speech Recognition
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Julian Salazar
Katrin Kirchhoff
Zhiheng Huang
AI4TS
247
123
0
22 Jan 2019
Robust Watermarking of Neural Network with Exponential Weighting
Robust Watermarking of Neural Network with Exponential Weighting
Ryota Namba
Jun Sakuma
AAML
160
151
0
18 Jan 2019
Prototypical Metric Transfer Learning for Continuous Speech Keyword
  Spotting With Limited Training Data
Prototypical Metric Transfer Learning for Continuous Speech Keyword Spotting With Limited Training Data
Harshita Seth
Pulkit Kumar
Muktabh Mayank Srivastava
184
13
0
12 Jan 2019
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Amit Das
Jinyu Li
Guoli Ye
Rui Zhao
Jiawei Liu
132
26
0
31 Dec 2018
Stanza: Layer Separation for Distributed Training in Deep Learning
Stanza: Layer Separation for Distributed Training in Deep Learning
Xiaorui Wu
Hongao Xu
Bo Li
Y. Xiong
MoE
125
9
0
27 Dec 2018
Previous
123...101112...141516
Next