Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1412.5567
Cited By
v1
v2 (latest)
Deep Speech: Scaling up end-to-end speech recognition
17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Speech: Scaling up end-to-end speech recognition"
50 / 768 papers shown
Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature Enhancement
Applied Acoustics (Appl. Acoust.), 2019
Alzahra Badi
Sangwook Park
D. Han
Hanseok Ko
116
7
0
26 Jul 2019
A system of different layers of abstraction for artificial intelligence
Alexander Serb
T. Prodromakis
AI4CE
70
7
0
22 Jul 2019
A semi-holographic hyperdimensional representation system for hardware-friendly cognitive computing
Alexandrou Serb
I. Kobyzev
Jiaqi Wang
T. Prodromakis
160
4
0
12 Jul 2019
Fine-grained robust prosody transfer for single-speaker neural text-to-speech
Interspeech (Interspeech), 2019
V. Klimkov
S. Ronanki
Jonas Rohnke
Thomas Drugman
AI4TS
178
85
0
04 Jul 2019
Towards Interpretable Deep Extreme Multi-label Learning
IEEE International Conference on Information Reuse and Integration (IRI), 2019
Yihuang Kang
I-Ling Cheng
W. Mao
Bowen Kuo
Pei-Ju Lee
100
0
0
03 Jul 2019
Themis: Fair and Efficient GPU Cluster Scheduling
Symposium on Networked Systems Design and Implementation (NSDI), 2019
Kshiteej S. Mahajan
Arjun Balasubramanian
Arjun Singhvi
Shivaram Venkataraman
Aditya Akella
Amar Phanishayee
Shuchi Chawla
161
218
0
02 Jul 2019
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Suyoun Kim
Siddharth Dalmia
Florian Metze
172
24
0
27 Jun 2019
Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric Bias
Frontiers in Robotics and AI (Front. Robot. AI), 2019
Ryo Nakashima
Ryo Ozaki
T. Taniguchi
157
6
0
21 Jun 2019
On the Robustness of the Backdoor-based Watermarking in Deep Neural Networks
Masoumeh Shafieinejad
Jiaqi Wang
Nils Lukas
Xinda Li
Florian Kerschbaum
AAML
136
8
0
18 Jun 2019
Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portability
Interspeech (Interspeech), 2019
Antoine Caubrière
N. Tomashenko
Antoine Laurent
Emmanuel Morin
Nathalie Camelin
Yannick Esteve
124
57
0
18 Jun 2019
Deep Xi as a Front-End for Robust Automatic Speech Recognition
Aaron Nicolson
K. Paliwal
120
12
0
18 Jun 2019
Perceptual Based Adversarial Audio Attacks
Joseph Szurley
J. Zico Kolter
AAML
109
25
0
14 Jun 2019
Selfie: Self-supervised Pretraining for Image Embedding
Trieu H. Trinh
Minh-Thang Luong
Quoc V. Le
SSL
290
116
0
07 Jun 2019
The Architectural Implications of Facebook's DNN-based Personalized Recommendation
International Symposium on High-Performance Computer Architecture (HPCA), 2019
Udit Gupta
Carole-Jean Wu
Xiaodong Wang
Maxim Naumov
Brandon Reagen
...
Andrey Malevich
Dheevatsa Mudigere
M. Smelyanskiy
Liang Xiong
Xuan Zhang
GNN
317
315
0
06 Jun 2019
Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial Robustness
Neural Information Processing Systems (NeurIPS), 2019
A. Malinin
Mark Gales
UQCV
AAML
261
202
0
31 May 2019
Speaker Anonymization Using X-vector and Neural Waveform Models
Speech Synthesis Workshop (SSW), 2019
Fuming Fang
Xin Wang
Junichi Yamagishi
Isao Echizen
Massimiliano Todisco
Nicholas W. D. Evans
J. Bonastre
154
157
0
30 May 2019
Mixed Precision Training With 8-bit Floating Point
Naveen Mellempudi
Sudarshan Srinivasan
Dipankar Das
Bharat Kaul
MQ
174
73
0
29 May 2019
Local Label Propagation for Large-Scale Semi-Supervised Learning
Chengxu Zhuang
Xuehao Ding
Divyanshu Murli
Daniel L. K. Yamins
SSL
108
13
0
28 May 2019
NTP : A Neural Network Topology Profiler
Raghavendra Bhat
Pravin Chandran
Juby Jose
Viswanath Dibbur
Prakash Sirra Ajith
125
2
0
22 May 2019
Acoustic-to-Word Models with Conversational Context Information
North American Chapter of the Association for Computational Linguistics (NAACL), 2019
Suyoun Kim
Florian Metze
157
7
0
21 May 2019
Universal Adversarial Perturbations for Speech Recognition Systems
Interspeech (Interspeech), 2019
Paarth Neekhara
Shehzeen Samarah Hussain
Prakhar Pandey
Shlomo Dubnov
Julian McAuley
F. Koushanfar
AAML
142
128
0
09 May 2019
Capture, Learning, and Synthesis of 3D Speaking Styles
Computer Vision and Pattern Recognition (CVPR), 2019
Daniel Cudeiro
Timo Bolkart
Cassidy Laidlaw
Anurag Ranjan
Michael J. Black
CVBM
3DH
285
399
0
08 May 2019
Transparent pronunciation scoring using articulatorily weighted phoneme edit distance
Interspeech (Interspeech), 2019
Reima Karhila
Anna-Riikka Smolander
Sari Ylinen
M. Kurimo
46
15
0
07 May 2019
Ensemble Distribution Distillation
International Conference on Learning Representations (ICLR), 2019
A. Malinin
Bruno Mlodozeniec
Mark Gales
UQCV
504
262
0
30 Apr 2019
Unsupervised Data Augmentation for Consistency Training
Neural Information Processing Systems (NeurIPS), 2019
Qizhe Xie
Zihang Dai
Eduard H. Hovy
Minh-Thang Luong
Quoc V. Le
793
2,537
0
29 Apr 2019
Transformers with convolutional context for ASR
Abdel-rahman Mohamed
Dmytro Okhonko
Luke Zettlemoyer
192
172
0
26 Apr 2019
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Daniel S. Park
William Chan
Yu Zhang
Chung-Cheng Chiu
Barret Zoph
E. D. Cubuk
Quoc V. Le
VLM
518
3,832
0
18 Apr 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
Gakuto Kurata
Kartik Audhkhasi
153
48
0
17 Apr 2019
Adversarial Audio: A New Information Hiding Method and Backdoor for DNN-based Speech Recognition Models
Yehao Kong
Jiliang Zhang
85
30
0
08 Apr 2019
Measuring scheduling efficiency of RNNs for NLP applications
Urmish Thakker
Ganesh S. Dasika
Jesse G. Beu
Matthew Mattina
105
14
0
05 Apr 2019
Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions
Awni Y. Hannun
Ann Lee
Qiantong Xu
R. Collobert
183
105
0
04 Apr 2019
RAPID: Early Classification of Explosive Transients using Deep Learning
D. Muthukrishna
G. Narayan
K. Mandel
R. Biswas
R. Hložek
160
122
0
29 Mar 2019
Local Aggregation for Unsupervised Learning of Visual Embeddings
Chengxu Zhuang
Alex Zhai
Daniel L. K. Yamins
SSL
256
460
0
29 Mar 2019
Grammatical Error Correction and Style Transfer via Zero-shot Monolingual Translation
Elizaveta Korotkova
Agnes Luhtaru
Maksym Del
Krista Liin
Daiga Deksne
Mark Fishel
148
12
0
27 Mar 2019
Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
113
18
0
27 Mar 2019
Practical Hidden Voice Attacks against Speech and Speaker Recognition Systems
Network and Distributed System Security Symposium (NDSS), 2019
H. Abdullah
Washington Garcia
Christian Peeters
Patrick Traynor
Kevin R. B. Butler
Joseph N. Wilson
AAML
152
177
0
18 Mar 2019
End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model
Yangyang Shi
M. Hwang
X. Lei
AI4TS
102
15
0
12 Mar 2019
Source codes in human communication
Michael Ramscar
49
12
0
08 Mar 2019
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018
Egor Lakomkin
S. Magg
C. Weber
S. Wermter
113
20
0
01 Mar 2019
Incorporating End-to-End Speech Recognition Models for Sentiment Analysis
IEEE International Conference on Robotics and Automation (ICRA), 2019
Egor Lakomkin
M. Zamani
C. Weber
S. Magg
S. Wermter
154
24
0
28 Feb 2019
An Optimized Recurrent Unit for Ultra-Low-Power Keyword Spotting
Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2019
Justice Amoh
K. Odame
173
19
0
13 Feb 2019
Salus: Fine-Grained GPU Sharing Primitives for Deep Learning Applications
Conference on Machine Learning and Systems (MLSys), 2019
Peifeng Yu
Mosharaf Chowdhury
162
79
0
12 Feb 2019
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM
Hongxu Yin
Guoyang Chen
Yingmin Li
Shuai Che
Weifeng Zhang
N. Jha
105
10
0
30 Jan 2019
Weighted-Sampling Audio Adversarial Example Attack
Xiaolei Liu
Xiaosong Zhang
Kun Wan
Qingxin Zhu
Yufei Ding
DiffM
AAML
220
39
0
26 Jan 2019
SirenAttack: Generating Adversarial Audio for End-to-End Acoustic Systems
Tianyu Du
S. Ji
Jinfeng Li
Qinchen Gu
Ting Wang
Jiliang Li
AAML
247
146
0
23 Jan 2019
Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition
Julian Salazar
Katrin Kirchhoff
Zhiheng Huang
AI4TS
247
123
0
22 Jan 2019
Robust Watermarking of Neural Network with Exponential Weighting
Ryota Namba
Jun Sakuma
AAML
160
151
0
18 Jan 2019
Prototypical Metric Transfer Learning for Continuous Speech Keyword Spotting With Limited Training Data
Harshita Seth
Pulkit Kumar
Muktabh Mayank Srivastava
184
13
0
12 Jan 2019
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units
Amit Das
Jinyu Li
Guoli Ye
Rui Zhao
Jiawei Liu
132
26
0
31 Dec 2018
Stanza: Layer Separation for Distributed Training in Deep Learning
Xiaorui Wu
Hongao Xu
Bo Li
Y. Xiong
MoE
125
9
0
27 Dec 2018
Previous
1
2
3
...
10
11
12
...
14
15
16
Next