Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1412.5567
Cited By
v1
v2 (latest)
Deep Speech: Scaling up end-to-end speech recognition
17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Speech: Scaling up end-to-end speech recognition"
50 / 770 papers shown
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition
Tomohiro Tanaka
Ryo Masumura
Mana Ihori
Akihiko Takashima
Takafumi Moriya
Takanori Ashihara
Shota Orihashi
Naoki Makishima
120
8
0
04 Jul 2021
CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription
Nikita Pavlichenko
Ivan Stelmakh
Dmitry Ustalov
144
20
0
02 Jul 2021
Realtime Robust Malicious Traffic Detection via Frequency Domain Analysis
Conference on Computer and Communications Security (CCS), 2021
Chuanpu Fu
Qi Li
Meng Shen
Ke Xu
AAML
148
207
0
28 Jun 2021
Towards Model-informed Precision Dosing with Expert-in-the-loop Machine Learning
IEEE International Conference on Information Reuse and Integration (IRI), 2021
Yihuang Kang
Y. Chiu
Ming-Yen Lin
F. Su
Sheng-Tai Huang
127
2
0
28 Jun 2021
Open, Sesame! Introducing Access Control to Voice Services
Dominika Woszczyk
Alvin Lee
Soteris Demetriou
AAML
86
1
0
27 Jun 2021
Accelerating Recurrent Neural Networks for Gravitational Wave Experiments
IEEE International Conference on Application-Specific Systems, Architectures, and Processors (ASAP), 2021
Zhiqiang Que
Erwei Wang
Umar Marikar
Eric A. Moreno
J. Ngadiuba
...
Vladimir Loncar
S. Summers
M. Pierini
P. Cheung
Wayne Luk
196
31
0
26 Jun 2021
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training
Neural Information Processing Systems (NeurIPS), 2021
Anup Sarma
Sonali Singh
Huaipan Jiang
Rui Zhang
M. Kandemir
Chita R. Das
100
1
0
22 Jun 2021
Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Gaurav Menghani
VLM
MedIm
272
528
0
16 Jun 2021
Dialectal Speech Recognition and Translation of Swiss German Speech to Standard German Text: Microsoft's Submission to SwissText 2021
Yuriy Arabskyy
Aashish Agarwal
S. Dey
Oscar Koller
111
12
0
15 Jun 2021
Break-It-Fix-It: Unsupervised Learning for Program Repair
International Conference on Machine Learning (ICML), 2021
Michihiro Yasunaga
Abigail Z. Jacobs
240
121
0
11 Jun 2021
Handcrafted Backdoors in Deep Neural Networks
Neural Information Processing Systems (NeurIPS), 2021
Sanghyun Hong
Nicholas Carlini
Alexey Kurakin
233
89
0
08 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
295
904
0
08 Jun 2021
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting Normalization
Computer Vision and Pattern Recognition (CVPR), 2021
A. Lahiri
Vivek Kwatra
C. Frueh
J. P. Lewis
C. Bregler
3DH
206
111
0
08 Jun 2021
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Interspeech (Interspeech), 2021
Zhong Meng
Yu-Huan Wu
Naoyuki Kanda
Liang Lu
Xie Chen
Guoli Ye
Eric Sun
Jinyu Li
Jiawei Liu
MoMe
146
22
0
04 Jun 2021
An Improved Model for Voicing Silent Speech
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
David Gaddy
Dana Klein
228
42
0
03 Jun 2021
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAML
SSL
257
37
0
01 Jun 2021
Multi-Modal Semantic Inconsistency Detection in Social Media News Posts
Conference on Multimedia Modeling (MMM), 2021
S. McCrae
Kehan Wang
A. Zakhor
147
16
0
26 May 2021
See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization
Knowledge-Based Systems (KBS), 2021
Yash Kumar Atri
Shraman Pramanick
Vikram Goyal
Tanmoy Chakraborty
222
43
0
20 May 2021
Unsupervised Discriminative Learning of Sounds for Audio Event Classification
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Sascha Hornauer
Ke Li
Stella X. Yu
Shabnam Ghaffarzadegan
Liu Ren
SSL
117
5
0
19 May 2021
Privacy Inference Attacks and Defenses in Cloud-based Deep Neural Network: A Survey
Xiaoyu Zhang
Chao Chen
Yi Xie
Xiaofeng Chen
Jun Zhang
Yang Xiang
FedML
118
8
0
13 May 2021
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition
Khin Me Me Chit
Laet Laet Lin
108
4
0
13 May 2021
PIM-DRAM: Accelerating Machine Learning Workloads using Processing in Commodity DRAM
IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS), 2021
Sourjya Roy
M. Ali
A. Raghunathan
93
24
0
08 May 2021
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect
Binbin Xu
Chongyang Tao
Z. Feng
Youssef Raqui
Sylvie Ranwez
161
17
0
07 May 2021
Pervasive AI for IoT applications: A Survey on Resource-efficient Distributed Artificial Intelligence
IEEE Communications Surveys and Tutorials (COMST), 2021
Emna Baccour
N. Mhaisen
A. Abdellatif
A. Erbad
Amr M. Mohamed
Mounir Hamdi
Mohsen Guizani
354
125
0
04 May 2021
On the limit of English conversational speech recognition
Interspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
183
53
0
03 May 2021
RotLSTM: Rotating Memories in Recurrent Neural Networks
Vlad Velici
Adam Prugel-Bennett
RALM
VLM
254
1
0
01 May 2021
Adversarial Example Detection for DNN Models: A Review and Experimental Comparison
Artificial Intelligence Review (AIR), 2021
Ahmed Aldahdooh
W. Hamidouche
Sid Ahmed Fezza
Olivier Déforges
AAML
697
161
0
01 May 2021
End-to-End Speech Recognition from Federated Acoustic Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yan Gao
Titouan Parcollet
Salah Zaiem
Javier Fernandez-Marques
Pedro Porto Buarque de Gusmão
Daniel J. Beutel
Nicholas D. Lane
206
46
0
29 Apr 2021
NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization
Journal of machine learning research (JMLR), 2019
Ali Ramezani-Kebrya
Fartash Faghri
Ilya Markov
V. Aksenov
Dan Alistarh
Daniel M. Roy
MQ
215
34
0
28 Apr 2021
3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head
Qianyun Wang
Zhenfeng Fan
Shi-hong Xia
3DH
203
21
0
25 Apr 2021
Quantization of Deep Neural Networks for Accurate Edge Computing
ACM Journal on Emerging Technologies in Computing Systems (JETC), 2021
Wentao Chen
Hailong Qiu
Zhuang Jian
Chutong Zhang
Yu Hu
Qing Lu
Tianchen Wang
Yiyu Shi
Meiping Huang
Xiaowe Xu
230
31
0
25 Apr 2021
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network
Interspeech (Interspeech), 2021
Janne Pylkkönen
Antti Ukkonen
Juho Kilpikoski
Samu Tamminen
Hannes Heikinheimo
181
28
0
22 Apr 2021
Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems
Mimansa Jaiswal
E. Provost
108
0
0
18 Apr 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
IEEE International Conference on Computer Vision (ICCV), 2021
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando de la Torre
Yaser Sheikh
CVBM
351
251
0
16 Apr 2021
A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Trung D. Q. Dang
Om Thakkar
Swaroop Indra Ramaswamy
Rajiv Mathews
Peter Chin
Franccoise Beaufays
FedML
96
10
0
15 Apr 2021
A Toolbox for Construction and Analysis of Speech Datasets
Evelina Bakhturina
Vitaly Lavrukhin
Boris Ginsburg
148
13
0
11 Apr 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization
Interspeech (Interspeech), 2021
Zhengkun Tian
Jiangyan Yi
Ye Bai
Jianhua Tao
Shuai Zhang
Zhengqi Wen
83
19
0
07 Apr 2021
Visual Alignment Constraint for Continuous Sign Language Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Yuecong Min
Aiming Hao
Xiujuan Chai
Xilin Chen
SLR
191
196
0
06 Apr 2021
Intent Recognition and Unsupervised Slot Identification for Low Resourced Spoken Dialog Systems
Automatic Speech Recognition & Understanding (ASRU), 2021
Akshat Gupta
Olivia Deng
Akruti Kushwaha
Saloni Mittal
William Zeng
Sai Krishna Rallabandi
A. Black
198
7
0
03 Apr 2021
TRS: Transferability Reduced Ensemble via Encouraging Gradient Diversity and Model Smoothness
Neural Information Processing Systems (NeurIPS), 2021
Zhuolin Yang
Linyi Li
Xiaojun Xu
Shiliang Zuo
Qiang Chen
Benjamin I. P. Rubinstein
Pan Zhou
Ce Zhang
Yue Liu
AAML
248
65
0
01 Apr 2021
Comparison of different convolutional neural network activation functions and methods for building ensembles
L. Nanni
Gianluca Maguolo
S. Brahnam
M. Paci
177
8
0
29 Mar 2021
Are all outliers alike? On Understanding the Diversity of Outliers for Detecting OODs
R. Kaur
Susmit Jha
Anirban Roy
O. Sokolsky
Insup Lee
149
14
0
23 Mar 2021
Federated Quantum Machine Learning
Entropy (Entropy), 2021
Samuel Yen-Chi Chen
Shinjae Yoo
FedML
AI4CE
187
158
0
22 Mar 2021
Digital Peter: Dataset, Competition and Handwriting Recognition Methods
M. Potanin
Denis Dimitrov
Alex Shonenkov
Vladimir Bataev
Denis Karachev
Maxim Novopoltsev
158
10
0
16 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
Bonaventure F. P. Dossou
Chris C. Emezue
179
18
0
13 Mar 2021
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition
IEEE Transactions on Affective Computing (TAC), 2021
Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schuller
162
70
0
10 Mar 2021
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges
ACM Computing Surveys (CSUR), 2021
Yoshitomo Matsubara
Marco Levorato
Francesco Restuccia
404
276
0
08 Mar 2021
WaveGuard: Understanding and Mitigating Audio Adversarial Examples
USENIX Security Symposium (USENIX Security), 2021
Shehzeen Samarah Hussain
Paarth Neekhara
Shlomo Dubnov
Julian McAuley
F. Koushanfar
AAML
159
83
0
04 Mar 2021
A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box Optimization
International Conference on Machine Learning (ICML), 2021
HanQin Cai
Y. Lou
Daniel McKenzie
W. Yin
261
56
0
21 Feb 2021
Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation
Elizabeth Fons
Paula Dawson
Xiao-Jun Zeng
J. Keane
Alexandros Iosifidis
AI4TS
116
27
0
16 Feb 2021
Previous
1
2
3
...
6
7
8
...
14
15
16
Next