ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1412.5567
  4. Cited By
Deep Speech: Scaling up end-to-end speech recognition
v1v2 (latest)

Deep Speech: Scaling up end-to-end speech recognition

17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
ArXiv (abs)PDFHTML

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 770 papers shown
Cross-Modal Transformer-Based Neural Correction Models for Automatic
  Speech Recognition
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition
Tomohiro Tanaka
Ryo Masumura
Mana Ihori
Akihiko Takashima
Takafumi Moriya
Takanori Ashihara
Shota Orihashi
Naoki Makishima
120
8
0
04 Jul 2021
CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio
  Transcription
CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription
Nikita Pavlichenko
Ivan Stelmakh
Dmitry Ustalov
144
20
0
02 Jul 2021
Realtime Robust Malicious Traffic Detection via Frequency Domain
  Analysis
Realtime Robust Malicious Traffic Detection via Frequency Domain AnalysisConference on Computer and Communications Security (CCS), 2021
Chuanpu Fu
Qi Li
Meng Shen
Ke Xu
AAML
148
207
0
28 Jun 2021
Towards Model-informed Precision Dosing with Expert-in-the-loop Machine
  Learning
Towards Model-informed Precision Dosing with Expert-in-the-loop Machine LearningIEEE International Conference on Information Reuse and Integration (IRI), 2021
Yihuang Kang
Y. Chiu
Ming-Yen Lin
F. Su
Sheng-Tai Huang
127
2
0
28 Jun 2021
Open, Sesame! Introducing Access Control to Voice Services
Open, Sesame! Introducing Access Control to Voice Services
Dominika Woszczyk
Alvin Lee
Soteris Demetriou
AAML
86
1
0
27 Jun 2021
Accelerating Recurrent Neural Networks for Gravitational Wave
  Experiments
Accelerating Recurrent Neural Networks for Gravitational Wave ExperimentsIEEE International Conference on Application-Specific Systems, Architectures, and Processors (ASAP), 2021
Zhiqiang Que
Erwei Wang
Umar Marikar
Eric A. Moreno
J. Ngadiuba
...
Vladimir Loncar
S. Summers
M. Pierini
P. Cheung
Wayne Luk
196
31
0
26 Jun 2021
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for
  Efficient Training
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient TrainingNeural Information Processing Systems (NeurIPS), 2021
Anup Sarma
Sonali Singh
Huaipan Jiang
Rui Zhang
M. Kandemir
Chita R. Das
100
1
0
22 Jun 2021
Efficient Deep Learning: A Survey on Making Deep Learning Models
  Smaller, Faster, and Better
Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Gaurav Menghani
VLMMedIm
272
528
0
16 Jun 2021
Dialectal Speech Recognition and Translation of Swiss German Speech to
  Standard German Text: Microsoft's Submission to SwissText 2021
Dialectal Speech Recognition and Translation of Swiss German Speech to Standard German Text: Microsoft's Submission to SwissText 2021
Yuriy Arabskyy
Aashish Agarwal
S. Dey
Oscar Koller
111
12
0
15 Jun 2021
Break-It-Fix-It: Unsupervised Learning for Program Repair
Break-It-Fix-It: Unsupervised Learning for Program RepairInternational Conference on Machine Learning (ICML), 2021
Michihiro Yasunaga
Abigail Z. Jacobs
240
121
0
11 Jun 2021
Handcrafted Backdoors in Deep Neural Networks
Handcrafted Backdoors in Deep Neural NetworksNeural Information Processing Systems (NeurIPS), 2021
Sanghyun Hong
Nicholas Carlini
Alexey Kurakin
233
89
0
08 Jun 2021
SpeechBrain: A General-Purpose Speech Toolkit
SpeechBrain: A General-Purpose Speech Toolkit
Mirco Ravanelli
Titouan Parcollet
Peter William VanHarn Plantinga
Aku Rouhe
Samuele Cornell
...
William Aris
Hwidong Na
Yan Gao
R. Mori
Yoshua Bengio
295
904
0
08 Jun 2021
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from
  Video using Pose and Lighting Normalization
LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting NormalizationComputer Vision and Pattern Recognition (CVPR), 2021
A. Lahiri
Vivek Kwatra
C. Frueh
J. P. Lewis
C. Bregler
3DH
206
111
0
08 Jun 2021
Minimum Word Error Rate Training with Language Model Fusion for
  End-to-End Speech Recognition
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech RecognitionInterspeech (Interspeech), 2021
Zhong Meng
Yu-Huan Wu
Naoyuki Kanda
Liang Lu
Xie Chen
Guoli Ye
Eric Sun
Jinyu Li
Jiawei Liu
MoMe
146
22
0
04 Jun 2021
An Improved Model for Voicing Silent Speech
An Improved Model for Voicing Silent SpeechAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
David Gaddy
Dana Klein
228
42
0
03 Jun 2021
Improving the Adversarial Robustness for Speaker Verification by
  Self-Supervised Learning
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised LearningIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021
Haibin Wu
Xu Li
Andy T. Liu
Zhiyong Wu
Helen Meng
Hung-yi Lee
AAMLSSL
257
37
0
01 Jun 2021
Multi-Modal Semantic Inconsistency Detection in Social Media News Posts
Multi-Modal Semantic Inconsistency Detection in Social Media News PostsConference on Multimedia Modeling (MMM), 2021
S. McCrae
Kehan Wang
A. Zakhor
147
16
0
26 May 2021
See, Hear, Read: Leveraging Multimodality with Guided Attention for
  Abstractive Text Summarization
See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text SummarizationKnowledge-Based Systems (KBS), 2021
Yash Kumar Atri
Shraman Pramanick
Vikram Goyal
Tanmoy Chakraborty
222
43
0
20 May 2021
Unsupervised Discriminative Learning of Sounds for Audio Event
  Classification
Unsupervised Discriminative Learning of Sounds for Audio Event ClassificationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Sascha Hornauer
Ke Li
Stella X. Yu
Shabnam Ghaffarzadegan
Liu Ren
SSL
117
5
0
19 May 2021
Privacy Inference Attacks and Defenses in Cloud-based Deep Neural
  Network: A Survey
Privacy Inference Attacks and Defenses in Cloud-based Deep Neural Network: A Survey
Xiaoyu Zhang
Chao Chen
Yi Xie
Xiaofeng Chen
Jun Zhang
Yang Xiang
FedML
118
8
0
13 May 2021
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition
Khin Me Me Chit
Laet Laet Lin
108
4
0
13 May 2021
PIM-DRAM: Accelerating Machine Learning Workloads using Processing in
  Commodity DRAM
PIM-DRAM: Accelerating Machine Learning Workloads using Processing in Commodity DRAMIEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS), 2021
Sourjya Roy
M. Ali
A. Raghunathan
93
24
0
08 May 2021
A Benchmarking on Cloud based Speech-To-Text Services for French Speech
  and Background Noise Effect
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect
Binbin Xu
Chongyang Tao
Z. Feng
Youssef Raqui
Sylvie Ranwez
161
17
0
07 May 2021
Pervasive AI for IoT applications: A Survey on Resource-efficient
  Distributed Artificial Intelligence
Pervasive AI for IoT applications: A Survey on Resource-efficient Distributed Artificial IntelligenceIEEE Communications Surveys and Tutorials (COMST), 2021
Emna Baccour
N. Mhaisen
A. Abdellatif
A. Erbad
Amr M. Mohamed
Mounir Hamdi
Mohsen Guizani
354
125
0
04 May 2021
On the limit of English conversational speech recognition
On the limit of English conversational speech recognitionInterspeech (Interspeech), 2021
Zoltán Tüske
G. Saon
Brian Kingsbury
183
53
0
03 May 2021
RotLSTM: Rotating Memories in Recurrent Neural Networks
RotLSTM: Rotating Memories in Recurrent Neural Networks
Vlad Velici
Adam Prugel-Bennett
RALMVLM
254
1
0
01 May 2021
Adversarial Example Detection for DNN Models: A Review and Experimental
  Comparison
Adversarial Example Detection for DNN Models: A Review and Experimental ComparisonArtificial Intelligence Review (AIR), 2021
Ahmed Aldahdooh
W. Hamidouche
Sid Ahmed Fezza
Olivier Déforges
AAML
697
161
0
01 May 2021
End-to-End Speech Recognition from Federated Acoustic Models
End-to-End Speech Recognition from Federated Acoustic ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Yan Gao
Titouan Parcollet
Salah Zaiem
Javier Fernandez-Marques
Pedro Porto Buarque de Gusmão
Daniel J. Beutel
Nicholas D. Lane
206
46
0
29 Apr 2021
NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform QuantizationJournal of machine learning research (JMLR), 2019
Ali Ramezani-Kebrya
Fartash Faghri
Ilya Markov
V. Aksenov
Dan Alistarh
Daniel M. Roy
MQ
215
34
0
28 Apr 2021
3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head
3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head
Qianyun Wang
Zhenfeng Fan
Shi-hong Xia
3DH
203
21
0
25 Apr 2021
Quantization of Deep Neural Networks for Accurate Edge Computing
Quantization of Deep Neural Networks for Accurate Edge ComputingACM Journal on Emerging Technologies in Computing Systems (JETC), 2021
Wentao Chen
Hailong Qiu
Zhuang Jian
Chutong Zhang
Yu Hu
Qing Lu
Tianchen Wang
Yiyu Shi
Meiping Huang
Xiaowe Xu
230
31
0
25 Apr 2021
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network
Fast Text-Only Domain Adaptation of RNN-Transducer Prediction NetworkInterspeech (Interspeech), 2021
Janne Pylkkönen
Antti Ukkonen
Juho Kilpikoski
Samu Tamminen
Hannes Heikinheimo
181
28
0
22 Apr 2021
Best Practices for Noise-Based Augmentation to Improve the Performance
  of Deployable Speech-Based Emotion Recognition Systems
Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems
Mimansa Jaiswal
E. Provost
108
0
0
18 Apr 2021
MeshTalk: 3D Face Animation from Speech using Cross-Modality
  Disentanglement
MeshTalk: 3D Face Animation from Speech using Cross-Modality DisentanglementIEEE International Conference on Computer Vision (ICCV), 2021
Alexander Richard
Michael Zollhoefer
Yandong Wen
Fernando de la Torre
Yaser Sheikh
CVBM
351
251
0
16 Apr 2021
A Method to Reveal Speaker Identity in Distributed ASR Training, and How
  to Counter It
A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter ItIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Trung D. Q. Dang
Om Thakkar
Swaroop Indra Ramaswamy
Rajiv Mathews
Peter Chin
Franccoise Beaufays
FedML
96
10
0
15 Apr 2021
A Toolbox for Construction and Analysis of Speech Datasets
A Toolbox for Construction and Analysis of Speech Datasets
Evelina Bakhturina
Vitaly Lavrukhin
Boris Ginsburg
148
13
0
11 Apr 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by
  Applying Fast-Skip Regularization
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip RegularizationInterspeech (Interspeech), 2021
Zhengkun Tian
Jiangyan Yi
Ye Bai
Jianhua Tao
Shuai Zhang
Zhengqi Wen
83
19
0
07 Apr 2021
Visual Alignment Constraint for Continuous Sign Language Recognition
Visual Alignment Constraint for Continuous Sign Language RecognitionIEEE International Conference on Computer Vision (ICCV), 2021
Yuecong Min
Aiming Hao
Xiujuan Chai
Xilin Chen
SLR
191
196
0
06 Apr 2021
Intent Recognition and Unsupervised Slot Identification for Low
  Resourced Spoken Dialog Systems
Intent Recognition and Unsupervised Slot Identification for Low Resourced Spoken Dialog SystemsAutomatic Speech Recognition & Understanding (ASRU), 2021
Akshat Gupta
Olivia Deng
Akruti Kushwaha
Saloni Mittal
William Zeng
Sai Krishna Rallabandi
A. Black
198
7
0
03 Apr 2021
TRS: Transferability Reduced Ensemble via Encouraging Gradient Diversity
  and Model Smoothness
TRS: Transferability Reduced Ensemble via Encouraging Gradient Diversity and Model SmoothnessNeural Information Processing Systems (NeurIPS), 2021
Zhuolin Yang
Linyi Li
Xiaojun Xu
Shiliang Zuo
Qiang Chen
Benjamin I. P. Rubinstein
Pan Zhou
Ce Zhang
Yue Liu
AAML
248
65
0
01 Apr 2021
Comparison of different convolutional neural network activation
  functions and methods for building ensembles
Comparison of different convolutional neural network activation functions and methods for building ensembles
L. Nanni
Gianluca Maguolo
S. Brahnam
M. Paci
177
8
0
29 Mar 2021
Are all outliers alike? On Understanding the Diversity of Outliers for
  Detecting OODs
Are all outliers alike? On Understanding the Diversity of Outliers for Detecting OODs
R. Kaur
Susmit Jha
Anirban Roy
O. Sokolsky
Insup Lee
149
14
0
23 Mar 2021
Federated Quantum Machine Learning
Federated Quantum Machine LearningEntropy (Entropy), 2021
Samuel Yen-Chi Chen
Shinjae Yoo
FedMLAI4CE
187
158
0
22 Mar 2021
Digital Peter: Dataset, Competition and Handwriting Recognition Methods
Digital Peter: Dataset, Competition and Handwriting Recognition Methods
M. Potanin
Denis Dimitrov
Alex Shonenkov
Vladimir Bataev
Denis Karachev
Maxim Novopoltsev
158
10
0
16 Mar 2021
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
OkwuGbé: End-to-End Speech Recognition for Fon and Igbo
Bonaventure F. P. Dossou
Chris C. Emezue
179
18
0
13 Mar 2021
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion
  Recognition
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion RecognitionIEEE Transactions on Affective Computing (TAC), 2021
Maurice Gerczuk
Shahin Amiriparian
Sandra Ottl
Björn Schuller
162
70
0
10 Mar 2021
Split Computing and Early Exiting for Deep Learning Applications: Survey
  and Research Challenges
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research ChallengesACM Computing Surveys (CSUR), 2021
Yoshitomo Matsubara
Marco Levorato
Francesco Restuccia
404
276
0
08 Mar 2021
WaveGuard: Understanding and Mitigating Audio Adversarial Examples
WaveGuard: Understanding and Mitigating Audio Adversarial ExamplesUSENIX Security Symposium (USENIX Security), 2021
Shehzeen Samarah Hussain
Paarth Neekhara
Shlomo Dubnov
Julian McAuley
F. Koushanfar
AAML
159
83
0
04 Mar 2021
A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale
  Black-Box Optimization
A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box OptimizationInternational Conference on Machine Learning (ICML), 2021
HanQin Cai
Y. Lou
Daniel McKenzie
W. Yin
261
56
0
21 Feb 2021
Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation
Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation
Elizabeth Fons
Paula Dawson
Xiao-Jun Zeng
J. Keane
Alexandros Iosifidis
AI4TS
116
27
0
16 Feb 2021
Previous
123...678...141516
Next