ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1412.5567
  4. Cited By
Deep Speech: Scaling up end-to-end speech recognition
v1v2 (latest)

Deep Speech: Scaling up end-to-end speech recognition

17 December 2014
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
Erich Elsen
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
ArXiv (abs)PDFHTML

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 768 papers shown
Explanation of Unintended Radiated Emission Classification via LIME
Explanation of Unintended Radiated Emission Classification via LIME
Tom Grimes
E. Church
W. Pitts
Lynn Wood
69
5
0
04 Sep 2020
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity
  Edge Devices
CLAN: Continuous Learning using Asynchronous Neuroevolution on Commodity Edge DevicesIEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2020
Parth Mannan
A. Samajdar
T. Krishna
182
2
0
27 Aug 2020
Geometry-guided Dense Perspective Network for Speech-Driven Facial
  Animation
Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation
Jing-ying Liu
Binyuan Hui
Kun Li
Yunke Liu
Yu-kun Lai
Yuxiang Zhang
Yebin Liu
Jingyu Yang
3DHCVBM
221
32
0
23 Aug 2020
MASRI-HEADSET: A Maltese Corpus for Speech Recognition
MASRI-HEADSET: A Maltese Corpus for Speech Recognition
C. Mena
Albert Gatt
A. DeMarco
Claudia Borg
Lonneke van der Plas
Amanda Muscat
Ian Padovani
104
15
0
13 Aug 2020
Attention-based Fully Gated CNN-BGRU for Russian Handwritten Text
Attention-based Fully Gated CNN-BGRU for Russian Handwritten TextJournal of Imaging (JI), 2020
Abdelrahman Abdallah
Mohamed Hamada
D. Nurseitov
197
49
0
12 Aug 2020
Transformer with Bidirectional Decoder for Speech Recognition
Transformer with Bidirectional Decoder for Speech RecognitionInterspeech (Interspeech), 2020
Xi Chen
Songyang Zhang
Dandan Song
P. Ouyang
Shouyi Yin
144
15
0
11 Aug 2020
TinySpeech: Attention Condensers for Deep Speech Recognition Neural
  Networks on Edge Devices
TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
A. Wong
M. Famouri
Maya Pavlova
Siddharth Surana
310
34
0
10 Aug 2020
Improving the Accuracy of Global Forecasting Models using Time Series
  Data Augmentation
Improving the Accuracy of Global Forecasting Models using Time Series Data AugmentationPattern Recognition (Pattern Recognit.), 2020
Kasun Bandara
Hansika Hewamalage
Yuan-Hao Liu
Yanfei Kang
Christoph Bergmeir
AI4TS
341
133
0
06 Aug 2020
FRMDN: Flow-based Recurrent Mixture Density Network
FRMDN: Flow-based Recurrent Mixture Density Network
S. Razavi
Reshad Hosseini
Tina Behzad
BDL
345
2
0
05 Aug 2020
Word meaning in minds and machines
Word meaning in minds and machines
Brenden M. Lake
G. Murphy
NAI
369
140
0
04 Aug 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
305
62
0
29 Jul 2020
Autosegmental Neural Nets: Should Phones and Tones be Synchronous or
  Asynchronous?
Autosegmental Neural Nets: Should Phones and Tones be Synchronous or Asynchronous?Interspeech (Interspeech), 2020
Jialu Li
M. Hasegawa-Johnson
173
5
0
28 Jul 2020
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery
Self-Expressing Autoencoders for Unsupervised Spoken Term DiscoveryInterspeech (Interspeech), 2020
Saurabhchand Bhati
Jesús Villalba
Piotr Żelasko
Najim Dehak
SSL
143
18
0
26 Jul 2020
MP3 Compression To Diminish Adversarial Noise in End-to-End Speech
  Recognition
MP3 Compression To Diminish Adversarial Noise in End-to-End Speech RecognitionInternational Conference on Speech and Computer (SPECOM), 2020
I. Andronic
Ludwig Kurzinger
Edgar Ricardo Chavez Rosas
Gerhard Rigoll
B. Seeber
147
16
0
25 Jul 2020
Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech
  Recognition
Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech Recognition
Ludwig Kurzinger
Edgar Ricardo Chavez Rosas
Lujun Li
Tobias Watzel
Gerhard Rigoll
AAML
191
4
0
21 Jul 2020
Learning to Generate Customized Dynamic 3D Facial Expressions
Learning to Generate Customized Dynamic 3D Facial ExpressionsEuropean Conference on Computer Vision (ECCV), 2020
Rolandos Alexandros Potamias
Jiali Zheng
Stylianos Ploumpis
Giorgos Bouritsas
Evangelos Ververas
Stefanos Zafeiriou
3DH
271
24
0
19 Jul 2020
Robust Image Classification Using A Low-Pass Activation Function and DCT
  Augmentation
Robust Image Classification Using A Low-Pass Activation Function and DCT AugmentationIEEE Access (IEEE Access), 2020
Md Tahmid Hossain
S. Teng
Ferdous Sohel
Guojun Lu
287
11
0
18 Jul 2020
EZLDA: Efficient and Scalable LDA on GPUs
EZLDA: Efficient and Scalable LDA on GPUsIEEE Access (IEEE Access), 2020
Shilong Wang
Hang Liu
Anil Gaihre
Hengyong Yu
123
1
0
17 Jul 2020
Data augmentation enhanced speaker enrollment for text-dependent speaker
  verification
Data augmentation enhanced speaker enrollment for text-dependent speaker verificationInternational Conference on Energy, Power and Environment (ICEPE), 2020
A. K. Sarkar
H. Sarma
Priyanka Dwivedi
Zheng-Hua Tan
91
4
0
12 Jul 2020
The Computational Limits of Deep Learning
The Computational Limits of Deep Learning
Neil C. Thompson
Kristjan Greenewald
Keeheon Lee
Gabriel F. Manso
VLM
296
642
0
10 Jul 2020
Meta-Learning Symmetries by Reparameterization
Meta-Learning Symmetries by Reparameterization
Allan Zhou
Tom Knowles
Chelsea Finn
OOD
304
104
0
06 Jul 2020
Learning from Failure: Training Debiased Classifier from Biased
  Classifier
Learning from Failure: Training Debiased Classifier from Biased Classifier
J. Nam
Hyuntak Cha
SungSoo Ahn
Jaeho Lee
Jinwoo Shin
255
168
0
06 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML
  Models: A Survey and Insights
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
287
100
0
02 Jul 2020
Hippo: Taming Hyper-parameter Optimization of Deep Learning with Stage
  Trees
Hippo: Taming Hyper-parameter Optimization of Deep Learning with Stage Trees
Ahnjae Shin
Do Yoon Kim
Joo Seong Jeong
Byung-Gon Chun
153
5
0
22 Jun 2020
Regression Prior Networks
Regression Prior Networks
A. Malinin
Sergey Chervontsev
Ivan Provilkov
Mark Gales
BDLUQCV
255
39
0
20 Jun 2020
Calibrating Deep Neural Network Classifiers on Out-of-Distribution
  Datasets
Calibrating Deep Neural Network Classifiers on Out-of-Distribution Datasets
Zhihui Shao
Jianyi Yang
Shaolei Ren
OODD
204
11
0
16 Jun 2020
Emotion Recognition in Audio and Video Using Deep Neural Networks
Emotion Recognition in Audio and Video Using Deep Neural Networks
Mandeep Singh
Yuanye Fang
105
19
0
15 Jun 2020
Sparsity Turns Adversarial: Energy and Latency Attacks on Deep Neural
  Networks
Sparsity Turns Adversarial: Energy and Latency Attacks on Deep Neural Networks
Sarada Krithivasan
Sanchari Sen
A. Raghunathan
AAML
161
1
0
14 Jun 2020
Transfer Learning for British Sign Language Modelling
Transfer Learning for British Sign Language Modelling
B. Mocialov
Graham Turner
H. Hastie
SLR
134
20
0
03 Jun 2020
Detecting Audio Attacks on ASR Systems with Dropout Uncertainty
Detecting Audio Attacks on ASR Systems with Dropout UncertaintyInterspeech (Interspeech), 2020
T. Jayashankar
Jonathan Le Roux
P. Moulin
AAML
126
17
0
02 Jun 2020
Scalable Polyhedral Verification of Recurrent Neural Networks
Scalable Polyhedral Verification of Recurrent Neural NetworksInternational Conference on Computer Aided Verification (CAV), 2020
Wonryong Ryou
Jiayu Chen
Mislav Balunović
Gagandeep Singh
Andrei Dan
Martin Vechev
281
36
0
27 May 2020
Simplified Self-Attention for Transformer-based End-to-End Speech
  Recognition
Simplified Self-Attention for Transformer-based End-to-End Speech Recognition
Haoneng Luo
Shiliang Zhang
Ming Lei
Lei Xie
287
36
0
21 May 2020
Toward Automated Classroom Observation: Multimodal Machine Learning to Estimate CLASS Positive Climate and Negative Climate
Anand Ramakrishnan
Brian Zylich
Erin Ottmar
Jennifer LoCasale-Crouch
Jacob Whitehill
198
32
0
19 May 2020
Neural Polysynthetic Language Modelling
Neural Polysynthetic Language Modelling
Lane Schwartz
Francis M. Tyers
Lori S. Levin
Christo Kirov
Patrick Littell
...
Vasilisa Andriyanets
Aldrian Obaja Muis
Naoki Otani
J. Park
Zhisong Zhang
243
25
0
11 May 2020
The Perceptimatic English Benchmark for Speech Perception Models
The Perceptimatic English Benchmark for Speech Perception Models
Juliette Millet
Ewan Dunbar
106
4
0
07 May 2020
RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and
  Solutions
RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions
Chung-Cheng Chiu
A. Narayanan
Wei Han
Rohit Prabhavalkar
Yu Zhang
...
Ruoming Pang
Tara N. Sainath
Patrick Nguyen
Liangliang Cao
Yonghui Wu
377
44
0
07 May 2020
Data Augmentation for Spoken Language Understanding via Pretrained
  Language Models
Data Augmentation for Spoken Language Understanding via Pretrained Language Models
Baolin Peng
Chenguang Zhu
Michael Zeng
Jianfeng Gao
193
26
0
29 Apr 2020
Conditional Spoken Digit Generation with StyleGAN
Conditional Spoken Digit Generation with StyleGANInterspeech (Interspeech), 2020
Kasperi Palkama
Lauri Juvela
Alexander Ilin
GAN
225
11
0
28 Apr 2020
A Summary of the First Workshop on Language Technology for Language
  Documentation and Revitalization
A Summary of the First Workshop on Language Technology for Language Documentation and RevitalizationWorkshop on Spoken Language Technologies for Under-resourced Languages (SLTU), 2020
Graham Neubig
Shruti Rijhwani
Alexis Palmer
Jordan MacKenzie
Hilaria Cruz
...
Yiyuan Li
S. Zink
Mengzhou Xia
Roshan S. Sharma
Patrick Littell
104
8
0
27 Apr 2020
COVID-19 Time-series Prediction by Joint Dictionary Learning and Online
  NMF
COVID-19 Time-series Prediction by Joint Dictionary Learning and Online NMF
Hanbaek Lyu
Christopher Strohmeier
G. Menz
Deanna Needell
133
12
0
20 Apr 2020
Non-Blocking Simultaneous Multithreading: Embracing the Resiliency of
  Deep Neural Networks
Non-Blocking Simultaneous Multithreading: Embracing the Resiliency of Deep Neural NetworksMicro (MICRO), 2020
Gil Shomron
U. Weiser
167
16
0
17 Apr 2020
Direct Speech-to-image Translation
Direct Speech-to-image TranslationIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2020
Jiguo Li
Xinfeng Zhang
Chuanmin Jia
Jizheng Xu
Li Zhang
Y. Wang
Siwei Ma
Wen Gao
178
34
0
07 Apr 2020
Information Leakage in Embedding Models
Information Leakage in Embedding ModelsConference on Computer and Communications Security (CCS), 2020
Congzheng Song
A. Raghunathan
MIACV
439
322
0
31 Mar 2020
Characterizing Speech Adversarial Examples Using Self-Attention U-Net
  Enhancement
Characterizing Speech Adversarial Examples Using Self-Attention U-Net EnhancementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Chao-Han Huck Yang
Jun Qi
Pin-Yu Chen
Xiaoli Ma
Chin-Hui Lee
AAML
238
56
0
31 Mar 2020
Training for Speech Recognition on Coprocessors
Training for Speech Recognition on Coprocessors
Sebastian Baunsgaard
S. Wrede
Pınar Tözün
141
6
0
22 Mar 2020
Generating Socially Acceptable Perturbations for Efficient Evaluation of
  Autonomous Vehicles
Generating Socially Acceptable Perturbations for Efficient Evaluation of Autonomous Vehicles
Songan Zhang
H. Peng
S. Nageshrao
E. Tseng
AAML
201
5
0
18 Mar 2020
Hybrid Autoregressive Transducer (hat)
Hybrid Autoregressive Transducer (hat)IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Ehsan Variani
David Rybach
Cyril Allauzen
Michael Riley
179
170
0
12 Mar 2020
Development of Automatic Speech Recognition for Kazakh Language using
  Transfer Learning
Development of Automatic Speech Recognition for Kazakh Language using Transfer LearningInternational Journal of Advanced Trends in Computer Science and Engineering (IJATCSE), 2020
Amirgaliyev E.N.
Kuanyshbay D.N.
O. Baimuratov
71
14
0
08 Mar 2020
TxSim:Modeling Training of Deep Neural Networks on Resistive Crossbar
  Systems
TxSim:Modeling Training of Deep Neural Networks on Resistive Crossbar Systems
Sourjya Roy
S. Sridharan
Shubham Jain
A. Raghunathan
200
47
0
25 Feb 2020
A.I. based Embedded Speech to Text Using Deepspeech
A.I. based Embedded Speech to Text Using Deepspeech
Muhammad Hafidh Firmansyah
Anand Paul
D. Bhattacharya
Gul Malik Urfa
84
6
0
25 Feb 2020
Previous
123...8910...141516
Next