v1v2 (latest)

Deep Speech: Scaling up end-to-end speech recognition

17 December 2014

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 768 papers shown

Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech RecognitionIEEE Signal Processing Magazine (IEEE Signal Process. Mag.), 2020

121

24 Feb 2020

Semi-Supervised Speech Recognition via Local Prior Matching

236

24 Feb 2020

Uncertainty Estimation in Autoregressive Structured Prediction

A. Malinin

Mark Gales

UQLM

241

18 Feb 2020

Identifying Audio Adversarial Examples via Anomalous Pattern Detection

185

13 Feb 2020

Spatial-Temporal Multi-Cue Network for Continuous Sign Language RecognitionAAAI Conference on Artificial Intelligence (AAAI), 2020

137

235

08 Feb 2020

RPN: A Residual Pooling Network for Efficient Federated LearningEuropean Conference on Artificial Intelligence (ECAI), 2020

Yang Liu

199

23 Jan 2020

Single headed attention based sequence-to-sequence model for state-of-the-art results on SwitchboardInterspeech (Interspeech), 2020

Kartik Audhkhasi

172

20 Jan 2020

FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization Approach for Deep Neural NetworksJournal of Artificial Intelligence Research (JAIR), 2020

182

18 Jan 2020

Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends

Björn W. Schuller

346

02 Jan 2020

RC-DARTS: Resource Constrained Differentiable Architecture Search

Ming-Hsuan Yang

137

30 Dec 2019

An Analysis of the Expressiveness of Deep Neural Network Architectures Based on Their Lipschitz Constants

Siqi Zhou

Angela P. Schoellig

24 Dec 2019

Incorporating Unlabeled Data into Distributionally Robust LearningJournal of machine learning research (JMLR), 2019

159

16 Dec 2019

Common Voice: A Massively-Multilingual Speech CorpusInternational Conference on Language Resources and Evaluation (LREC), 2019

343

2,070

13 Dec 2019

Neural Voice Puppetry: Audio-driven Facial ReenactmentEuropean Conference on Computer Vision (ECCV), 2019

Matthias Nießner

294

423

11 Dec 2019

SpecAugment on Large Scale DatasetsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

172

154

11 Dec 2019

Effective Data Augmentation Approaches to End-to-End Task-Oriented DialogueInternational Conference on Asian Language Processing (IALP), 2019

Jun Quan

Deyi Xiong

114

05 Dec 2019

Scratch that! An Evolution-based Adversarial Attack against Neural Networks

Malhar Jere

Loris Rossi

Briland Hitaj

Gabriela F. Cretu-Ciocarlie

Giacomo Boracchi

F. Koushanfar

AAML

190

05 Dec 2019

Topic-aware chatbot using Recurrent Neural Networks and Nonnegative Matrix Factorization

...

229

01 Dec 2019

Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech RecognitionInterspeech (Interspeech), 2019

272

28 Nov 2019

Stage-based Hyper-parameter Optimization for Deep Learning

24 Nov 2019

Universal adversarial examples in speech command classification

Jon Vadillo

Roberto Santana

AAML

200

22 Nov 2019

DermGAN: Synthetic Generation of Clinical Skin Images with Pathology

171

110

20 Nov 2019

Generate (non-software) Bugs to Fool ClassifiersAAAI Conference on Artificial Intelligence (AAAI), 2019

20 Nov 2019

A novel method for identifying the deep neural network model with the Serial Number

19 Nov 2019

Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models

Luke Zettlemoyer

163

09 Nov 2019

Who is Real Bob? Adversarial Attacks on Speaker Recognition SystemsIEEE Symposium on Security and Privacy (IEEE S&P), 2019

Lingling Fan

Yang Liu

262

225

03 Nov 2019

Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?AAAI Conference on Artificial Intelligence (AAAI), 2019

Bhavya Ghai

Buvana Ramanan

Klaus Mueller

29 Oct 2019

Improving sequence-to-sequence speech recognition training with on-the-fly data augmentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

235

104

29 Oct 2019

Meta Learning for End-to-End Low-Resource Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

Jui-Yang Hsu

Yuan-Jui Chen

Hung-yi Lee

116

114

26 Oct 2019

Recognizing long-form speech using streaming end-to-end modelsAutomatic Speech Recognition & Understanding (ASRU), 2019

172

135

24 Oct 2019

AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks

140

21 Oct 2019

End-to-End Speech Recognition: A review for the French Language

Florian Boyer

Jean-Luc Rouas

AI4TS

153

18 Oct 2019

Hear "No Evil", See "Kenansville": Efficient and Transferable Black-Box Attacks on Speech Recognition and Voice Identification Systems

H. Abdullah

Muhammad Sajidur Rahman

141

11 Oct 2019

Animating Face using Disentangled Audio RepresentationsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2019

Gaurav Mittal

Baoyuan Wang

CVBM

182

02 Oct 2019

Addressing Failure Prediction by Learning Model ConfidenceNeural Information Processing Systems (NeurIPS), 2019

283

331

01 Oct 2019

RandAugment: Practical automated data augmentation with a reduced search space

1.0K

3,948

30 Sep 2019

A Comparison of Hybrid and End-to-End Models for Syllable RecognitionInternational Conference on Text, Speech and Dialogue (TSD), 2019

Sebastian P. Bayerl

Korbinian Riedhammer

19 Sep 2019

Adversarial Attacks and Defenses in Images, Graphs and Text: A ReviewInternational Journal of Automation and Computing (IJAC), 2019

331

728

17 Sep 2019

Preech: A System for Privacy-Preserving Speech TranscriptionUSENIX Security Symposium (USENIX Security), 2019

371

09 Sep 2019

A Quantum Search Decoder for Natural Language ProcessingQuantum Machine Intelligence (QMI), 2019

Johannes Bausch

Sathyawageeswar Subramanian

Stephen Piddock

203

09 Sep 2019

PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing UnitsInternational Symposium on High-Performance Computer Architecture (HPCA), 2019

Yujeong Choi

Minsoo Rhu

137

153

06 Sep 2019

Harnessing the Power of Deep Learning Methods in Healthcare: Neonatal Pain Assessment from Crying Sound

112

05 Sep 2019

Brain2Char: A Deep Architecture for Decoding Text from Brain RecordingsJournal of Neural Engineering (J. Neural Eng.), 2019

Pengfei Sun

Gopala K. Anumanchipalli

E. Chang

03 Sep 2019

Beyond Human-Level Accuracy: Computational Challenges in Deep LearningACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPoPP), 2019

Joel Hestness

Newsha Ardalani

G. Diamos

110

03 Sep 2019

Metric Learning for Adversarial RobustnessNeural Information Processing Systems (NeurIPS), 2019

Carl Vondrick

320

199

03 Sep 2019

Smaller Models, Better Generalization

29 Aug 2019

End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer LearningInterspeech (Interspeech), 2019

Pavel Denisov

Ngoc Thang Vu

129

13 Aug 2019

Universal Adversarial Audio Perturbations

Alessandro Lameiras Koerich

AAML

318

08 Aug 2019

Imperio: Robust Over-the-Air Adversarial Examples for Automatic Speech Recognition SystemsAsia-Pacific Computer Systems Architecture Conference (APCSAC), 2019

356

05 Aug 2019

Machine Learning at the Network Edge: A SurveyACM Computing Surveys (ACM CSUR), 2019

Ganesh Ananthanarayanan

Faraz Hussain

643

461

31 Jul 2019