v1v2 (latest)

Deep Speech: Scaling up end-to-end speech recognition

17 December 2014

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 768 papers shown

Correlation Distance Skip Connection Denoising Autoencoder (CDSK-DAE) for Speech Feature EnhancementApplied Acoustics (Appl. Acoust.), 2019

116

26 Jul 2019

A system of different layers of abstraction for artificial intelligence

Alexander Serb

T. Prodromakis

AI4CE

22 Jul 2019

A semi-holographic hyperdimensional representation system for hardware-friendly cognitive computing

160

12 Jul 2019

Fine-grained robust prosody transfer for single-speaker neural text-to-speechInterspeech (Interspeech), 2019

Jonas Rohnke

178

04 Jul 2019

Towards Interpretable Deep Extreme Multi-label LearningIEEE International Conference on Information Reuse and Integration (IRI), 2019

Yihuang Kang

100

03 Jul 2019

Themis: Fair and Efficient GPU Cluster SchedulingSymposium on Networked Systems Design and Implementation (NSDI), 2019

Kshiteej S. Mahajan

Arjun Balasubramanian

Arjun Singhvi

Shivaram Venkataraman

Aditya Akella

Amar Phanishayee

Shuchi Chawla

161

218

02 Jul 2019

Gated Embeddings in End-to-End Speech Recognition for Conversational-Context FusionAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Suyoun Kim

Siddharth Dalmia

Florian Metze

172

27 Jun 2019

Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric BiasFrontiers in Robotics and AI (Front. Robot. AI), 2019

Ryo Nakashima

Ryo Ozaki

T. Taniguchi

157

21 Jun 2019

On the Robustness of the Backdoor-based Watermarking in Deep Neural Networks

Masoumeh Shafieinejad

136

18 Jun 2019

Curriculum-based transfer learning for an effective end-to-end spoken language understanding and domain portabilityInterspeech (Interspeech), 2019

124

18 Jun 2019

Deep Xi as a Front-End for Robust Automatic Speech Recognition

Aaron Nicolson

K. Paliwal

120

18 Jun 2019

Perceptual Based Adversarial Audio Attacks

Joseph Szurley

J. Zico Kolter

AAML

109

14 Jun 2019

Selfie: Self-supervised Pretraining for Image Embedding

290

116

07 Jun 2019

The Architectural Implications of Facebook's DNN-based Personalized RecommendationInternational Symposium on High-Performance Computer Architecture (HPCA), 2019

...

317

315

06 Jun 2019

Reverse KL-Divergence Training of Prior Networks: Improved Uncertainty and Adversarial RobustnessNeural Information Processing Systems (NeurIPS), 2019

A. Malinin

Mark Gales

UQCV AAML

261

202

31 May 2019

Speaker Anonymization Using X-vector and Neural Waveform ModelsSpeech Synthesis Workshop (SSW), 2019

Xin Wang

Isao Echizen

154

157

30 May 2019

Mixed Precision Training With 8-bit Floating Point

174

29 May 2019

Local Label Propagation for Large-Scale Semi-Supervised Learning

108

28 May 2019

NTP : A Neural Network Topology Profiler

125

22 May 2019

Acoustic-to-Word Models with Conversational Context InformationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2019

Suyoun Kim

Florian Metze

157

21 May 2019

Universal Adversarial Perturbations for Speech Recognition SystemsInterspeech (Interspeech), 2019

Paarth Neekhara

Shehzeen Samarah Hussain

Shlomo Dubnov

142

128

09 May 2019

Capture, Learning, and Synthesis of 3D Speaking StylesComputer Vision and Pattern Recognition (CVPR), 2019

285

399

08 May 2019

Transparent pronunciation scoring using articulatorily weighted phoneme edit distanceInterspeech (Interspeech), 2019

Reima Karhila

Anna-Riikka Smolander

Sari Ylinen

M. Kurimo

07 May 2019

Ensemble Distribution DistillationInternational Conference on Learning Representations (ICLR), 2019

504

262

30 Apr 2019

Unsupervised Data Augmentation for Consistency TrainingNeural Information Processing Systems (NeurIPS), 2019

793

2,537

29 Apr 2019

Transformers with convolutional context for ASR

Abdel-rahman Mohamed

Dmytro Okhonko

Luke Zettlemoyer

192

172

26 Apr 2019

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

518

3,832

18 Apr 2019

Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation

Gakuto Kurata

Kartik Audhkhasi

153

17 Apr 2019

Adversarial Audio: A New Information Hiding Method and Backdoor for DNN-based Speech Recognition Models

Yehao Kong

Jiliang Zhang

08 Apr 2019

Measuring scheduling efficiency of RNNs for NLP applications

105

05 Apr 2019

Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions

183

105

04 Apr 2019

RAPID: Early Classification of Explosive Transients using Deep Learning

160

122

29 Mar 2019

Local Aggregation for Unsupervised Learning of Visual Embeddings

256

460

29 Mar 2019

Grammatical Error Correction and Style Transfer via Zero-shot Monolingual Translation

148

27 Mar 2019

Automatic Spelling Correction with Transformer for CTC-based End-to-End Speech Recognition

Shiliang Zhang

Ming Lei

Zhijie Yan

113

27 Mar 2019

Practical Hidden Voice Attacks against Speech and Speaker Recognition SystemsNetwork and Distributed System Security Symposium (NDSS), 2019

152

177

18 Mar 2019

End-To-End Speech Recognition Using A High Rank LSTM-CTC Based Model

102

12 Mar 2019

Source codes in human communication

Michael Ramscar

08 Mar 2019

KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube VideosConference on Empirical Methods in Natural Language Processing (EMNLP), 2018

113

01 Mar 2019

Incorporating End-to-End Speech Recognition Models for Sentiment AnalysisIEEE International Conference on Robotics and Automation (ICRA), 2019

154

28 Feb 2019

An Optimized Recurrent Unit for Ultra-Low-Power Keyword SpottingProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2019

Justice Amoh

K. Odame

173

13 Feb 2019

Salus: Fine-Grained GPU Sharing Primitives for Deep Learning ApplicationsConference on Machine Learning and Systems (MLSys), 2019

Peifeng Yu

Mosharaf Chowdhury

162

12 Feb 2019

Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM

105

30 Jan 2019

Weighted-Sampling Audio Adversarial Example Attack

220

26 Jan 2019

SirenAttack: Generating Adversarial Audio for End-to-End Acoustic Systems

247

146

23 Jan 2019

Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition

247

123

22 Jan 2019

Robust Watermarking of Neural Network with Exponential Weighting

Ryota Namba

Jun Sakuma

AAML

160

151

18 Jan 2019

Prototypical Metric Transfer Learning for Continuous Speech Keyword Spotting With Limited Training Data

Harshita Seth

Pulkit Kumar

Muktabh Mayank Srivastava

184

12 Jan 2019

Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units

132

31 Dec 2018

Stanza: Layer Separation for Distributed Training in Deep Learning

125

27 Dec 2018