Deep Speech: Scaling up end-to-end speech recognition

17 December 2014

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 770 papers shown

Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition

120

04 Jul 2021

CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription

Nikita Pavlichenko

Ivan Stelmakh

Dmitry Ustalov

144

02 Jul 2021

Realtime Robust Malicious Traffic Detection via Frequency Domain Analysis

Conference on Computer and Communications Security (CCS), 2021

Qi Li

148

207

28 Jun 2021

Towards Model-informed Precision Dosing with Expert-in-the-loop Machine Learning

IEEE International Conference on Information Reuse and Integration (IRI), 2021

Yihuang Kang

Y. Chiu

Ming-Yen Lin

F. Su

Sheng-Tai Huang

127

28 Jun 2021

Open, Sesame! Introducing Access Control to Voice Services

27 Jun 2021

Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

IEEE International Conference on Application-Specific Systems, Architectures, and Processors (ASAP), 2021

...

Vladimir Loncar

196

26 Jun 2021

Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training

Neural Information Processing Systems (NeurIPS), 2021

100

22 Jun 2021

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better

Gaurav Menghani

VLM MedIm

272

528

16 Jun 2021

Dialectal Speech Recognition and Translation of Swiss German Speech to Standard German Text: Microsoft's Submission to SwissText 2021

111

15 Jun 2021

Break-It-Fix-It: Unsupervised Learning for Program Repair

International Conference on Machine Learning (ICML), 2021

Michihiro Yasunaga

Abigail Z. Jacobs

240

121

11 Jun 2021

Handcrafted Backdoors in Deep Neural Networks

Neural Information Processing Systems (NeurIPS), 2021

Sanghyun Hong

Nicholas Carlini

Alexey Kurakin

233

08 Jun 2021

SpeechBrain: A General-Purpose Speech Toolkit

Mirco Ravanelli

Titouan Parcollet

Peter William VanHarn Plantinga

...

295

904

08 Jun 2021

LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting Normalization

Computer Vision and Pattern Recognition (CVPR), 2021

206

111

08 Jun 2021

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Interspeech, 2021

Xie Chen

146

04 Jun 2021

An Improved Model for Voicing Silent Speech

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

David Gaddy

Dana Klein

228

03 Jun 2021

Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning

IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

Haibin Wu

Zhiyong Wu

257

01 Jun 2021

Multi-Modal Semantic Inconsistency Detection in Social Media News Posts

Conference on Multimedia Modeling (MMM), 2021

S. McCrae

Kehan Wang

A. Zakhor

147

26 May 2021

See, Hear, Read: Leveraging Multimodality with Guided Attention for Abstractive Text Summarization

Knowledge-Based Systems (KBS), 2021

Yash Kumar Atri

Shraman Pramanick

Vikram Goyal

Tanmoy Chakraborty

222

20 May 2021

Unsupervised Discriminative Learning of Sounds for Audio Event Classification

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Sascha Hornauer

Ke Li

Stella X. Yu

Shabnam Ghaffarzadegan

Liu Ren

SSL

117

19 May 2021

Privacy Inference Attacks and Defenses in Cloud-based Deep Neural Network: A Survey

118

13 May 2021

Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition

Khin Me Me Chit

Laet Laet Lin

108

13 May 2021

PIM-DRAM: Accelerating Machine Learning Workloads using Processing in Commodity DRAM

IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS), 2021

Sourjya Roy

M. Ali

A. Raghunathan

08 May 2021

A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect

161

07 May 2021

Pervasive AI for IoT applications: A Survey on Resource-efficient Distributed Artificial Intelligence

IEEE Communications Surveys and Tutorials (COMST), 2021

354

125

04 May 2021

On the limit of English conversational speech recognition

Interspeech, 2021

Zoltán Tüske

G. Saon

Brian Kingsbury

183

03 May 2021

RotLSTM: Rotating Memories in Recurrent Neural Networks

Vlad Velici

Adam Prugel-Bennett

RALM VLM

254

01 May 2021

Adversarial Example Detection for DNN Models: A Review and Experimental Comparison

Artificial Intelligence Review (AIR), 2021

697

161

01 May 2021

End-to-End Speech Recognition from Federated Acoustic Models

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Yan Gao

Titouan Parcollet

Salah Zaiem

Javier Fernandez-Marques

Pedro Porto Buarque de Gusmão

Daniel J. Beutel

Nicholas D. Lane

206

29 Apr 2021

NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization

Journal of Machine Learning Research (JMLR), 2019

Dan Alistarh

215

28 Apr 2021

3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head

203

25 Apr 2021

Quantization of Deep Neural Networks for Accurate Edge Computing

ACM Journal on Emerging Technologies in Computing Systems (JETC), 2021

Xiaowei Xu

230

25 Apr 2021

Fast Text-Only Domain Adaptation of RNN-Transducer Prediction Network

Interspeech, 2021

181

22 Apr 2021

Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems

Mimansa Jaiswal

E. Provost

108

18 Apr 2021

MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement

IEEE International Conference on Computer Vision (ICCV), 2021

351

251

16 Apr 2021

A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Trung D. Q. Dang

Om Thakkar

Swaroop Indra Ramaswamy

15 Apr 2021

A Toolbox for Construction and Analysis of Speech Datasets

Evelina Bakhturina

Vitaly Lavrukhin

Boris Ginsburg

148

11 Apr 2021

FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization

Interspeech, 2021

Jiangyan Yi

07 Apr 2021

Visual Alignment Constraint for Continuous Sign Language Recognition

IEEE International Conference on Computer Vision (ICCV), 2021

191

196

06 Apr 2021

Intent Recognition and Unsupervised Slot Identification for Low Resourced Spoken Dialog Systems

Automatic Speech Recognition & Understanding (ASRU), 2021

Akshat Gupta

Sai Krishna Rallabandi

A. Black

198

03 Apr 2021

TRS: Transferability Reduced Ensemble via Encouraging Gradient Diversity and Model Smoothness

Neural Information Processing Systems (NeurIPS), 2021

Benjamin I. P. Rubinstein

248

01 Apr 2021

Comparison of different convolutional neural network activation functions and methods for building ensembles

177

29 Mar 2021

Are all outliers alike? On Understanding the Diversity of Outliers for Detecting OODs

149

23 Mar 2021

Federated Quantum Machine Learning

Entropy, 2021

Samuel Yen-Chi Chen

Shinjae Yoo

FedML AI4CE

187

158

22 Mar 2021

Digital Peter: Dataset, Competition and Handwriting Recognition Methods

158

16 Mar 2021

OkwuGbé: End-to-End Speech Recognition for Fon and Igbo

Bonaventure F. P. Dossou

Chris C. Emezue

179

13 Mar 2021

EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition

IEEE Transactions on Affective Computing (TAC), 2021

Maurice Gerczuk

Shahin Amiriparian

Sandra Ottl

Björn Schuller

162

10 Mar 2021

Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges

ACM Computing Surveys (CSUR), 2021

Yoshitomo Matsubara

Marco Levorato

Francesco Restuccia

404

276

08 Mar 2021

WaveGuard: Understanding and Mitigating Audio Adversarial Examples

USENIX Security Symposium (USENIX Security), 2021

Shehzeen Samarah Hussain

Shlomo Dubnov

159

04 Mar 2021

A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box Optimization

International Conference on Machine Learning (ICML), 2021

261

21 Feb 2021

Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation

116

16 Feb 2021