v1v2 (latest)

Deep Speech: Scaling up end-to-end speech recognition

17 December 2014

Papers citing "Deep Speech: Scaling up end-to-end speech recognition"

50 / 768 papers shown

Real-Time Neural Voice Camouflage

Mia Chiquier

Chengzhi Mao

Carl Vondrick

195

14 Dec 2021

Detecting Audio Adversarial Examples with Logit Noising

184

13 Dec 2021

Finding Deviated Behaviors of the Compressed DNN Models for Image ClassificationsACM Transactions on Software Engineering and Methodology (TOSEM), 2021

230

06 Dec 2021

Joint Audio-Text Model for Expressive Speech-Driven 3D Facial AnimationProceedings of the ACM on Computer Graphics and Interactive Techniques (PACMCGIT), 2021

Yingruo Fan

Mohammad Kachuee

Jun Saito

Wenping Wang

Taku Komura

175

04 Dec 2021

Catch Me If You Can: Blackbox Adversarial Attacks on Automatic Speech Recognition using Frequency Masking

Xiao-lan Wu

A. Rajan

AAML

233

03 Dec 2021

Transformer-S2A: Robust and Efficient Speech-to-Animation

Liyang Chen

Zhiyong Wu

Jun Ling

Runnan Li

Xu Tan

Sheng Zhao

216

18 Nov 2021

A Survey on Adversarial Attacks for Malware AnalysisIEEE Access (IEEE Access), 2021

302

16 Nov 2021

Neural Population Geometry Reveals the Role of Stochasticity in Robust PerceptionNeural Information Processing Systems (NeurIPS), 2021

132

12 Nov 2021

Recent Advances in End-to-End Automatic Speech RecognitionAPSIPA Transactions on Signal and Information Processing (TASIP), 2021

Jinyu Li

VLM

431

427

02 Nov 2021

With a Little Help from my Temporal Context: Multimodal Egocentric Action RecognitionBritish Machine Vision Conference (BMVC), 2021

Dima Damen

297

01 Nov 2021

Imitating Arbitrary Talking Style for Realistic Audio-DrivenTalking Face SynthesisACM Multimedia (ACM MM), 2021

Jia Jia

183

30 Oct 2021

TorchAudio: Building Blocks for Audio and Speech ProcessingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

...

Vincent Quenneville-Bélair

Yangyang Shi

170

190

28 Oct 2021

Beyond

L_p

clipping: Equalization-based Psychoacoustic Attacks against ASRs

H. Abdullah

Muhammad Sajidur Rahman

Christian Peeters

Cassidy Gibson

Washington Garcia

Vincent Bindschaedler

T. Shrimpton

Patrick Traynor

AAML

25 Oct 2021

Deep Neural Networks on EEG Signals to Predict Auditory Attention Score Using Gramian Angular Difference Field

24 Oct 2021

Asynchronous Decentralized Distributed Training of Acoustic ModelsIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2021

Wei Zhang

126

21 Oct 2021

Activation Landscapes as a Topological Summary of Neural Network Performance

Matthew Wheeler

Jose J. Bouza

Peter Bubenik

172

19 Oct 2021

Speech Pattern based Black-box Model Watermarking for Automatic Speech Recognition

19 Oct 2021

Black-box Adversarial Attacks on Commercial Speech Platforms with Minimal Information

Baolin Zheng

Peipei Jiang

Qian Wang

Qi Li

Chao Shen

146

19 Oct 2021

Intent Classification Using Pre-trained Language Agnostic Embeddings For Low Resource Languages

Hemant Yadav

Akshat Gupta

Sai Krishna Rallabandi

A. Black

R. Shah

18 Oct 2021

Towards Robust Waveform-Based Acoustic Models

206

16 Oct 2021

On Language Model Integration for RNN Transducer based Speech Recognition

265

13 Oct 2021

Synergy: Resource Sensitive DNN Scheduling in Multi-Tenant Clusters

239

12 Oct 2021

Automated Testing of AI Models

114

07 Oct 2021

Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition

Xie Chen

220

06 Oct 2021

Building a Noisy Audio Dataset to Evaluate Machine Learning Approaches for Automatic Speech Recognition Systems

J. C. Duarte

S. Colcher

04 Oct 2021

Anti-aliasing Deep Image Classifiers using Novel Depth Adaptive Blurring and Activation Function

165

03 Oct 2021

SpliceOut: A Simple and Efficient Audio Augmentation Method

451

30 Sep 2021

Challenges and Opportunities of Speech Recognition for Bengali Language

109

27 Sep 2021

DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning

158

24 Sep 2021

KOHTD: Kazakh Offline Handwritten Text DatasetSignal processing. Image communication (SPIC), 2021

207

22 Sep 2021

Live Speech Portraits: Real-Time Photorealistic Talking-Head AnimationACM Transactions on Graphics (TOG), 2021

Yuanxun Lu

Jinxiang Chai

Xun Cao

205

22 Sep 2021

Reliable Neural Networks for Regression Uncertainty Estimation

224

16 Sep 2021

Unsupervised Domain Adaptation Schemes for Building ASR in Low-resource Languages

A. C. S.

Prathosh A P

A. G. Ramakrishnan

203

12 Sep 2021

Learning Visual-Audio Representations for Voice-Controlled RobotsIEEE International Conference on Robotics and Automation (ICRA), 2021

Peixin Chang

Shuijing Liu

D. L. McPherson

Katherine Driggs-Campbell

SSL

237

07 Sep 2021

SEC4SR: A Security Analysis Platform for Speaker Recognition

Lingling Fan

169

04 Sep 2021

Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognitionAutomatic Speech Recognition & Understanding (ASRU), 2021

Maxime Burchi

Valentin Vielzeuf

185

101

31 Aug 2021

Adversarial Example Devastation and Detection on Speech Recognition System by Adding Random NoiseJournal of The Audio Engineering Society (JAES), 2021

196

31 Aug 2021

Investigating Vulnerabilities of Deep Neural PoliciesConference on Uncertainty in Artificial Intelligence (UAI), 2021

Ezgi Korkmaz

AAML

138

30 Aug 2021

Automatic Speech Recognition And Limited Vocabulary: A Survey

263

23 Aug 2021

FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning

Yifei Huang

166

143

18 Aug 2021

Detecting OODs as datapoints with High Uncertainty

128

13 Aug 2021

SpecMix : A Mixed Sample Data Augmentation method for Training withTime-Frequency Domain FeaturesInterspeech (Interspeech), 2021

Gwantae Kim

D. Han

Hanseok Ko

138

06 Aug 2021

Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent IdentificationWorld Forum on Internet of Things (WF-IoT), 2021

Sangeeta Ghangam

Daniel Whitenack

Joshua Nemecek

04 Aug 2021

A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English

Saida Mussakhojayeva

Yerbolat Khassanov

H. A. Varol

153

03 Aug 2021

The History of Speech Recognition to the Year 2030

Awni Y. Hannun

AI4TS

229

30 Jul 2021

CarneliNet: Neural Mixture Model for Automatic Speech Recognition

A. Kalinov

Somshubra Majumdar

Jagadeesh Balam

Boris Ginsburg

MoE

105

22 Jul 2021

Trustworthy AI: A Computational Perspective

Xiaorui Liu

412

258

12 Jul 2021

End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning

128

07 Jul 2021

A Survey on Data Augmentation for Text Classification

Markus Bayer

M. Kaufhold

Christian A. Reuter

456

426

07 Jul 2021

Egocentric Videoconferencing

Mohamed A. Elgharib

Mohit Mendiratta

Justus Thies

Matthias Nießner

Hans-Peter Seidel

132

07 Jul 2021