v1v2 (latest)

Achieving Human Parity in Conversational Speech Recognition

17 October 2016

Papers citing "Achieving Human Parity in Conversational Speech Recognition"

50 / 201 papers shown

Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition

Xuesong Yang

Kartik Audhkhasi

Andrew Rosenberg

Samuel Thomas

Bhuvana Ramabhadran

M. Hasegawa-Johnson

116

07 Feb 2018

Blind Pre-Processing: A Robust Defense Method Against Adversarial Examples

Adnan Siraj Rakin

169

05 Feb 2018

Learning Combinations of Activation Functions

Franco Manessi

A. Rozza

AI4CE

170

29 Jan 2018

Certified Defenses against Adversarial Examples

359

990

29 Jan 2018

Classification of sparsely labeled spatio-temporal data through semi-supervised adversarial learning

Atanas Mirchev

Seyed-Ahmad Ahmadi

GAN

170

26 Jan 2018

From Eliza to XiaoIce: Challenges and Opportunities with Social Chatbots

H. Shum

Xiaodong He

Di Li

237

591

06 Jan 2018

The CAPIO 2017 Conversational Speech Recognition System

Kyu Jeong Han

Akshay Chandrashekaran

Jungsuk Kim

Ian Lane

346

29 Dec 2017

Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning

764

2,084

15 Dec 2017

Building competitive direct acoustics-to-word models for English conversational speech recognition

Kartik Audhkhasi

Brian Kingsbury

Bhuvana Ramabhadran

G. Saon

M. Picheny

133

153

08 Dec 2017

Evaluating the Usability of Automatically Generated Captions for People who are Deaf or Hard of Hearing

Sushant Kafle

Matt Huenerfauth

109

06 Dec 2017

VoiceMask: Anonymize and Sanitize Voice Input on Mobile Devices

Jiahui Hou

137

30 Nov 2017

Multilingual Adaptation of RNN Based ASR Systems

Markus Müller

Sebastian Stüker

A. Waibel

174

13 Nov 2017

Phonemic and Graphemic Multilingual CTC Based Speech Recognition

Markus Müller

Sebastian Stüker

A. Waibel

107

13 Nov 2017

Robust Speech Recognition Using Generative Adversarial Networks

102

05 Nov 2017

Learning Filterbanks from Raw Speech for Phone Recognition

200

126

03 Nov 2017

Acoustic Landmarks Contain More Information About the Phone String than Other Frames for Automatic Speech Recognition with Deep Neural Network Acoustic ModelJournal of the Acoustical Society of America (JASA), 2017

27 Oct 2017

Language Modeling with Highway LSTM

Bhuvana Ramabhadran

136

19 Sep 2017

Joint Separation and Denoising of Noisy Multi-talker Speech using Recurrent Neural Networks and Permutation Invariant Training

163

31 Aug 2017

Comparing Human and Machine Errors in Conversational Speech Transcription

A. Stolcke

J. Droppo

136

29 Aug 2017

The Microsoft 2017 Conversational Speech Recognition System

208

478

21 Aug 2017

Future Word Contexts in Neural Network Language Models

Xie Chen

18 Aug 2017

An Improved Residual LSTM Architecture for Acoustic Modeling

17 Aug 2017

Lattice Long Short-Term Memory for Human Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2017

Bertram E. Shi

Silvio Savarese

148

167

13 Aug 2017

Exploring Neural Transducers for End-to-End Speech Recognition

Jitong Chen

...

177

233

24 Jul 2017

Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition

Zhehuai Chen

J. Droppo

Jinyu Li

Wayne Xiong

241

21 Jul 2017

Encoding Word Confusion Networks with Recurrent Neural Networks for Dialog State Tracking

Glorianna Jagfeld

Ngoc Thang Vu

168

18 Jul 2017

Speaker-independent Speech Separation with Deep Attractor Network

Yi Luo

Zhuo Chen

N. Mesgarani

264

261

12 Jul 2017

Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks

225

808

06 Jul 2017

Adversarial Example Defenses: Ensembles of Weak Defenses are not Strong

191

242

15 Jun 2017

On Calibration of Modern Neural NetworksInternational Conference on Machine Learning (ICML), 2017

1.8K

6,878

14 Jun 2017

Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent DevelopmentsACM Transactions on Intelligent Systems and Technology (TIST), 2017

Björn Schuller

271

328

30 May 2017

DeepXplore: Automated Whitebox Testing of Deep Learning Systems

501

1,465

18 May 2017

Reducing Bias in Production Speech Models

...

137

11 May 2017

A comprehensive study of batch construction strategies for recurrent neural networks in MXNet

P. Doetsch

Pavel Golik

Hermann Ney

117

05 May 2017

SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine

341

477

18 Apr 2017

Factorization tricks for LSTM networks

Oleksii Kuchaiev

Boris Ginsburg

219

121

31 Mar 2017

Simplified End-to-End MMI Training and Voting for ASR

L. Fritz

D. Burshtein

114

30 Mar 2017

Recognizing Multi-talker Speech with Permutation Invariant Training

Dong Yu

Xuankai Chang

Y. Qian

303

100

22 Mar 2017

Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks

298

758

18 Mar 2017

English Conversational Telephone Speech Recognition by Humans and Machines

Kartik Audhkhasi

...

Bhuvana Ramabhadran

211

371

06 Mar 2017

Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling

Xiangang Li

217

01 Mar 2017

Multitask Learning with CTC and Segmental CRF for Speech RecognitionInterspeech (Interspeech), 2017

Liang Lu

Lingpeng Kong

Chris Dyer

Noah A. Smith

179

21 Feb 2017

Deep Learning for Computational ChemistryJournal of Computational Chemistry (JCC), 2017

201

711

17 Jan 2017

Kernel Approximation Methods for Speech RecognitionJournal of machine learning research (JMLR), 2017

...

163

13 Jan 2017

Akid: A Library for Neural Network Research and Production from a Dataism Approach

Shuai Li

03 Jan 2017

Dense Prediction on Sequences with Time-Dilated Convolutions for Speech Recognition

Tom Sercu

Vaibhava Goel

VLM

206

28 Nov 2016

Unsupervised Pretraining for Sequence to Sequence Learning

271

289

08 Nov 2016

The Microsoft 2016 Conversational Speech Recognition System

241

291

12 Sep 2016

Using the Output Embedding to Improve Language ModelsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2016

Ofir Press

Lior Wolf

367

769

20 Aug 2016

Cognitive Science in the era of Artificial Intelligence: A roadmap for reverse-engineering the infant language-learner

Emmanuel Dupoux

284

175

29 Jul 2016