1610.05256
Achieving Human Parity in Conversational Speech Recognition
17 October 2016
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
Papers citing
"Achieving Human Parity in Conversational Speech Recognition"
50 / 201 papers shown
Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
Xuesong Yang
Kartik Audhkhasi
Andrew Rosenberg
Samuel Thomas
Bhuvana Ramabhadran
M. Hasegawa-Johnson
116
73
0
07 Feb 2018
Blind Pre-Processing: A Robust Defense Method Against Adversarial Examples
Adnan Siraj Rakin
Zhezhi He
Boqing Gong
Deliang Fan
AAML
169
4
0
05 Feb 2018
Learning Combinations of Activation Functions
Franco Manessi
A. Rozza
AI4CE
170
61
0
29 Jan 2018
Certified Defenses against Adversarial Examples
Aditi Raghunathan
Jacob Steinhardt
Abigail Z. Jacobs
AAML
359
990
0
29 Jan 2018
Classification of sparsely labeled spatio-temporal data through semi-supervised adversarial learning
Atanas Mirchev
Seyed-Ahmad Ahmadi
GAN
170
3
0
26 Jan 2018
From Eliza to XiaoIce: Challenges and Opportunities with Social Chatbots
H. Shum
Xiaodong He
Di Li
237
591
0
06 Jan 2018
The CAPIO 2017 Conversational Speech Recognition System
Kyu Jeong Han
Akshay Chandrashekaran
Jungsuk Kim
Ian Lane
346
74
0
29 Dec 2017
Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning
Xinyun Chen
Chang-rui Liu
Yue Liu
Kimberly Lu
Basel Alomair
AAML
SILM
764
2,084
0
15 Dec 2017
Building competitive direct acoustics-to-word models for English conversational speech recognition
Kartik Audhkhasi
Brian Kingsbury
Bhuvana Ramabhadran
G. Saon
M. Picheny
133
153
0
08 Dec 2017
Evaluating the Usability of Automatically Generated Captions for People who are Deaf or Hard of Hearing
Sushant Kafle
Matt Huenerfauth
109
58
0
06 Dec 2017
VoiceMask: Anonymize and Sanitize Voice Input on Mobile Devices
Jianwei Qian
Haohua Du
Jiahui Hou
Linlin Chen
Taeho Jung
Xiangyang Li
Yu Wang
Yanbo Deng
137
46
0
30 Nov 2017
Multilingual Adaptation of RNN Based ASR Systems
Markus Müller
Sebastian Stüker
A. Waibel
174
18
0
13 Nov 2017
Phonemic and Graphemic Multilingual CTC Based Speech Recognition
Markus Müller
Sebastian Stüker
A. Waibel
107
12
0
13 Nov 2017
Robust Speech Recognition Using Generative Adversarial Networks
Anuroop Sriram
Heewoo Jun
Yashesh Gaur
S. Satheesh
102
50
0
05 Nov 2017
Learning Filterbanks from Raw Speech for Phone Recognition
Neil Zeghidour
Nicolas Usunier
Iasonas Kokkinos
Thomas Schatz
Gabriel Synnaeve
Emmanuel Dupoux
200
126
0
03 Nov 2017
Acoustic Landmarks Contain More Information About the Phone String than Other Frames for Automatic Speech Recognition with Deep Neural Network Acoustic Model
Journal of the Acoustical Society of America (JASA), 2017
Di He
Boon Pang Lim
Xuesong Yang
M. Hasegawa-Johnson
Deming Chen
92
10
0
27 Oct 2017
Language Modeling with Highway LSTM
Gakuto Kurata
Bhuvana Ramabhadran
G. Saon
A. Sethy
AI4TS
136
39
0
19 Sep 2017
Joint Separation and Denoising of Noisy Multi-talker Speech using Recurrent Neural Networks and Permutation Invariant Training
Morten Kolbæk
Dong Yu
Zheng-Hua Tan
Jesper Jensen
163
22
0
31 Aug 2017
Comparing Human and Machine Errors in Conversational Speech Transcription
A. Stolcke
J. Droppo
136
70
0
29 Aug 2017
The Microsoft 2017 Conversational Speech Recognition System
Wayne Xiong
Lingfeng Wu
F. Alleva
J. Droppo
Xuedong Huang
A. Stolcke
208
478
0
21 Aug 2017
Future Word Contexts in Neural Network Language Models
Xie Chen
Xunying Liu
Anton Ragni
Yu Wang
Mark Gales
55
23
0
18 Aug 2017
An Improved Residual LSTM Architecture for Acoustic Modeling
Lu Huang
Jiasong Sun
Ji Xu
Yi Yang
KELM
98
17
0
17 Aug 2017
Lattice Long Short-Term Memory for Human Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2017
Lin Sun
Kui Jia
Kevin Chen
Dit-Yan Yeung
Bertram E. Shi
Silvio Savarese
148
167
0
13 Aug 2017
Exploring Neural Transducers for End-to-End Speech Recognition
Eric Battenberg
Jitong Chen
R. Child
Adam Coates
Yashesh Gaur
Yi Li
...
Hairong Liu
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
AI4TS
177
233
0
24 Jul 2017
Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition
Zhehuai Chen
J. Droppo
Jinyu Li
Wayne Xiong
241
66
0
21 Jul 2017
Encoding Word Confusion Networks with Recurrent Neural Networks for Dialog State Tracking
Glorianna Jagfeld
Ngoc Thang Vu
168
12
0
18 Jul 2017
Speaker-independent Speech Separation with Deep Attractor Network
Yi Luo
Zhuo Chen
N. Mesgarani
264
261
0
12 Jul 2017
Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks
Pranav Rajpurkar
Awni Y. Hannun
Masoumeh Haghpanahi
Codie Bourn
A. Ng
225
808
0
06 Jul 2017
Adversarial Example Defenses: Ensembles of Weak Defenses are not Strong
Warren He
James Wei
Xinyun Chen
Nicholas Carlini
Basel Alomair
AAML
191
242
0
15 Jun 2017
On Calibration of Modern Neural Networks
International Conference on Machine Learning (ICML), 2017
Chuan Guo
Geoff Pleiss
Yu Sun
Kilian Q. Weinberger
UQCV
1.8K
6,878
0
14 Jun 2017
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments
ACM Transactions on Intelligent Systems and Technology (TIST), 2017
Zixing Zhang
Jürgen T. Geiger
Jouni Pohjalainen
A. Mousa
Wenyu Jin
Björn Schuller
271
328
0
30 May 2017
DeepXplore: Automated Whitebox Testing of Deep Learning Systems
Kexin Pei
Yinzhi Cao
Junfeng Yang
Suman Jana
AAML
501
1,465
0
18 May 2017
Reducing Bias in Production Speech Models
Eric Battenberg
R. Child
Adam Coates
Christopher Fougner
Yashesh Gaur
...
Vinay Rao
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
137
10
0
11 May 2017
A comprehensive study of batch construction strategies for recurrent neural networks in MXNet
P. Doetsch
Pavel Golik
Hermann Ney
117
17
0
05 May 2017
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
Matthew Dunn
Levent Sagun
Mike Higgins
V. U. Güney
Volkan Cirik
Dong Wang
RALM
341
477
0
18 Apr 2017
Factorization tricks for LSTM networks
Oleksii Kuchaiev
Boris Ginsburg
219
121
0
31 Mar 2017
Simplified End-to-End MMI Training and Voting for ASR
L. Fritz
D. Burshtein
114
3
0
30 Mar 2017
Recognizing Multi-talker Speech with Permutation Invariant Training
Dong Yu
Xuankai Chang
Y. Qian
303
100
0
22 Mar 2017
Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks
Morten Kolbæk
Dong Yu
Zheng-Hua Tan
Jesper Jensen
298
758
0
18 Mar 2017
English Conversational Telephone Speech Recognition by Humans and Machines
G. Saon
Gakuto Kurata
Tom Sercu
Kartik Audhkhasi
Samuel Thomas
...
Bhuvana Ramabhadran
M. Picheny
L. Lim
Bergul Roomi
Phil Hall
211
371
0
06 Mar 2017
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
Hairong Liu
Zhenyao Zhu
Xiangang Li
S. Satheesh
VLM
217
56
0
01 Mar 2017
Multitask Learning with CTC and Segmental CRF for Speech Recognition
Interspeech, 2017
Liang Lu
Lingpeng Kong
Chris Dyer
Noah A. Smith
179
24
0
21 Feb 2017
Deep Learning for Computational Chemistry
Journal of Computational Chemistry (JCC), 2017
Garrett B. Goh
Nathan Oken Hodas
Abhinav Vishnu
AI4CE
201
711
0
17 Jan 2017
Kernel Approximation Methods for Speech Recognition
Journal of Machine Learning Research (JMLR), 2017
Avner May
A. Garakani
Zhiyun Lu
Dong Guo
Kuan Liu
...
Michael Collins
Daniel J. Hsu
Brian Kingsbury
M. Picheny
Fei Sha
163
47
0
13 Jan 2017
Akid: A Library for Neural Network Research and Production from a Dataism Approach
Shuai Li
76
0
0
03 Jan 2017
Dense Prediction on Sequences with Time-Dilated Convolutions for Speech Recognition
Tom Sercu
Vaibhava Goel
VLM
206
58
0
28 Nov 2016
Unsupervised Pretraining for Sequence to Sequence Learning
Prajit Ramachandran
Peter J. Liu
Quoc V. Le
SSL
AIMat
271
289
0
08 Nov 2016
The Microsoft 2016 Conversational Speech Recognition System
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
241
291
0
12 Sep 2016
Using the Output Embedding to Improve Language Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2016
Ofir Press
Lior Wolf
367
769
0
20 Aug 2016
Cognitive Science in the era of Artificial Intelligence: A roadmap for reverse-engineering the infant language-learner
Emmanuel Dupoux
284
175
0
29 Jul 2016