Achieving Human Parity in Conversational Speech Recognition

17 October 2016
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig

Papers citing "Achieving Human Parity in Conversational Speech Recognition"

50 / 201 papers shown
Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
Xuesong Yang
Kartik Audhkhasi
Andrew Rosenberg
Samuel Thomas
Bhuvana Ramabhadran
M. Hasegawa-Johnson
116
73
0
07 Feb 2018
Blind Pre-Processing: A Robust Defense Method Against Adversarial Examples
Adnan Siraj Rakin
Zhezhi He
Boqing Gong
Deliang Fan
AAML
169
4
0
05 Feb 2018
Learning Combinations of Activation Functions
Franco Manessi
A. Rozza
AI4CE
170
61
0
29 Jan 2018
Certified Defenses against Adversarial Examples
Aditi Raghunathan
Jacob Steinhardt
Abigail Z. Jacobs
AAML
359
990
0
29 Jan 2018
Classification of sparsely labeled spatio-temporal data through semi-supervised adversarial learning
Atanas Mirchev
Seyed-Ahmad Ahmadi
GAN
170
3
0
26 Jan 2018
From Eliza to XiaoIce: Challenges and Opportunities with Social Chatbots
H. Shum
Xiaodong He
Di Li
237
591
0
06 Jan 2018
The CAPIO 2017 Conversational Speech Recognition System
Kyu Jeong Han
Akshay Chandrashekaran
Jungsuk Kim
Ian Lane
346
74
0
29 Dec 2017
Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning
Xinyun Chen
Chang-rui Liu
Yue Liu
Kimberly Lu
Basel Alomair
AAML
SILM
764
2,084
0
15 Dec 2017
Building competitive direct acoustics-to-word models for English conversational speech recognition
Kartik Audhkhasi
Brian Kingsbury
Bhuvana Ramabhadran
G. Saon
M. Picheny
133
153
0
08 Dec 2017
Evaluating the Usability of Automatically Generated Captions for People who are Deaf or Hard of Hearing
Sushant Kafle
Matt Huenerfauth
109
58
0
06 Dec 2017
VoiceMask: Anonymize and Sanitize Voice Input on Mobile Devices
Jianwei Qian
Haohua Du
Jiahui Hou
Linlin Chen
Taeho Jung
Xiangyang Li
Yu Wang
Yanbo Deng
137
46
0
30 Nov 2017
Multilingual Adaptation of RNN Based ASR Systems
Markus Müller
Sebastian Stüker
A. Waibel
174
18
0
13 Nov 2017
Phonemic and Graphemic Multilingual CTC Based Speech Recognition
Markus Müller
Sebastian Stüker
A. Waibel
107
12
0
13 Nov 2017
Robust Speech Recognition Using Generative Adversarial Networks
Anuroop Sriram
Heewoo Jun
Yashesh Gaur
S. Satheesh
102
50
0
05 Nov 2017
Learning Filterbanks from Raw Speech for Phone Recognition
Neil Zeghidour
Nicolas Usunier
Iasonas Kokkinos
Thomas Schatz
Gabriel Synnaeve
Emmanuel Dupoux
200
126
0
03 Nov 2017
Acoustic Landmarks Contain More Information About the Phone String than Other Frames for Automatic Speech Recognition with Deep Neural Network Acoustic Model
Journal of the Acoustical Society of America (JASA), 2017
Di He
Boon Pang Lim
Xuesong Yang
M. Hasegawa-Johnson
Deming Chen
92
10
0
27 Oct 2017
Language Modeling with Highway LSTM
Gakuto Kurata
Bhuvana Ramabhadran
G. Saon
A. Sethy
AI4TS
136
39
0
19 Sep 2017
Joint Separation and Denoising of Noisy Multi-talker Speech using Recurrent Neural Networks and Permutation Invariant Training
Morten Kolbæk
Dong Yu
Zheng-Hua Tan
Jesper Jensen
163
22
0
31 Aug 2017
Comparing Human and Machine Errors in Conversational Speech Transcription
A. Stolcke
J. Droppo
136
70
0
29 Aug 2017
The Microsoft 2017 Conversational Speech Recognition System
Wayne Xiong
Lingfeng Wu
F. Alleva
J. Droppo
Xuedong Huang
A. Stolcke
208
478
0
21 Aug 2017
Future Word Contexts in Neural Network Language Models
Xie Chen
Xunying Liu
Anton Ragni
Yu Wang
Mark Gales
55
23
0
18 Aug 2017
An Improved Residual LSTM Architecture for Acoustic Modeling
Lu Huang
Jiasong Sun
Ji Xu
Yi Yang
KELM
98
17
0
17 Aug 2017
Lattice Long Short-Term Memory for Human Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2017
Lin Sun
Kui Jia
Kevin Chen
Dit-Yan Yeung
Bertram E. Shi
Silvio Savarese
148
167
0
13 Aug 2017
Exploring Neural Transducers for End-to-End Speech Recognition
Eric Battenberg
Jitong Chen
R. Child
Adam Coates
Yashesh Gaur
Yi Li
...
Hairong Liu
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
AI4TS
177
233
0
24 Jul 2017
Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition
Zhehuai Chen
J. Droppo
Jinyu Li
Wayne Xiong
241
66
0
21 Jul 2017
Encoding Word Confusion Networks with Recurrent Neural Networks for Dialog State Tracking
Glorianna Jagfeld
Ngoc Thang Vu
168
12
0
18 Jul 2017
Speaker-independent Speech Separation with Deep Attractor Network
Yi Luo
Zhuo Chen
N. Mesgarani
264
261
0
12 Jul 2017
Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks
Pranav Rajpurkar
Awni Y. Hannun
Masoumeh Haghpanahi
Codie Bourn
A. Ng
225
808
0
06 Jul 2017
Adversarial Example Defenses: Ensembles of Weak Defenses are not Strong
Warren He
James Wei
Xinyun Chen
Nicholas Carlini
Basel Alomair
AAML
191
242
0
15 Jun 2017
On Calibration of Modern Neural Networks
International Conference on Machine Learning (ICML), 2017
Chuan Guo
Geoff Pleiss
Yu Sun
Kilian Q. Weinberger
UQCV
1.8K
6,878
0
14 Jun 2017
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments
ACM Transactions on Intelligent Systems and Technology (TIST), 2017
Zixing Zhang
Jürgen T. Geiger
Jouni Pohjalainen
A. Mousa
Wenyu Jin
Björn Schuller
271
328
0
30 May 2017
DeepXplore: Automated Whitebox Testing of Deep Learning Systems
Kexin Pei
Yinzhi Cao
Junfeng Yang
Suman Jana
AAML
501
1,465
0
18 May 2017
Reducing Bias in Production Speech Models
Eric Battenberg
R. Child
Adam Coates
Christopher Fougner
Yashesh Gaur
...
Vinay Rao
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
137
10
0
11 May 2017
A comprehensive study of batch construction strategies for recurrent neural networks in MXNet
P. Doetsch
Pavel Golik
Hermann Ney
117
17
0
05 May 2017
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
Matthew Dunn
Levent Sagun
Mike Higgins
V. U. Güney
Volkan Cirik
Dong Wang
RALM
341
477
0
18 Apr 2017
Factorization tricks for LSTM networks
Oleksii Kuchaiev
Boris Ginsburg
219
121
0
31 Mar 2017
Simplified End-to-End MMI Training and Voting for ASR
L. Fritz
D. Burshtein
114
3
0
30 Mar 2017
Recognizing Multi-talker Speech with Permutation Invariant Training
Dong Yu
Xuankai Chang
Y. Qian
303
100
0
22 Mar 2017
Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks
Morten Kolbaek
Dong Yu
Zheng-Hua Tan
Jesper Jensen
298
758
0
18 Mar 2017
English Conversational Telephone Speech Recognition by Humans and Machines
G. Saon
Gakuto Kurata
Tom Sercu
Kartik Audhkhasi
Samuel Thomas
...
Bhuvana Ramabhadran
M. Picheny
L. Lim
Bergul Roomi
Phil Hall
211
371
0
06 Mar 2017
Gram-CTC: Automatic Unit Selection and Target Decomposition for Sequence Labelling
Hairong Liu
Zhenyao Zhu
Xiangang Li
S. Satheesh
VLM
217
56
0
01 Mar 2017
Multitask Learning with CTC and Segmental CRF for Speech Recognition
Interspeech, 2017
Liang Lu
Lingpeng Kong
Chris Dyer
Noah A. Smith
179
24
0
21 Feb 2017
Deep Learning for Computational Chemistry
Journal of Computational Chemistry (JCC), 2017
Garrett B. Goh
Nathan Oken Hodas
Abhinav Vishnu
AI4CE
201
711
0
17 Jan 2017
Kernel Approximation Methods for Speech Recognition
Journal of Machine Learning Research (JMLR), 2017
Avner May
A. Garakani
Zhiyun Lu
Dong Guo
Kuan Liu
...
Michael Collins
Daniel J. Hsu
Brian Kingsbury
M. Picheny
Fei Sha
163
47
0
13 Jan 2017
Akid: A Library for Neural Network Research and Production from a Dataism Approach
Shuai Li
76
0
0
03 Jan 2017
Dense Prediction on Sequences with Time-Dilated Convolutions for Speech Recognition
Tom Sercu
Vaibhava Goel
VLM
206
58
0
28 Nov 2016
Unsupervised Pretraining for Sequence to Sequence Learning
Prajit Ramachandran
Peter J. Liu
Quoc V. Le
SSL
AIMat
271
289
0
08 Nov 2016
The Microsoft 2016 Conversational Speech Recognition System
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
241
291
0
12 Sep 2016
Using the Output Embedding to Improve Language Models
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2016
Ofir Press
Lior Wolf
367
769
0
20 Aug 2016
Cognitive Science in the era of Artificial Intelligence: A roadmap for reverse-engineering the infant language-learner
Emmanuel Dupoux
284
175
0
29 Jul 2016