ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1610.05256
  4. Cited By
Achieving Human Parity in Conversational Speech Recognition
v1v2 (latest)

Achieving Human Parity in Conversational Speech Recognition

17 October 2016
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
ArXiv (abs)PDFHTML

Papers citing "Achieving Human Parity in Conversational Speech Recognition"

50 / 201 papers shown
Multilingual Bottleneck Features for Query by Example Spoken Term
  Detection
Multilingual Bottleneck Features for Query by Example Spoken Term DetectionAutomatic Speech Recognition & Understanding (ASRU), 2019
Dhananjay Ram
Lesly Miculicich
H. Bourlard
65
21
0
30 Jun 2019
Auxiliary Interference Speaker Loss for Target-Speaker Speech
  Recognition
Auxiliary Interference Speaker Loss for Target-Speaker Speech RecognitionInterspeech (Interspeech), 2019
Naoyuki Kanda
Shota Horiguchi
R. Takashima
Yusuke Fujita
Kenji Nagamatsu
Shinji Watanabe
166
36
0
26 Jun 2019
Deep Xi as a Front-End for Robust Automatic Speech Recognition
Deep Xi as a Front-End for Robust Automatic Speech Recognition
Aaron Nicolson
K. Paliwal
120
12
0
18 Jun 2019
(Pen-) Ultimate DNN Pruning
(Pen-) Ultimate DNN Pruning
Marc Riera
J. Arnau
Antonio González
CVBM
71
1
0
06 Jun 2019
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn
  University Joint Investigation for Dinner Party ASR
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASRInterspeech (Interspeech), 2019
Naoyuki Kanda
Christoph Boeddeker
Jens Heitkaemper
Yusuke Fujita
Shota Horiguchi
Kenji Nagamatsu
Reinhold Häb-Umbach
195
62
0
29 May 2019
Multi-Class Gaussian Process Classification Made Conjugate: Efficient
  Inference via Data Augmentation
Multi-Class Gaussian Process Classification Made Conjugate: Efficient Inference via Data AugmentationConference on Uncertainty in Artificial Intelligence (UAI), 2019
Théo Galy-Fajou
F. Wenzel
Christian Donner
Manfred Opper
156
31
0
23 May 2019
Meeting Transcription Using Virtual Microphone Arrays
Meeting Transcription Using Virtual Microphone Arrays
Takuya Yoshioka
Zhuo Chen
Dimitrios Dimitriadis
William Fu-Hinthorn
Xuedong Huang
A. Stolcke
Michael Zeng
173
15
0
03 May 2019
A Comparison of Online Automatic Speech Recognition Systems and the
  Nonverbal Responses to Unintelligible Speech
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech
Joshua Y. Kim
Chunfeng Liu
R. Calvo
K. McCabe
Silas C. R. Taylor
Björn W. Schuller
Kaihang Wu
89
41
0
29 Apr 2019
Natural Language Interactions in Autonomous Vehicles: Intent Detection
  and Slot Filling from Passenger Utterances
Natural Language Interactions in Autonomous Vehicles: Intent Detection and Slot Filling from Passenger Utterances
Eda Okur
Shachi H. Kumar
Saurav Sahay
Asli Arslan Esme
L. Nachman
82
19
0
23 Apr 2019
Disfluencies and Human Speech Transcription Errors
Disfluencies and Human Speech Transcription Errors
Vicky Zayats
Trang Tran
Richard A. Wright
Courtney Mansfield
Mari Ostendorf
141
45
0
08 Apr 2019
How to Prove Your Model Belongs to You: A Blind-Watermark based
  Framework to Protect Intellectual Property of DNN
How to Prove Your Model Belongs to You: A Blind-Watermark based Framework to Protect Intellectual Property of DNNAsia-Pacific Computer Systems Architecture Conference (APCSAC), 2019
Zheng Li
Chengyu Hu
Yang Zhang
Shanqing Guo
AAML
213
196
0
05 Mar 2019
Neural network gradient-based learning of black-box function interfaces
Neural network gradient-based learning of black-box function interfaces
Alon Jacovi
Guy Hadash
Einat Kermany
Boaz Carmeli
Ofer Lavi
George Kour
Jonathan Berant
117
14
0
13 Jan 2019
Advancing the State of the Art in Open Domain Dialog Systems through the
  Alexa Prize
Advancing the State of the Art in Open Domain Dialog Systems through the Alexa Prize
Chandra Khatri
Behnam Hedayatnia
Anu Venkatesh
J. Nunn
Yi Pan
...
Dilek Z. Hakkani-Tür
Gene Hwang
Nate Michel
Eric King
R. Prasad
LRM
173
87
0
27 Dec 2018
Unsupervised Speech Recognition via Segmental Empirical Output
  Distribution Matching
Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching
Chih-Kuan Yeh
Jianshu Chen
Chengzhu Yu
Dong Yu
167
41
0
23 Dec 2018
Adversarial Sample Detection for Deep Neural Network through Model
  Mutation Testing
Adversarial Sample Detection for Deep Neural Network through Model Mutation Testing
Jingyi Wang
Guoliang Dong
Jun Sun
Xinyu Wang
Peixin Zhang
AAML
224
202
0
14 Dec 2018
Feature Extraction for Temporal Signal Recognition: An Overview
Feature Extraction for Temporal Signal Recognition: An Overview
Imad Rida
77
14
0
03 Dec 2018
A Method for Analysis of Patient Speech in Dialogue for Dementia
  Detection
A Method for Analysis of Patient Speech in Dialogue for Dementia Detection
Saturnino Luz
S. D. L. Fuente
Pierre Albert
89
54
0
25 Nov 2018
Concept Learning through Deep Reinforcement Learning with
  Memory-Augmented Neural Networks
Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks
Jing Shi
Jiaming Xu
Yiqun Yao
Bo Xu
143
27
0
15 Nov 2018
Analyzing deep CNN-based utterance embeddings for acoustic model
  adaptation
Analyzing deep CNN-based utterance embeddings for acoustic model adaptation
Joanna Rownicka
P. Bell
Steve Renals
135
9
0
12 Nov 2018
A Comparison of Lattice-free Discriminative Training Criteria for Purely
  Sequence-Trained Neural Network Acoustic Models
A Comparison of Lattice-free Discriminative Training Criteria for Purely Sequence-Trained Neural Network Acoustic ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018
Chao Weng
Manway Liu
126
6
0
08 Nov 2018
Confusion2Vec: Towards Enriching Vector Space Word Representations with
  Representational Ambiguities
Confusion2Vec: Towards Enriching Vector Space Word Representations with Representational AmbiguitiesPeerJ Computer Science (PeerJ CS), 2018
K. K. Thekumparampil
Zinan Lin
189
23
0
08 Nov 2018
When CTC Training Meets Acoustic Landmarks
When CTC Training Meets Acoustic Landmarks
Di He
Xuesong Yang
Johan Rohdin
Yi Liang
Themos Stafylakis
Deming Chen
99
11
0
05 Nov 2018
Neural Task Representations as Weak Supervision for Model Agnostic
  Cross-Lingual Transfer
Neural Task Representations as Weak Supervision for Model Agnostic Cross-Lingual Transfer
S. Jauhar
Michael Gamon
Patrick Pantel
180
3
0
02 Nov 2018
Spoken Language Understanding on the Edge
Spoken Language Understanding on the Edge
Alaa Saade
A. Coucke
A. Caulier
Joseph Dureau
Adrien Ball
...
Clément Doumouro
Thibault Gisselbrecht
F. Caltagirone
Thibaut Lavril
Maël Primet
306
68
0
30 Oct 2018
The Airbus Air Traffic Control speech recognition 2018 challenge:
  towards ATC automatic transcription and call sign detection
The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection
Thomas Pellegrini
Jérôme Farinas
Estelle Delpech
François Lancelot
DRL
144
52
0
30 Oct 2018
Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech
  Recognition
Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition
Xinpei Zhou
Jiwei Li
Xi Zhou
86
4
0
29 Oct 2018
Deep multi-survey classification of variable stars
Deep multi-survey classification of variable stars
Carlos Aguirre
K. Pichara
I. Becker
134
35
0
21 Oct 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation
  Approach Using Neural Networks
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
150
87
0
08 Oct 2018
Adversarial Examples - A Complete Characterisation of the Phenomenon
Adversarial Examples - A Complete Characterisation of the Phenomenon
A. Serban
E. Poll
Joost Visser
SILMAAML
249
49
0
02 Oct 2018
Capacity Control of ReLU Neural Networks by Basis-path Norm
Capacity Control of ReLU Neural Networks by Basis-path Norm
Shuxin Zheng
Qi Meng
Huishuai Zhang
Wei-neng Chen
Nenghai Yu
Tie-Yan Liu
156
22
0
19 Sep 2018
Single-Microphone Speech Enhancement and Separation Using Deep Learning
Single-Microphone Speech Enhancement and Separation Using Deep Learning
Morten Kolbaek
182
7
0
31 Aug 2018
Nonsense Attacks on Google Assistant
Nonsense Attacks on Google Assistant
M. Bispham
Ioannis Agrafiotis
M. Goldsmith
AAML
51
7
0
06 Aug 2018
DeepCloak: Adversarial Crafting As a Defensive Measure to Cloak
  Processes
DeepCloak: Adversarial Crafting As a Defensive Measure to Cloak Processes
Mehmet Sinan Inci
T. Eisenbarth
B. Sunar
AAML
162
8
0
03 Aug 2018
Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech
  Recognition
Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition
Pavel Denisov
Ngoc Thang Vu
Marc Ferras
110
18
0
30 Jul 2018
Improving Electron Micrograph Signal-to-Noise with an Atrous
  Convolutional Encoder-Decoder
Improving Electron Micrograph Signal-to-Noise with an Atrous Convolutional Encoder-Decoder
Jeffrey M. Ede
88
1
0
30 Jul 2018
Gradient Band-based Adversarial Training for Generalized Attack Immunity
  of A3C Path Finding
Gradient Band-based Adversarial Training for Generalized Attack Immunity of A3C Path Finding
Tong Chen
Wenjia Niu
Yingxiao Xiang
XiaoXuan Bai
Jiqiang Liu
Zhen Han
Gang Li
AAML
133
26
0
18 Jul 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for
  Visual and Speech Recognition
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
252
100
0
10 Jul 2018
A Simple Method for Commonsense Reasoning
A Simple Method for Commonsense Reasoning
Trieu H. Trinh
Quoc V. Le
LRMReLM
430
454
0
07 Jun 2018
Snips Voice Platform: an embedded Spoken Language Understanding system
  for private-by-design voice interfaces
Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces
A. Coucke
Alaa Saade
Adrien Ball
Théodore Bluche
A. Caulier
...
Thibault Gisselbrecht
F. Caltagirone
Thibaut Lavril
Maël Primet
Joseph Dureau
SyDa
421
887
0
25 May 2018
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Recent Progresses in Deep Learning based Acoustic Models (Updated)
Dong Yu
Jinyu Li
VLM
218
163
0
25 Apr 2018
Estimate and Replace: A Novel Approach to Integrating Deep Neural
  Networks with Existing Applications
Estimate and Replace: A Novel Approach to Integrating Deep Neural Networks with Existing Applications
Guy Hadash
Einat Kermany
Boaz Carmeli
Ofer Lavi
George Kour
Alon Jacovi
AI4TS
111
49
0
24 Apr 2018
Automatic speech recognition for launch control center communication
  using recurrent neural networks with data augmentation and custom language
  model
Automatic speech recognition for launch control center communication using recurrent neural networks with data augmentation and custom language model
Kyongsik Yun
Joseph Osborne
Madison Lee
Thomas Lu
Edward Chow
134
5
0
24 Apr 2018
Low-Precision Floating-Point Schemes for Neural Network Training
Low-Precision Floating-Point Schemes for Neural Network Training
Marc Ortiz
A. Cristal
Eduard Ayguadé
Marc Casas
MQ
144
22
0
14 Apr 2018
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of
  Untranscribed Speech
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech
David Harwath
Galen Chuang
James R. Glass
147
60
0
09 Apr 2018
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset,
  task and baselines
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
Jon Barker
Shinji Watanabe
Emmanuel Vincent
J. Trmal
184
709
0
28 Mar 2018
Detecting Adversarial Examples via Neural Fingerprinting
Detecting Adversarial Examples via Neural Fingerprinting
Sumanth Dathathri
Stephan Zheng
Tianwei Yin
Richard M. Murray
Yisong Yue
MLAUAAML
163
0
0
11 Mar 2018
Deep-FSMN for Large Vocabulary Continuous Speech Recognition
Deep-FSMN for Large Vocabulary Continuous Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
Lirong Dai
184
123
0
04 Mar 2018
Trustless Machine Learning Contracts; Evaluating and Exchanging Machine
  Learning Models on the Ethereum Blockchain
Trustless Machine Learning Contracts; Evaluating and Exchanging Machine Learning Models on the Ethereum Blockchain
A. Krizhevsky
Geoffrey E. Hinton
SyDa
123
112
0
27 Feb 2018
Sequence-based Multi-lingual Low Resource Speech Recognition
Sequence-based Multi-lingual Low Resource Speech Recognition
Siddharth Dalmia
Ramon Sanabria
Florian Metze
A. Black
143
97
0
21 Feb 2018
Sample Efficient Deep Reinforcement Learning for Dialogue Systems with
  Large Action Spaces
Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces
Gellert Weisz
Paweł Budzianowski
Pei-hao Su
Milica Gasic
140
87
0
11 Feb 2018
Previous
12345
Next
Page 3 of 5