ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1610.05256
  4. Cited By
Achieving Human Parity in Conversational Speech Recognition

Achieving Human Parity in Conversational Speech Recognition

17 October 2016
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
ArXivPDFHTML

Papers citing "Achieving Human Parity in Conversational Speech Recognition"

50 / 54 papers shown
Title
Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling
Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling
Michael McGuire
47
0
0
10 Mar 2025
Automatic speech recognition for the Nepali language using CNN,
  bidirectional LSTM and ResNet
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet
Manish Dhakal
Arman Chhetri
Aman Kumar Gupta
Prabin B. Lamichhane
S. Pandey
S. Shakya
AI4TS
25
10
0
25 Jun 2024
Tag and correct: high precision post-editing approach to correction of
  speech recognition errors
Tag and correct: high precision post-editing approach to correction of speech recognition errors
Tomasz Ziętkiewicz
26
0
0
11 Jun 2024
Lattice Rescoring Based on Large Ensemble of Complementary Neural
  Language Models
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models
A. Ogawa
Naohiro Tawara
Marc Delcroix
S. Araki
27
3
0
20 Dec 2023
Beating Backdoor Attack at Its Own Game
Beating Backdoor Attack at Its Own Game
Min Liu
Alberto L. Sangiovanni-Vincentelli
Xiangyu Yue
AAML
65
11
0
28 Jul 2023
Multilingual Word Error Rate Estimation: e-WER3
Multilingual Word Error Rate Estimation: e-WER3
Shammur A. Chowdhury
Ahmed M. Ali
16
7
0
02 Apr 2023
From User Perceptions to Technical Improvement: Enabling People Who
  Stutter to Better Use Speech Recognition
From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition
Colin S. Lea
Zifang Huang
Lauren Tooley
Jaya Narain
Dianna Yee
P. Georgiou
Dung Tien Tran
Jeffrey P. Bigham
Leah Findlater
24
31
0
17 Feb 2023
A Survey of Robust Adversarial Training in Pattern Recognition:
  Fundamental, Theory, and Methodologies
A Survey of Robust Adversarial Training in Pattern Recognition: Fundamental, Theory, and Methodologies
Zhuang Qian
Kaizhu Huang
Qiufeng Wang
Xu-Yao Zhang
OOD
AAML
ObjD
49
71
0
26 Mar 2022
DeepSketch: A New Machine Learning-Based Reference Search Technique for
  Post-Deduplication Delta Compression
DeepSketch: A New Machine Learning-Based Reference Search Technique for Post-Deduplication Delta Compression
Jisung Park
Jeoggyun Kim
Yeseong Kim
Sungjin Lee
O. Mutlu
13
23
0
17 Feb 2022
Robust Self-Supervised Audio-Visual Speech Recognition
Robust Self-Supervised Audio-Visual Speech Recognition
Bowen Shi
Wei-Ning Hsu
Abdel-rahman Mohamed
24
90
0
05 Jan 2022
DeepSteal: Advanced Model Extractions Leveraging Efficient Weight
  Stealing in Memories
DeepSteal: Advanced Model Extractions Leveraging Efficient Weight Stealing in Memories
Adnan Siraj Rakin
Md Hafizul Islam Chowdhuryy
Fan Yao
Deliang Fan
AAML
MIACV
36
110
0
08 Nov 2021
Cross-utterance Reranking Models with BERT and Graph Convolutional
  Networks for Conversational Speech Recognition
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition
Shih-Hsuan Chiu
Tien-Hong Lo
Fu-An Chao
Berlin Chen
BDL
27
10
0
13 Jun 2021
On Feature Decorrelation in Self-Supervised Learning
On Feature Decorrelation in Self-Supervised Learning
Tianyu Hua
Wenxiao Wang
Zihui Xue
Sucheng Ren
Yue Wang
Hang Zhao
SSL
OOD
119
187
0
02 May 2021
Dompteur: Taming Audio Adversarial Examples
Dompteur: Taming Audio Adversarial Examples
Thorsten Eisenhofer
Lea Schonherr
Joel Frank
Lars Speckemeier
D. Kolossa
Thorsten Holz
AAML
28
24
0
10 Feb 2021
Deep-Dup: An Adversarial Weight Duplication Attack Framework to Crush
  Deep Neural Network in Multi-Tenant FPGA
Deep-Dup: An Adversarial Weight Duplication Attack Framework to Crush Deep Neural Network in Multi-Tenant FPGA
Adnan Siraj Rakin
Yukui Luo
Xiaolin Xu
Deliang Fan
AAML
11
49
0
05 Nov 2020
Review: Deep Learning in Electron Microscopy
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
26
79
0
17 Sep 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6
  Challenge
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Ashish Arora
Desh Raj
Aswin Shanmugam Subramanian
Ke Li
Bar Ben Yair
Matthew Maciejewski
Piotr Żelasko
Leibny Paola García-Perera
Shinji Watanabe
Sanjeev Khudanpur
28
9
0
14 Jun 2020
Large scale weakly and semi-supervised learning for low-resource video
  ASR
Large scale weakly and semi-supervised learning for low-resource video ASR
Kritika Singh
Vimal Manohar
Alex Xiao
Sergey Edunov
Ross B. Girshick
Vitaliy Liptchinsky
Christian Fuegen
Yatharth Saraf
Geoffrey Zweig
Abdel-rahman Mohamed
23
9
0
16 May 2020
DeepHammer: Depleting the Intelligence of Deep Neural Networks through
  Targeted Chain of Bit Flips
DeepHammer: Depleting the Intelligence of Deep Neural Networks through Targeted Chain of Bit Flips
Fan Yao
Adnan Siraj Rakin
Deliang Fan
AAML
18
154
0
30 Mar 2020
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM
  Networks
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks
Théodore Bluche
Maël Primet
Thibault Gisselbrecht
ObjD
MQ
18
24
0
25 Feb 2020
A simple way to make neural networks robust against diverse image
  corruptions
A simple way to make neural networks robust against diverse image corruptions
E. Rusak
Lukas Schott
Roland S. Zimmermann
Julian Bitterwolf
Oliver Bringmann
Matthias Bethge
Wieland Brendel
19
64
0
16 Jan 2020
Predicting detection filters for small footprint open-vocabulary keyword
  spotting
Predicting detection filters for small footprint open-vocabulary keyword spotting
Théodore Bluche
Thibault Gisselbrecht
ObjD
16
19
0
16 Dec 2019
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems
  With Limited Data
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems With Limited Data
Xinyun Chen
Wenxiao Wang
Chris Bender
Yiming Ding
R. Jia
Bo-wen Li
D. Song
AAML
19
106
0
17 Nov 2019
Transformer-Transducer: End-to-End Speech Recognition with
  Self-Attention
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention
Ching-Feng Yeh
Jay Mahadeokar
Kaustubh Kalgaonkar
Yongqiang Wang
Duc Le
Mahaveer Jain
Kjell Schubert
Christian Fuegen
M. Seltzer
11
147
0
28 Oct 2019
Domain Expansion in DNN-based Acoustic Models for Robust Speech
  Recognition
Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition
Shahram Ghorbani
S. Khorram
John H. L. Hansen
21
18
0
01 Oct 2019
Survey on Deep Neural Networks in Speech and Vision Systems
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DV
VLM
AI4TS
31
205
0
16 Aug 2019
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn
  University Joint Investigation for Dinner Party ASR
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR
Naoyuki Kanda
Christoph Boeddeker
Jens Heitkaemper
Yusuke Fujita
Shota Horiguchi
Kenji Nagamatsu
Reinhold Häb-Umbach
13
61
0
29 May 2019
A Comparison of Online Automatic Speech Recognition Systems and the
  Nonverbal Responses to Unintelligible Speech
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech
Joshua Y. Kim
Chunfeng Liu
R. Calvo
K. McCabe
Silas C. R. Taylor
Björn W. Schuller
Kaihang Wu
15
38
0
29 Apr 2019
Natural Language Interactions in Autonomous Vehicles: Intent Detection
  and Slot Filling from Passenger Utterances
Natural Language Interactions in Autonomous Vehicles: Intent Detection and Slot Filling from Passenger Utterances
Eda Okur
Shachi H. Kumar
Saurav Sahay
Asli Arslan Esme
L. Nachman
13
19
0
23 Apr 2019
Neural network gradient-based learning of black-box function interfaces
Neural network gradient-based learning of black-box function interfaces
Alon Jacovi
Guy Hadash
Einat Kermany
Boaz Carmeli
Ofer Lavi
George Kour
Jonathan Berant
6
13
0
13 Jan 2019
Feature Extraction for Temporal Signal Recognition: An Overview
Feature Extraction for Temporal Signal Recognition: An Overview
Imad Rida
14
12
0
03 Dec 2018
Concept Learning through Deep Reinforcement Learning with
  Memory-Augmented Neural Networks
Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks
Jing Shi
Jiaming Xu
Yiqun Yao
Bo Xu
26
24
0
15 Nov 2018
Recognizing Overlapped Speech in Meetings: A Multichannel Separation
  Approach Using Neural Networks
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
Takuya Yoshioka
Hakan Erdogan
Zhuo Chen
Xiong Xiao
F. Alleva
BDL
17
81
0
08 Oct 2018
Capacity Control of ReLU Neural Networks by Basis-path Norm
Capacity Control of ReLU Neural Networks by Basis-path Norm
Shuxin Zheng
Qi Meng
Huishuai Zhang
Wei-neng Chen
Nenghai Yu
Tie-Yan Liu
18
23
0
19 Sep 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for
  Visual and Speech Recognition
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
17
96
0
10 Jul 2018
Snips Voice Platform: an embedded Spoken Language Understanding system
  for private-by-design voice interfaces
Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces
A. Coucke
Alaa Saade
Adrien Ball
Théodore Bluche
A. Caulier
...
Thibault Gisselbrecht
F. Caltagirone
Thibaut Lavril
Maël Primet
Joseph Dureau
SyDa
12
812
0
25 May 2018
Low-Precision Floating-Point Schemes for Neural Network Training
Low-Precision Floating-Point Schemes for Neural Network Training
Marc Ortiz
A. Cristal
Eduard Ayguadé
Marc Casas
MQ
17
22
0
14 Apr 2018
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset,
  task and baselines
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines
Jon Barker
Shinji Watanabe
Emmanuel Vincent
J. Trmal
14
678
0
28 Mar 2018
Deep-FSMN for Large Vocabulary Continuous Speech Recognition
Deep-FSMN for Large Vocabulary Continuous Speech Recognition
Shiliang Zhang
Ming Lei
Zhijie Yan
Lirong Dai
21
108
0
04 Mar 2018
Sequence-based Multi-lingual Low Resource Speech Recognition
Sequence-based Multi-lingual Low Resource Speech Recognition
Siddharth Dalmia
Ramon Sanabria
Florian Metze
A. Black
18
94
0
21 Feb 2018
Learning Combinations of Activation Functions
Learning Combinations of Activation Functions
Franco Manessi
A. Rozza
AI4CE
21
54
0
29 Jan 2018
The CAPIO 2017 Conversational Speech Recognition System
The CAPIO 2017 Conversational Speech Recognition System
Kyu Jeong Han
Akshay Chandrashekaran
Jungsuk Kim
Ian Lane
15
72
0
29 Dec 2017
Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning
Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning
Xinyun Chen
Chang-rui Liu
Bo-wen Li
Kimberly Lu
D. Song
AAML
SILM
13
1,800
0
15 Dec 2017
Language Modeling with Highway LSTM
Language Modeling with Highway LSTM
Gakuto Kurata
Bhuvana Ramabhadran
G. Saon
A. Sethy
AI4TS
13
38
0
19 Sep 2017
Exploring Neural Transducers for End-to-End Speech Recognition
Exploring Neural Transducers for End-to-End Speech Recognition
Eric Battenberg
Jitong Chen
R. Child
Adam Coates
Yashesh Gaur Yi Li
...
Hairong Liu
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
AI4TS
34
229
0
24 Jul 2017
Adversarial Example Defenses: Ensembles of Weak Defenses are not Strong
Adversarial Example Defenses: Ensembles of Weak Defenses are not Strong
Warren He
James Wei
Xinyun Chen
Nicholas Carlini
D. Song
AAML
27
242
0
15 Jun 2017
DeepXplore: Automated Whitebox Testing of Deep Learning Systems
DeepXplore: Automated Whitebox Testing of Deep Learning Systems
Kexin Pei
Yinzhi Cao
Junfeng Yang
Suman Jana
AAML
17
1,350
0
18 May 2017
Reducing Bias in Production Speech Models
Reducing Bias in Production Speech Models
Eric Battenberg
R. Child
Adam Coates
Christopher Fougner
Yashesh Gaur
...
Vinay Rao
S. Satheesh
David Seetapun
Anuroop Sriram
Zhenyao Zhu
25
10
0
11 May 2017
A comprehensive study of batch construction strategies for recurrent
  neural networks in MXNet
A comprehensive study of batch construction strategies for recurrent neural networks in MXNet
P. Doetsch
Pavel Golik
Hermann Ney
18
17
0
05 May 2017
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
Matthew Dunn
Levent Sagun
Mike Higgins
V. U. Güney
Volkan Cirik
Kyunghyun Cho
RALM
14
452
0
18 Apr 2017
12
Next