1610.05256
Achieving Human Parity in Conversational Speech Recognition
17 October 2016
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
Papers citing "Achieving Human Parity in Conversational Speech Recognition"
50 / 201 papers shown
Title
Dealing with the Hard Facts of Low-Resource African NLP
Yacouba Diarra
Nouhoum Souleymane Coulibaly
Panga Azazia Kamaté
Madani Amadou Tall
Emmanuel Élisé Koné
Aymane Dembélé
Michael Leventhal
20
0
0
23 Nov 2025
What Do Humans Hear When Interacting? Experiments on Selective Listening for Evaluating ASR of Spoken Dialogue Systems
Kiyotada Mori
Seiya Kawano
Chaoran Liu
C. Ishi
Angel García Contreras
Koichiro Yoshino
127
0
0
06 Aug 2025
The 2025 PNPL Competition: Speech Detection and Phoneme Classification in the LibriBrain Dataset
Gilad Landau
Miran Özdogan
Gereon Elvers
Francesco Mantegna
Pratik Somaiya
...
Yorguin Mantilla Ramos
Çağlar Gülçehre
M. Woolrich
Natalie Voets
Oiwi Parker Jones
151
4
0
11 Jun 2025
Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling
Michael McGuire
160
1
0
10 Mar 2025
Deep CLAS: Deep Contextual Listen, Attend and Spell
Shifu Xiong
Mengzhi Wang
Genshun Wan
Hang Chen
Jianqing Gao
Lirong Dai
173
0
0
26 Sep 2024
Focus Agent: LLM-Powered Virtual Focus Group
International Conference on Intelligent Virtual Agents (IVA), 2024
Taiyu Zhang
Xuesong Zhang
Robbe Cools
Adalberto L. Simeone
LLMAG
127
6
0
03 Sep 2024
Backdoor Defense through Self-Supervised and Generative Learning
British Machine Vision Conference (BMVC), 2024
Ivan Sabolić
Ivan Grubišić
Siniša Šegvić
AAML
215
1
0
02 Sep 2024
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet
Manish Dhakal
Arman Chhetri
Aman Kumar Gupta
Prabin B. Lamichhane
S. Pandey
S. Shakya
AI4TS
148
11
0
25 Jun 2024
Tag and correct: high precision post-editing approach to correction of speech recognition errors
Tomasz Ziętkiewicz
146
2
0
11 Jun 2024
Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation
Yuan Xiao
Shiqing Ma
Juan Zhai
Chunrong Fang
Jinyuan Jia
Zhenyu Chen
AAML
169
1
0
02 Jun 2024
An inclusive review on deep learning techniques and their scope in handwriting recognition
ELCVIA Electronic Letters on Computer Vision and Image Analysis (ELCVIA), 2024
Sukhdeep Singh
Sudhir Rohilla
Anuj Sharma
145
2
0
10 Apr 2024
Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder
He Wang
Pengcheng Guo
Xucheng Wan
Huan Zhou
Lei Xie
176
5
0
08 Apr 2024
Impart: An Imperceptible and Effective Label-Specific Backdoor Attack
Jingke Zhao
Zan Wang
Yongwei Wang
Lanjun Wang
AAML
62
0
0
18 Mar 2024
AIx Speed: Playback Speed Optimization Using Listening Comprehension of Speech Recognition Models
Kazuki Kawamura
Jun Rekimoto
87
1
0
05 Mar 2024
Predicting Outcomes in Video Games with Long Short Term Memory Networks
Kittimate Chulajata
Sean Wu
Fabien Scalzo
Eun Sang Cha
183
2
0
24 Feb 2024
MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition
He Wang
Pengcheng Guo
Pan Zhou
Lei Xie
315
16
0
07 Jan 2024
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models
A. Ogawa
Naohiro Tawara
Marc Delcroix
S. Araki
172
3
0
20 Dec 2023
Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
A. Ogawa
Takafumi Moriya
Naoyuki Kamo
Naohiro Tawara
Marc Delcroix
120
3
0
17 Oct 2023
A Glance is Enough: Extract Target Sentence By Looking at A keyword
Ying Shi
Dong Wang
Lantian Li
Jiqing Han
155
1
0
09 Oct 2023
Beating Backdoor Attack at Its Own Game
IEEE International Conference on Computer Vision (ICCV), 2023
Min Liu
Alberto L. Sangiovanni-Vincentelli
Xiangyu Yue
AAML
468
13
0
28 Jul 2023
ATWM: Defense against adversarial malware based on adversarial training
Kunkun Li
Fan Zhang
Wei Guo
AAML
120
1
0
11 Jul 2023
Toward Interactive Dictation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Belinda Z. Li
J. Eisner
Adam Pauls
Sam Thomson
KELM
138
2
0
08 Jul 2023
On Practical Aspects of Aggregation Defenses against Data Poisoning Attacks
Wenxiao Wang
Soheil Feizi
AAML
138
1
0
28 Jun 2023
Multilingual Word Error Rate Estimation: e-WER3
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Shammur A. Chowdhury
Ahmed M. Ali
159
9
0
02 Apr 2023
Cascading Hierarchical Networks with Multi-task Balanced Loss for Fine-grained hashing
Xianxian Zeng
Yanjun Zheng
118
5
0
20 Mar 2023
Verbal behavior without syntactic structures: beyond Skinner and Chomsky
S. Edelman
22
0
0
11 Mar 2023
Factual Consistency Oriented Speech Recognition
Interspeech (Interspeech), 2023
Naoyuki Kanda
Takuya Yoshioka
Yang Liu
190
1
0
24 Feb 2023
From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition
International Conference on Human Factors in Computing Systems (CHI), 2023
Colin S. Lea
Zifang Huang
Lauren Tooley
Jaya Narain
Dianna Yee
P. Georgiou
Dung Tien Tran
Jeffrey P. Bigham
Leah Findlater
323
43
0
17 Feb 2023
Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit
Pengcheng Li
Genshun Wan
Fenglin Ding
Hang Chen
Jianqing Gao
Jia Pan
Cong Liu
SSL
157
1
0
07 Dec 2022
An Investigation of Smart Contract for Collaborative Machine Learning Model Training
Sheng Ding
Chenhui Hu
136
3
0
12 Sep 2022
Lethal Dose Conjecture on Data Poisoning
Neural Information Processing Systems (NeurIPS), 2022
Wenxiao Wang
Alexander Levine
Soheil Feizi
FedML
151
17
0
05 Aug 2022
Online Continual Learning of End-to-End Speech Recognition Models
Interspeech (Interspeech), 2022
Muqiao Yang
Ian Lane
Shinji Watanabe
CLL
124
26
0
11 Jul 2022
Toward Zero Oracle Word Error Rate on the Switchboard Benchmark
Interspeech (Interspeech), 2022
Arlo Faria
Adam L. Janin
Korbinian Riedhammer
Sidhi Adkoli
147
2
0
13 Jun 2022
Virtual embeddings and self-consistency for self-supervised learning
T. Bdair
Hossam Abdelhamid
Nassir Navab
Shadi Albarqouni
SSL
101
0
0
13 Jun 2022
Searching Similarity Measure for Binarized Neural Networks
Yanfei Li
Ang Li
Huimin Yu
123
0
0
05 Jun 2022
A Survey of Robust Adversarial Training in Pattern Recognition: Fundamental, Theory, and Methodologies
Pattern Recognition (Pattern Recogn.), 2022
Zhuang Qian
Kaizhu Huang
Qiufeng Wang
Xu-Yao Zhang
OOD
AAML
ObjD
191
89
0
26 Mar 2022
A Survey of Multilingual Models for Automatic Speech Recognition
International Conference on Language Resources and Evaluation (LREC), 2022
Hemant Yadav
Sunayana Sitaram
135
47
0
25 Feb 2022
DeepSketch: A New Machine Learning-Based Reference Search Technique for Post-Deduplication Delta Compression
USENIX Conference on File and Storage Technologies (FAST), 2022
Mohammad Sadrosadati
Jeoggyun Kim
Yeseong Kim
Sungjin Lee
O. Mutlu
94
29
0
17 Feb 2022
Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation
International Conference on Machine Learning (ICML), 2022
Wenxiao Wang
Alexander Levine
Soheil Feizi
AAML
183
64
0
05 Feb 2022
Robust Self-Supervised Audio-Visual Speech Recognition
Interspeech (Interspeech), 2022
Bowen Shi
Wei-Ning Hsu
Abdel-rahman Mohamed
257
114
0
05 Jan 2022
Survey on the Convergence of Machine Learning and Blockchain
Intelligent Systems with Applications (ISA), 2022
Sheng Ding
Chenhui Hu
SyDa
195
14
0
04 Jan 2022
VoiceMoji: A Novel On-Device Pipeline for Seamless Emoji Insertion in Dictation
IEEE India Conference (INDICON), 2021
Sumit Kumar
S. HarichandanaBS
Himanshu Arora
85
2
0
22 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech
Rohit Paturi
S. Srinivasan
Katrin Kirchhoff
Daniel Garcia-Romero
197
9
0
10 Dec 2021
Denoised Internal Models: a Brain-Inspired Autoencoder against Adversarial Attacks
Machine Intelligence Research (MIR), 2021
Kaiyuan Liu
Xingyu Li
Yu-Rui Lai
Hong Xie
Hang Su
Jiacheng Wang
Chunxu Guo
J. Guan
Yi Zhou
AAML
206
4
0
21 Nov 2021
DeepSteal: Advanced Model Extractions Leveraging Efficient Weight Stealing in Memories
Adnan Siraj Rakin
Md Hafizul Islam Chowdhuryy
Fan Yao
Deliang Fan
AAML
MIACV
140
140
0
08 Nov 2021
GPU-Accelerated Forward-Backward algorithm with Application to Lattice-Free MMI
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Lucas Ondel
Léa-Marie Lam-Yee-Mui
M. Kocour
Caio Corro
L. Burget
107
5
0
22 Oct 2021
User-Initiated Repetition-Based Recovery in Multi-Utterance Dialogue Systems
Hoang Long Nguyen
Vincent Renkens
J. Pelemans
Srividya Pranavi Potharaju
Anil Kumar Nalamalapu
Murat Akbacak
87
3
0
02 Aug 2021
The History of Speech Recognition to the Year 2030
Awni Y. Hannun
AI4TS
201
24
0
30 Jul 2021
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus
Hamdy Mubarak
A. Hussein
Shammur A. Chowdhury
Ahmed M. Ali
107
60
0
24 Jun 2021
Dialectal Speech Recognition and Translation of Swiss German Speech to Standard German Text: Microsoft's Submission to SwissText 2021
Yuriy Arabskyy
Aashish Agarwal
S. Dey
Oscar Koller
88
12
0
15 Jun 2021