Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1610.05256
Cited By
v1
v2 (latest)
Achieving Human Parity in Conversational Speech Recognition
17 October 2016
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Achieving Human Parity in Conversational Speech Recognition"
50 / 201 papers shown
Title
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Shih-Hsuan Chiu
Tien-Hong Lo
Fu-An Chao
Berlin Chen
BDL
313
10
0
13 Jun 2021
On Feature Decorrelation in Self-Supervised Learning
IEEE International Conference on Computer Vision (ICCV), 2021
Tianyu Hua
Wenxiao Wang
Zihui Xue
Sucheng Ren
Yue Wang
Hang Zhao
SSL
OOD
425
210
0
02 May 2021
Defending Against Adversarial Denial-of-Service Data Poisoning Attacks
Nicolas Müller
Simon Roschmann
Konstantin Böttinger
AAML
168
0
0
14 Apr 2021
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Computer Speech and Language (CSL), 2021
Prashanth Gurunath Shivakumar
Shrikanth Narayanan
145
63
0
19 Feb 2021
Transformer Language Models with LSTM-based Cross-utterance Information Representation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
G. Sun
Chuxu Zhang
P. Woodland
184
35
0
12 Feb 2021
Dompteur: Taming Audio Adversarial Examples
USENIX Security Symposium (USENIX Security), 2021
Thorsten Eisenhofer
Lea Schonherr
Joel Frank
Lars Speckemeier
D. Kolossa
Thorsten Holz
AAML
202
27
0
10 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
Computer Speech and Language (CSL), 2021
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
641
384
0
24 Jan 2021
Exploiting Beam Search Confidence for Energy-Efficient Speech Recognition
Social Science Research Network (SSRN), 2021
D. Pinto
J. Arnau
Antonio González
55
0
0
22 Jan 2021
Arabic Speech Recognition by End-to-End, Modular Systems and Human
Computer Speech and Language (CSL), 2021
A. Hussein
Shinji Watanabe
Ahmed M. Ali
VLM
145
54
0
21 Jan 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
International Conference on Machine Learning (ICML), 2021
Chengyi Wang
Yu-Huan Wu
Yao Qian
K. Kumatani
Shujie Liu
Furu Wei
Michael Zeng
Xuedong Huang
OT
SSL
240
134
0
19 Jan 2021
Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement
Félix Grèzes
Zhaoheng Ni
V. Trinh
Michael I. Mandel
139
1
0
02 Dec 2020
Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks
Zhaoheng Ni
Félix Grèzes
V. Trinh
Michael I. Mandel
82
3
0
02 Dec 2020
Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks
Félix Grèzes
Zhaoheng Ni
V. Trinh
Michael I. Mandel
AI4TS
43
1
0
02 Dec 2020
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription
Xiaofei Wang
Naoyuki Kanda
Yashesh Gaur
Zhuo Chen
Zhong Meng
Takuya Yoshioka
157
14
0
05 Nov 2020
Deep-Dup: An Adversarial Weight Duplication Attack Framework to Crush Deep Neural Network in Multi-Tenant FPGA
Adnan Siraj Rakin
Yukui Luo
Xiaolin Xu
Deliang Fan
AAML
213
56
0
05 Nov 2020
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Desh Raj
Pavel Denisov
Zhuo Chen
Hakan Erdogan
Zili Huang
...
Yi Luo
Naoyuki Kanda
Jinyu Li
Scott Wisdom
J. Hershey
140
108
0
03 Nov 2020
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
T. Nguyen
S. Stueker
A. Waibel
BDL
273
41
0
07 Oct 2020
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
792
88
0
17 Sep 2020
How Much Can We Really Trust You? Towards Simple, Interpretable Trust Quantification Metrics for Deep Neural Networks
A. Wong
Xiao Yu Wang
Andrew Hryniowski
146
26
0
12 Sep 2020
Short-term Traffic Prediction with Deep Neural Networks: A Survey
IEEE Access (IEEE Access), 2020
Kyungeun Lee
Moonjung Eo
Euna Jung
Yoonjin Yoon
Wonjong Rhee
GNN
AI4TS
164
66
0
28 Aug 2020
Cross-Utterance Language Models with Acoustic Error Sampling
G. Sun
Chuxu Zhang
P. Woodland
122
2
0
19 Aug 2020
TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
A. Wong
M. Famouri
Maya Pavlova
Siddharth Surana
300
34
0
10 Aug 2020
Word Error Rate Estimation Without ASR Output: e-WER2
Interspeech (Interspeech), 2020
Ahmed M. Ali
Steve Renals
85
15
0
08 Aug 2020
Text-based classification of interviews for mental health -- juxtaposing the state of the art
J. Wouts
112
1
0
29 Jul 2020
Can We Mitigate Backdoor Attack Using Adversarial Detection Methods?
Kaidi Jin
Tianwei Zhang
Chao Shen
Yufei Chen
Ming Fan
Chenhao Lin
Ting Liu
AAML
79
16
0
26 Jun 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Ashish Arora
Desh Raj
Aswin Shanmugam Subramanian
Ke Li
Bar Ben Yair
Matthew Maciejewski
Piotr Żelasko
Leibny Paola García-Perera
Shinji Watanabe
Sanjeev Khudanpur
209
9
0
14 Jun 2020
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition
Jing Pan
Joshua Shapiro
Jeremy Wohlwend
Kyu Jeong Han
Tao Lei
T. Ma
138
23
0
21 May 2020
Large scale weakly and semi-supervised learning for low-resource video ASR
Kritika Singh
Vimal Manohar
Alex Xiao
Sergey Edunov
Ross B. Girshick
Vitaliy Liptchinsky
Christian Fuegen
Yatharth Saraf
Geoffrey Zweig
Abdel-rahman Mohamed
144
10
0
16 May 2020
Adversarial Machine Learning in Network Intrusion Detection Systems
Elie Alhajjar
P. Maxwell
Nathaniel D. Bastian
GAN
SILM
AAML
165
169
0
23 Apr 2020
Automatic, Dynamic, and Nearly Optimal Learning Rate Specification by Local Quadratic Approximation
Yingqiu Zhu
Yu Chen
Danyang Huang
Bo Zhang
Hansheng Wang
90
0
0
07 Apr 2020
DeepHammer: Depleting the Intelligence of Deep Neural Networks through Targeted Chain of Bit Flips
USENIX Security Symposium (USENIX Security), 2020
Fan Yao
Adnan Siraj Rakin
Deliang Fan
AAML
152
190
0
30 Mar 2020
Towards Deep Learning Models Resistant to Large Perturbations
Amirreza Shaeiri
Rozhin Nobahari
M. Rohban
OOD
AAML
147
14
0
30 Mar 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
Interspeech (Interspeech), 2020
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
181
141
0
28 Mar 2020
A Survey of Adversarial Learning on Graphs
Liang Chen
Jintang Li
Jiaying Peng
Tao Xie
Zengxu Cao
Kun Xu
Xiangnan He
Zibin Zheng
Bingzhe Wu
AAML
179
90
0
10 Mar 2020
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks
Théodore Bluche
Maël Primet
Thibault Gisselbrecht
ObjD
MQ
96
29
0
25 Feb 2020
A simple way to make neural networks robust against diverse image corruptions
E. Rusak
Lukas Schott
Roland S. Zimmermann
Julian Bitterwolf
Oliver Bringmann
Matthias Bethge
Wieland Brendel
243
65
0
16 Jan 2020
ATHENA: A Framework based on Diverse Weak Defenses for Building Adversarial Defense
Meng
Jianhai Su
Jason M. O'Kane
Pooyan Jamshidi
AAML
104
7
0
02 Jan 2020
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2019
Lu Huang
Gaofeng Cheng
Pengyuan Zhang
Yi Yang
Shumin Xu
Jiasong Sun
95
8
0
25 Dec 2019
Predicting detection filters for small footprint open-vocabulary keyword spotting
Interspeech (Interspeech), 2019
Théodore Bluche
Thibault Gisselbrecht
ObjD
160
22
0
16 Dec 2019
Advances in Online Audio-Visual Meeting Transcription
Automatic Speech Recognition & Understanding (ASRU), 2019
Takuya Yoshioka
Igor Abramovski
Cem Aksoylar
Zhuo Chen
Moshe David
...
Huaming Wang
Zhenghao Wang
Jun Zhang
Yong Zhao
Tianyan Zhou
154
79
0
10 Dec 2019
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems With Limited Data
ACM Asia Conference on Computer and Communications Security (AsiaCCS), 2019
Xinyun Chen
Wenxiao Wang
Chris Bender
Yiming Ding
R. Jia
Yue Liu
Basel Alomair
AAML
199
113
0
17 Nov 2019
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention
Ching-Feng Yeh
Jay Mahadeokar
Kaustubh Kalgaonkar
Yongqiang Wang
Duc Le
Mahaveer Jain
Kjell Schubert
Christian Fuegen
M. Seltzer
179
159
0
28 Oct 2019
Look-up and Adapt: A One-shot Semantic Parser
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Zhichu Lu
Forough Arabshahi
I. Labutov
Tom Michael Mitchell
100
5
0
27 Oct 2019
Bottom-Up Meta-Policy Search
Luckeciano C. Melo
Marcos R. O. A. Máximo
A. Cunha
108
6
0
22 Oct 2019
Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2019
Shahram Ghorbani
S. Khorram
John H. L. Hansen
123
19
0
01 Oct 2019
Alleviating Sequence Information Loss with Data Overlapping and Prime Batch Sizes
Conference on Computational Natural Language Learning (CoNLL), 2019
Noémien Kocher
Christian Scuito
Lorenzo Tarantino
Alexandros Lazaridis
Andreas Fischer
C. Musat
81
0
0
18 Sep 2019
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DV
VLM
AI4TS
301
223
0
16 Aug 2019
SANTLR: Speech Annotation Toolkit for Low Resource Languages
Interspeech (Interspeech), 2019
Xinjian Li
Zhong Zhou
Siddharth Dalmia
A. Black
Florian Metze
98
7
0
02 Aug 2019
Multilingual Speech Recognition with Corpus Relatedness Sampling
Interspeech (Interspeech), 2019
Xinjian Li
Siddharth Dalmia
A. Black
Florian Metze
118
17
0
02 Aug 2019
Multi-Frame Cross-Entropy Training for Convolutional Neural Networks in Speech Recognition
Tom Sercu
Neil Rohit Mallinar
85
0
0
29 Jul 2019
Previous
1
2
3
4
5
Next