ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1610.05256
  4. Cited By
Achieving Human Parity in Conversational Speech Recognition
v1v2 (latest)

Achieving Human Parity in Conversational Speech Recognition

17 October 2016
Wayne Xiong
J. Droppo
Xuedong Huang
Frank Seide
M. Seltzer
A. Stolcke
Dong Yu
Geoffrey Zweig
ArXiv (abs)PDFHTML

Papers citing "Achieving Human Parity in Conversational Speech Recognition"

50 / 201 papers shown
Title
Cross-utterance Reranking Models with BERT and Graph Convolutional
  Networks for Conversational Speech Recognition
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech RecognitionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021
Shih-Hsuan Chiu
Tien-Hong Lo
Fu-An Chao
Berlin Chen
BDL
313
10
0
13 Jun 2021
On Feature Decorrelation in Self-Supervised Learning
On Feature Decorrelation in Self-Supervised LearningIEEE International Conference on Computer Vision (ICCV), 2021
Tianyu Hua
Wenxiao Wang
Zihui Xue
Sucheng Ren
Yue Wang
Hang Zhao
SSLOOD
429
210
0
02 May 2021
Defending Against Adversarial Denial-of-Service Data Poisoning Attacks
Defending Against Adversarial Denial-of-Service Data Poisoning Attacks
Nicolas Müller
Simon Roschmann
Konstantin Böttinger
AAML
172
0
0
14 Apr 2021
End-to-End Neural Systems for Automatic Children Speech Recognition: An
  Empirical Study
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical StudyComputer Speech and Language (CSL), 2021
Prashanth Gurunath Shivakumar
Shrikanth Narayanan
145
63
0
19 Feb 2021
Transformer Language Models with LSTM-based Cross-utterance Information
  Representation
Transformer Language Models with LSTM-based Cross-utterance Information RepresentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
G. Sun
Chuxu Zhang
P. Woodland
184
35
0
12 Feb 2021
Dompteur: Taming Audio Adversarial Examples
Dompteur: Taming Audio Adversarial ExamplesUSENIX Security Symposium (USENIX Security), 2021
Thorsten Eisenhofer
Lea Schonherr
Joel Frank
Lars Speckemeier
D. Kolossa
Thorsten Holz
AAML
202
27
0
10 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep Learning
A Review of Speaker Diarization: Recent Advances with Deep LearningComputer Speech and Language (CSL), 2021
Tae Jin Park
Naoyuki Kanda
Dimitrios Dimitriadis
Kyu Jeong Han
Shinji Watanabe
Shrikanth Narayanan
VLM
665
384
0
24 Jan 2021
Exploiting Beam Search Confidence for Energy-Efficient Speech
  Recognition
Exploiting Beam Search Confidence for Energy-Efficient Speech RecognitionSocial Science Research Network (SSRN), 2021
D. Pinto
J. Arnau
Antonio González
55
0
0
22 Jan 2021
Arabic Speech Recognition by End-to-End, Modular Systems and Human
Arabic Speech Recognition by End-to-End, Modular Systems and HumanComputer Speech and Language (CSL), 2021
A. Hussein
Shinji Watanabe
Ahmed M. Ali
VLM
145
54
0
21 Jan 2021
UniSpeech: Unified Speech Representation Learning with Labeled and
  Unlabeled Data
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled DataInternational Conference on Machine Learning (ICML), 2021
Chengyi Wang
Yu-Huan Wu
Yao Qian
K. Kumatani
Shujie Liu
Furu Wei
Michael Zeng
Xuedong Huang
OTSSL
244
134
0
19 Jan 2021
Combining Spatial Clustering with LSTM Speech Models for Multichannel
  Speech Enhancement
Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement
Félix Grèzes
Zhaoheng Ni
V. Trinh
Michael I. Mandel
139
1
0
02 Dec 2020
Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial
  Clustering Masks
Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks
Zhaoheng Ni
Félix Grèzes
V. Trinh
Michael I. Mandel
82
3
0
02 Dec 2020
Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM
  Neural Networks
Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks
Félix Grèzes
Zhaoheng Ni
V. Trinh
Michael I. Mandel
AI4TS
43
1
0
02 Dec 2020
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting
  Transcription
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription
Xiaofei Wang
Naoyuki Kanda
Yashesh Gaur
Zhuo Chen
Zhong Meng
Takuya Yoshioka
157
14
0
05 Nov 2020
Deep-Dup: An Adversarial Weight Duplication Attack Framework to Crush
  Deep Neural Network in Multi-Tenant FPGA
Deep-Dup: An Adversarial Weight Duplication Attack Framework to Crush Deep Neural Network in Multi-Tenant FPGA
Adnan Siraj Rakin
Yukui Luo
Xiaolin Xu
Deliang Fan
AAML
221
56
0
05 Nov 2020
Integration of speech separation, diarization, and recognition for
  multi-speaker meetings: System description, comparison, and analysis
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Desh Raj
Pavel Denisov
Zhuo Chen
Hakan Erdogan
Zili Huang
...
Yi Luo
Naoyuki Kanda
Jinyu Li
Scott Wisdom
J. Hershey
148
108
0
03 Nov 2020
Super-Human Performance in Online Low-latency Recognition of
  Conversational Speech
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
T. Nguyen
S. Stueker
A. Waibel
BDL
273
41
0
07 Oct 2020
Review: Deep Learning in Electron Microscopy
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
800
88
0
17 Sep 2020
How Much Can We Really Trust You? Towards Simple, Interpretable Trust
  Quantification Metrics for Deep Neural Networks
How Much Can We Really Trust You? Towards Simple, Interpretable Trust Quantification Metrics for Deep Neural Networks
A. Wong
Xiao Yu Wang
Andrew Hryniowski
146
26
0
12 Sep 2020
Short-term Traffic Prediction with Deep Neural Networks: A Survey
Short-term Traffic Prediction with Deep Neural Networks: A SurveyIEEE Access (IEEE Access), 2020
Kyungeun Lee
Moonjung Eo
Euna Jung
Yoonjin Yoon
Wonjong Rhee
GNNAI4TS
164
66
0
28 Aug 2020
Cross-Utterance Language Models with Acoustic Error Sampling
Cross-Utterance Language Models with Acoustic Error Sampling
G. Sun
Chuxu Zhang
P. Woodland
122
2
0
19 Aug 2020
TinySpeech: Attention Condensers for Deep Speech Recognition Neural
  Networks on Edge Devices
TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
A. Wong
M. Famouri
Maya Pavlova
Siddharth Surana
300
34
0
10 Aug 2020
Word Error Rate Estimation Without ASR Output: e-WER2
Word Error Rate Estimation Without ASR Output: e-WER2Interspeech (Interspeech), 2020
Ahmed M. Ali
Steve Renals
85
15
0
08 Aug 2020
Text-based classification of interviews for mental health -- juxtaposing
  the state of the art
Text-based classification of interviews for mental health -- juxtaposing the state of the art
J. Wouts
112
1
0
29 Jul 2020
Can We Mitigate Backdoor Attack Using Adversarial Detection Methods?
Can We Mitigate Backdoor Attack Using Adversarial Detection Methods?
Kaidi Jin
Tianwei Zhang
Chao Shen
Yufei Chen
Ming Fan
Chenhao Lin
Ting Liu
AAML
79
16
0
26 Jun 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6
  Challenge
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge
Ashish Arora
Desh Raj
Aswin Shanmugam Subramanian
Ke Li
Bar Ben Yair
Matthew Maciejewski
Piotr Żelasko
Leibny Paola García-Perera
Shinji Watanabe
Sanjeev Khudanpur
209
9
0
14 Jun 2020
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech
  Recognition
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition
Jing Pan
Joshua Shapiro
Jeremy Wohlwend
Kyu Jeong Han
Tao Lei
T. Ma
138
23
0
21 May 2020
Large scale weakly and semi-supervised learning for low-resource video
  ASR
Large scale weakly and semi-supervised learning for low-resource video ASR
Kritika Singh
Vimal Manohar
Alex Xiao
Sergey Edunov
Ross B. Girshick
Vitaliy Liptchinsky
Christian Fuegen
Yatharth Saraf
Geoffrey Zweig
Abdel-rahman Mohamed
144
10
0
16 May 2020
Adversarial Machine Learning in Network Intrusion Detection Systems
Adversarial Machine Learning in Network Intrusion Detection Systems
Elie Alhajjar
P. Maxwell
Nathaniel D. Bastian
GANSILMAAML
165
169
0
23 Apr 2020
Automatic, Dynamic, and Nearly Optimal Learning Rate Specification by
  Local Quadratic Approximation
Automatic, Dynamic, and Nearly Optimal Learning Rate Specification by Local Quadratic Approximation
Yingqiu Zhu
Yu Chen
Danyang Huang
Bo Zhang
Hansheng Wang
90
0
0
07 Apr 2020
DeepHammer: Depleting the Intelligence of Deep Neural Networks through
  Targeted Chain of Bit Flips
DeepHammer: Depleting the Intelligence of Deep Neural Networks through Targeted Chain of Bit FlipsUSENIX Security Symposium (USENIX Security), 2020
Fan Yao
Adnan Siraj Rakin
Deliang Fan
AAML
156
190
0
30 Mar 2020
Towards Deep Learning Models Resistant to Large Perturbations
Towards Deep Learning Models Resistant to Large Perturbations
Amirreza Shaeiri
Rozhin Nobahari
M. Rohban
OODAAML
151
14
0
30 Mar 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
Serialized Output Training for End-to-End Overlapped Speech RecognitionInterspeech (Interspeech), 2020
Naoyuki Kanda
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Takuya Yoshioka
181
141
0
28 Mar 2020
A Survey of Adversarial Learning on Graphs
A Survey of Adversarial Learning on Graphs
Liang Chen
Jintang Li
Jiaying Peng
Tao Xie
Zengxu Cao
Kun Xu
Xiangnan He
Zibin Zheng
Bingzhe Wu
AAML
179
90
0
10 Mar 2020
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM
  Networks
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks
Théodore Bluche
Maël Primet
Thibault Gisselbrecht
ObjDMQ
96
29
0
25 Feb 2020
A simple way to make neural networks robust against diverse image
  corruptions
A simple way to make neural networks robust against diverse image corruptions
E. Rusak
Lukas Schott
Roland S. Zimmermann
Julian Bitterwolf
Oliver Bringmann
Matthias Bethge
Wieland Brendel
243
65
0
16 Jan 2020
ATHENA: A Framework based on Diverse Weak Defenses for Building
  Adversarial Defense
ATHENA: A Framework based on Diverse Weak Defenses for Building Adversarial Defense
Meng
Jianhai Su
Jason M. O'Kane
Pooyan Jamshidi
AAML
108
7
0
02 Jan 2020
Utterance-level Permutation Invariant Training with Latency-controlled
  BLSTM for Single-channel Multi-talker Speech Separation
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech SeparationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2019
Lu Huang
Gaofeng Cheng
Pengyuan Zhang
Yi Yang
Shumin Xu
Jiasong Sun
99
8
0
25 Dec 2019
Predicting detection filters for small footprint open-vocabulary keyword
  spotting
Predicting detection filters for small footprint open-vocabulary keyword spottingInterspeech (Interspeech), 2019
Théodore Bluche
Thibault Gisselbrecht
ObjD
160
22
0
16 Dec 2019
Advances in Online Audio-Visual Meeting Transcription
Advances in Online Audio-Visual Meeting TranscriptionAutomatic Speech Recognition & Understanding (ASRU), 2019
Takuya Yoshioka
Igor Abramovski
Cem Aksoylar
Zhuo Chen
Moshe David
...
Huaming Wang
Zhenghao Wang
Jun Zhang
Yong Zhao
Tianyan Zhou
162
79
0
10 Dec 2019
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems
  With Limited Data
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems With Limited DataACM Asia Conference on Computer and Communications Security (AsiaCCS), 2019
Xinyun Chen
Wenxiao Wang
Chris Bender
Yiming Ding
R. Jia
Yue Liu
Basel Alomair
AAML
199
113
0
17 Nov 2019
Transformer-Transducer: End-to-End Speech Recognition with
  Self-Attention
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention
Ching-Feng Yeh
Jay Mahadeokar
Kaustubh Kalgaonkar
Yongqiang Wang
Duc Le
Mahaveer Jain
Kjell Schubert
Christian Fuegen
M. Seltzer
179
159
0
28 Oct 2019
Look-up and Adapt: A One-shot Semantic Parser
Look-up and Adapt: A One-shot Semantic ParserConference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Zhichu Lu
Forough Arabshahi
I. Labutov
Tom Michael Mitchell
100
5
0
27 Oct 2019
Bottom-Up Meta-Policy Search
Bottom-Up Meta-Policy Search
Luckeciano C. Melo
Marcos R. O. A. Máximo
A. Cunha
116
6
0
22 Oct 2019
Domain Expansion in DNN-based Acoustic Models for Robust Speech
  Recognition
Domain Expansion in DNN-based Acoustic Models for Robust Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2019
Shahram Ghorbani
S. Khorram
John H. L. Hansen
123
19
0
01 Oct 2019
Alleviating Sequence Information Loss with Data Overlapping and Prime
  Batch Sizes
Alleviating Sequence Information Loss with Data Overlapping and Prime Batch SizesConference on Computational Natural Language Learning (CoNLL), 2019
Noémien Kocher
Christian Scuito
Lorenzo Tarantino
Alexandros Lazaridis
Andreas Fischer
C. Musat
81
0
0
18 Sep 2019
Survey on Deep Neural Networks in Speech and Vision Systems
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DVVLMAI4TS
305
223
0
16 Aug 2019
SANTLR: Speech Annotation Toolkit for Low Resource Languages
SANTLR: Speech Annotation Toolkit for Low Resource LanguagesInterspeech (Interspeech), 2019
Xinjian Li
Zhong Zhou
Siddharth Dalmia
A. Black
Florian Metze
98
7
0
02 Aug 2019
Multilingual Speech Recognition with Corpus Relatedness Sampling
Multilingual Speech Recognition with Corpus Relatedness SamplingInterspeech (Interspeech), 2019
Xinjian Li
Siddharth Dalmia
A. Black
Florian Metze
122
17
0
02 Aug 2019
Multi-Frame Cross-Entropy Training for Convolutional Neural Networks in
  Speech Recognition
Multi-Frame Cross-Entropy Training for Convolutional Neural Networks in Speech Recognition
Tom Sercu
Neil Rohit Mallinar
85
0
0
29 Jul 2019
Previous
12345
Next