v1v2 (latest)

Achieving Human Parity in Conversational Speech Recognition

17 October 2016

Papers citing "Achieving Human Parity in Conversational Speech Recognition"

50 / 201 papers shown

Title
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech RecognitionAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2021 Shih-Hsuan Chiu Tien-Hong Lo Fu-An Chao Berlin Chen BDL 313 10 0 13 Jun 2021
On Feature Decorrelation in Self-Supervised LearningIEEE International Conference on Computer Vision (ICCV), 2021 Tianyu Hua Wenxiao Wang Zihui Xue Sucheng Ren Yue Wang Hang Zhao SSL OOD 429 210 0 02 May 2021
Defending Against Adversarial Denial-of-Service Data Poisoning Attacks Nicolas Müller Simon Roschmann Konstantin Böttinger AAML 172 0 0 14 Apr 2021
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical StudyComputer Speech and Language (CSL), 2021 Prashanth Gurunath Shivakumar Shrikanth Narayanan 145 63 0 19 Feb 2021
Transformer Language Models with LSTM-based Cross-utterance Information RepresentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 G. Sun Chuxu Zhang P. Woodland 184 35 0 12 Feb 2021
Dompteur: Taming Audio Adversarial ExamplesUSENIX Security Symposium (USENIX Security), 2021 Thorsten Eisenhofer Lea Schonherr Joel Frank Lars Speckemeier D. Kolossa Thorsten Holz AAML 202 27 0 10 Feb 2021
A Review of Speaker Diarization: Recent Advances with Deep LearningComputer Speech and Language (CSL), 2021 Tae Jin Park Naoyuki Kanda Dimitrios Dimitriadis Kyu Jeong Han Shinji Watanabe Shrikanth Narayanan VLM 665 384 0 24 Jan 2021
Exploiting Beam Search Confidence for Energy-Efficient Speech RecognitionSocial Science Research Network (SSRN), 2021 D. Pinto J. Arnau Antonio González 55 0 0 22 Jan 2021
Arabic Speech Recognition by End-to-End, Modular Systems and HumanComputer Speech and Language (CSL), 2021 A. Hussein Shinji Watanabe Ahmed M. Ali VLM 145 54 0 21 Jan 2021
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled DataInternational Conference on Machine Learning (ICML), 2021 Chengyi Wang Yu-Huan Wu Yao Qian K. Kumatani Shujie Liu Furu Wei Michael Zeng Xuedong Huang OT SSL 244 134 0 19 Jan 2021
Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement Félix Grèzes Zhaoheng Ni V. Trinh Michael I. Mandel 139 1 0 02 Dec 2020
Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks Zhaoheng Ni Félix Grèzes V. Trinh Michael I. Mandel 82 3 0 02 Dec 2020
Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks Félix Grèzes Zhaoheng Ni V. Trinh Michael I. Mandel AI4TS 43 1 0 02 Dec 2020
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription Xiaofei Wang Naoyuki Kanda Yashesh Gaur Zhuo Chen Zhong Meng Takuya Yoshioka 157 14 0 05 Nov 2020
Deep-Dup: An Adversarial Weight Duplication Attack Framework to Crush Deep Neural Network in Multi-Tenant FPGA Adnan Siraj Rakin Yukui Luo Xiaolin Xu Deliang Fan AAML 221 56 0 05 Nov 2020
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis Desh Raj Pavel Denisov Zhuo Chen Hakan Erdogan Zili Huang ... Yi Luo Naoyuki Kanda Jinyu Li Scott Wisdom J. Hershey 148 108 0 03 Nov 2020
Super-Human Performance in Online Low-latency Recognition of Conversational Speech T. Nguyen S. Stueker A. Waibel BDL 273 41 0 07 Oct 2020
Review: Deep Learning in Electron Microscopy Jeffrey M. Ede 800 88 0 17 Sep 2020
How Much Can We Really Trust You? Towards Simple, Interpretable Trust Quantification Metrics for Deep Neural Networks A. Wong Xiao Yu Wang Andrew Hryniowski 146 26 0 12 Sep 2020
Short-term Traffic Prediction with Deep Neural Networks: A SurveyIEEE Access (IEEE Access), 2020 Kyungeun Lee Moonjung Eo Euna Jung Yoonjin Yoon Wonjong Rhee GNN AI4TS 164 66 0 28 Aug 2020
Cross-Utterance Language Models with Acoustic Error Sampling G. Sun Chuxu Zhang P. Woodland 122 2 0 19 Aug 2020
TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices A. Wong M. Famouri Maya Pavlova Siddharth Surana 300 34 0 10 Aug 2020
Word Error Rate Estimation Without ASR Output: e-WER2Interspeech (Interspeech), 2020 Ahmed M. Ali Steve Renals 85 15 0 08 Aug 2020
Text-based classification of interviews for mental health -- juxtaposing the state of the art J. Wouts 112 1 0 29 Jul 2020
Can We Mitigate Backdoor Attack Using Adversarial Detection Methods? Kaidi Jin Tianwei Zhang Chao Shen Yufei Chen Ming Fan Chenhao Lin Ting Liu AAML 79 16 0 26 Jun 2020
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge Ashish Arora Desh Raj Aswin Shanmugam Subramanian Ke Li Bar Ben Yair Matthew Maciejewski Piotr Żelasko Leibny Paola García-Perera Shinji Watanabe Sanjeev Khudanpur 209 9 0 14 Jun 2020
ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition Jing Pan Joshua Shapiro Jeremy Wohlwend Kyu Jeong Han Tao Lei T. Ma 138 23 0 21 May 2020
Large scale weakly and semi-supervised learning for low-resource video ASR Kritika Singh Vimal Manohar Alex Xiao Sergey Edunov Ross B. Girshick Vitaliy Liptchinsky Christian Fuegen Yatharth Saraf Geoffrey Zweig Abdel-rahman Mohamed 144 10 0 16 May 2020
Adversarial Machine Learning in Network Intrusion Detection Systems Elie Alhajjar P. Maxwell Nathaniel D. Bastian GAN SILM AAML 165 169 0 23 Apr 2020
Automatic, Dynamic, and Nearly Optimal Learning Rate Specification by Local Quadratic Approximation Yingqiu Zhu Yu Chen Danyang Huang Bo Zhang Hansheng Wang 90 0 0 07 Apr 2020
DeepHammer: Depleting the Intelligence of Deep Neural Networks through Targeted Chain of Bit FlipsUSENIX Security Symposium (USENIX Security), 2020 Fan Yao Adnan Siraj Rakin Deliang Fan AAML 156 190 0 30 Mar 2020
Towards Deep Learning Models Resistant to Large Perturbations Amirreza Shaeiri Rozhin Nobahari M. Rohban OOD AAML 151 14 0 30 Mar 2020
Serialized Output Training for End-to-End Overlapped Speech RecognitionInterspeech (Interspeech), 2020 Naoyuki Kanda Yashesh Gaur Xiaofei Wang Zhong Meng Takuya Yoshioka 181 141 0 28 Mar 2020
A Survey of Adversarial Learning on Graphs Liang Chen Jintang Li Jiaying Peng Tao Xie Zengxu Cao Kun Xu Xiangnan He Zibin Zheng Bingzhe Wu AAML 179 90 0 10 Mar 2020
Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks Théodore Bluche Maël Primet Thibault Gisselbrecht ObjD MQ 96 29 0 25 Feb 2020
A simple way to make neural networks robust against diverse image corruptions E. Rusak Lukas Schott Roland S. Zimmermann Julian Bitterwolf Oliver Bringmann Matthias Bethge Wieland Brendel 243 65 0 16 Jan 2020
ATHENA: A Framework based on Diverse Weak Defenses for Building Adversarial Defense Meng Jianhai Su Jason M. O'Kane Pooyan Jamshidi AAML 108 7 0 02 Jan 2020
Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech SeparationAsia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2019 Lu Huang Gaofeng Cheng Pengyuan Zhang Yi Yang Shumin Xu Jiasong Sun 99 8 0 25 Dec 2019
Predicting detection filters for small footprint open-vocabulary keyword spottingInterspeech (Interspeech), 2019 Théodore Bluche Thibault Gisselbrecht ObjD 160 22 0 16 Dec 2019
Advances in Online Audio-Visual Meeting TranscriptionAutomatic Speech Recognition & Understanding (ASRU), 2019 Takuya Yoshioka Igor Abramovski Cem Aksoylar Zhuo Chen Moshe David ... Huaming Wang Zhenghao Wang Jun Zhang Yong Zhao Tianyan Zhou 162 79 0 10 Dec 2019
REFIT: A Unified Watermark Removal Framework For Deep Learning Systems With Limited DataACM Asia Conference on Computer and Communications Security (AsiaCCS), 2019 Xinyun Chen Wenxiao Wang Chris Bender Yiming Ding R. Jia Yue Liu Basel Alomair AAML 199 113 0 17 Nov 2019
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention Ching-Feng Yeh Jay Mahadeokar Kaustubh Kalgaonkar Yongqiang Wang Duc Le Mahaveer Jain Kjell Schubert Christian Fuegen M. Seltzer 179 159 0 28 Oct 2019
Look-up and Adapt: A One-shot Semantic ParserConference on Empirical Methods in Natural Language Processing (EMNLP), 2019 Zhichu Lu Forough Arabshahi I. Labutov Tom Michael Mitchell 100 5 0 27 Oct 2019
Bottom-Up Meta-Policy Search Luckeciano C. Melo Marcos R. O. A. Máximo A. Cunha 116 6 0 22 Oct 2019
Domain Expansion in DNN-based Acoustic Models for Robust Speech RecognitionAutomatic Speech Recognition & Understanding (ASRU), 2019 Shahram Ghorbani S. Khorram John H. L. Hansen 123 19 0 01 Oct 2019
Alleviating Sequence Information Loss with Data Overlapping and Prime Batch SizesConference on Computational Natural Language Learning (CoNLL), 2019 Noémien Kocher Christian Scuito Lorenzo Tarantino Alexandros Lazaridis Andreas Fischer C. Musat 81 0 0 18 Sep 2019
Survey on Deep Neural Networks in Speech and Vision Systems M. Alam Manar D. Samad Lasitha Vidyaratne Alexander M. Glandon Khan M. Iftekharuddin 3DV VLM AI4TS 305 223 0 16 Aug 2019
SANTLR: Speech Annotation Toolkit for Low Resource LanguagesInterspeech (Interspeech), 2019 Xinjian Li Zhong Zhou Siddharth Dalmia A. Black Florian Metze 98 7 0 02 Aug 2019
Multilingual Speech Recognition with Corpus Relatedness SamplingInterspeech (Interspeech), 2019 Xinjian Li Siddharth Dalmia A. Black Florian Metze 122 17 0 02 Aug 2019
Multi-Frame Cross-Entropy Training for Convolutional Neural Networks in Speech Recognition Tom Sercu Neil Rohit Mallinar 85 0 0 29 Jul 2019