Achieving Human Parity in Conversational Speech Recognition

17 October 2016

Papers citing "Achieving Human Parity in Conversational Speech Recognition"

50 / 201 papers shown

Dealing with the Hard Facts of Low-Resource African NLP Yacouba Diarra Nouhoum Souleymane Coulibaly Panga Azazia Kamaté Madani Amadou Tall Emmanuel Élisé Koné Aymane Dembélé Michael Leventhal 20 0 0 23 Nov 2025
What Do Humans Hear When Interacting? Experiments on Selective Listening for Evaluating ASR of Spoken Dialogue Systems Kiyotada Mori Seiya Kawano Chaoran Liu C. Ishi Angel García Contreras Koichiro Yoshino 127 0 0 06 Aug 2025
The 2025 PNPL Competition: Speech Detection and Phoneme Classification in the LibriBrain Dataset Gilad Landau Miran Özdogan Gereon Elvers Francesco Mantegna Pratik Somaiya ... Yorguin Mantilla Ramos Çağlar Gülçehre M. Woolrich Natalie Voets Oiwi Parker Jones 151 4 0 11 Jun 2025
Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling Michael McGuire 160 1 0 10 Mar 2025
Deep CLAS: Deep Contextual Listen, Attend and Spell Shifu Xiong Mengzhi Wang Genshun Wan Hang Chen Jianqing Gao Lirong Dai 173 0 0 26 Sep 2024
Focus Agent: LLM-Powered Virtual Focus Group International Conference on Intelligent Virtual Agents (IVA), 2024 Taiyu Zhang Xuesong Zhang Robbe Cools Adalberto L. Simeone LLMAG 127 6 0 03 Sep 2024
Backdoor Defense through Self-Supervised and Generative Learning British Machine Vision Conference (BMVC), 2024 Ivan Sabolić Ivan Grubišić Siniša Šegvić AAML 215 1 0 02 Sep 2024
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet Manish Dhakal Arman Chhetri Aman Kumar Gupta Prabin B. Lamichhane S. Pandey S. Shakya AI4TS 148 11 0 25 Jun 2024
Tag and correct: high precision post-editing approach to correction of speech recognition errors Tomasz Ziętkiewicz 146 2 0 11 Jun 2024
Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation Yuan Xiao Shiqing Ma Juan Zhai Chunrong Fang Jinyuan Jia Zhenyu Chen AAML 169 1 0 02 Jun 2024
An inclusive review on deep learning techniques and their scope in handwriting recognition ELCVIA Electronic Letters on Computer Vision and Image Analysis (ELCVIA), 2024 Sukhdeep Singh Sudhir Rohilla Anuj Sharma 145 2 0 10 Apr 2024
Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder He Wang Pengcheng Guo Xucheng Wan Huan Zhou Lei Xie 176 5 0 08 Apr 2024
Impart: An Imperceptible and Effective Label-Specific Backdoor Attack Jingke Zhao Zan Wang Yongwei Wang Lanjun Wang AAML 62 0 0 18 Mar 2024
AIx Speed: Playback Speed Optimization Using Listening Comprehension of Speech Recognition Models Kazuki Kawamura Jun Rekimoto 87 1 0 05 Mar 2024
Predicting Outcomes in Video Games with Long Short Term Memory Networks Kittimate Chulajata Sean Wu Fabien Scalzo Eun Sang Cha 183 2 0 24 Feb 2024
MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition He Wang Pengcheng Guo Pan Zhou Lei Xie 315 16 0 07 Jan 2024
Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models A. Ogawa Naohiro Tawara Marc Delcroix S. Araki 172 3 0 20 Dec 2023
Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 A. Ogawa Takafumi Moriya Naoyuki Kamo Naohiro Tawara Marc Delcroix 120 3 0 17 Oct 2023
A Glance is Enough: Extract Target Sentence By Looking at A keyword Ying Shi Dong Wang Lantian Li Jiqing Han 155 1 0 09 Oct 2023
Beating Backdoor Attack at Its Own Game IEEE International Conference on Computer Vision (ICCV), 2023 Min Liu Alberto L. Sangiovanni-Vincentelli Xiangyu Yue AAML 468 13 0 28 Jul 2023
ATWM: Defense against adversarial malware based on adversarial training Kunkun Li Fan Zhang Wei Guo AAML 120 1 0 11 Jul 2023
Toward Interactive Dictation Annual Meeting of the Association for Computational Linguistics (ACL), 2023 Belinda Z. Li J. Eisner Adam Pauls Sam Thomson KELM 138 2 0 08 Jul 2023
On Practical Aspects of Aggregation Defenses against Data Poisoning Attacks Wenxiao Wang Soheil Feizi AAML 138 1 0 28 Jun 2023
Multilingual Word Error Rate Estimation: e-WER3 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 Shammur A. Chowdhury Ahmed M. Ali 159 9 0 02 Apr 2023
Cascading Hierarchical Networks with Multi-task Balanced Loss for Fine-grained hashing Xianxian Zeng Yanjun Zheng 118 5 0 20 Mar 2023
Verbal behavior without syntactic structures: beyond Skinner and Chomsky S. Edelman 22 0 0 11 Mar 2023
Factual Consistency Oriented Speech Recognition Interspeech (Interspeech), 2023 Naoyuki Kanda Takuya Yoshioka Yang Liu 190 1 0 24 Feb 2023
From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition International Conference on Human Factors in Computing Systems (CHI), 2023 Colin S. Lea Zifang Huang Lauren Tooley Jaya Narain Dianna Yee P. Georgiou Dung Tien Tran Jeffrey P. Bigham Leah Findlater 323 43 0 17 Feb 2023
Improved Speech Pre-Training with Supervision-Enhanced Acoustic Unit Pengcheng Li Genshun Wan Fenglin Ding Hang Chen Jianqing Gao Jia Pan Cong Liu SSL 157 1 0 07 Dec 2022
An Investigation of Smart Contract for Collaborative Machine Learning Model Training Sheng Ding Chenhui Hu 136 3 0 12 Sep 2022
Lethal Dose Conjecture on Data Poisoning Neural Information Processing Systems (NeurIPS), 2022 Wenxiao Wang Alexander Levine Soheil Feizi FedML 151 17 0 05 Aug 2022
Online Continual Learning of End-to-End Speech Recognition Models Interspeech (Interspeech), 2022 Muqiao Yang Ian Lane Shinji Watanabe CLL 124 26 0 11 Jul 2022
Toward Zero Oracle Word Error Rate on the Switchboard Benchmark Interspeech (Interspeech), 2022 Arlo Faria Adam L. Janin Korbinian Riedhammer Sidhi Adkoli 147 2 0 13 Jun 2022
Virtual embeddings and self-consistency for self-supervised learning T. Bdair Hossam Abdelhamid Nassir Navab Shadi Albarqouni SSL 101 0 0 13 Jun 2022
Searching Similarity Measure for Binarized Neural Networks Yanfei Li Ang Li Huimin Yu 123 0 0 05 Jun 2022
A Survey of Robust Adversarial Training in Pattern Recognition: Fundamental, Theory, and Methodologies Pattern Recognition (Pattern Recogn.), 2022 Zhuang Qian Kaizhu Huang Qiufeng Wang Xu-Yao Zhang OOD AAML ObjD 191 89 0 26 Mar 2022
A Survey of Multilingual Models for Automatic Speech Recognition International Conference on Language Resources and Evaluation (LREC), 2022 Hemant Yadav Sunayana Sitaram 135 47 0 25 Feb 2022
DeepSketch: A New Machine Learning-Based Reference Search Technique for Post-Deduplication Delta Compression USENIX Conference on File and Storage Technologies (FAST), 2022 Mohammad Sadrosadati Jeoggyun Kim Yeseong Kim Sungjin Lee O. Mutlu 94 29 0 17 Feb 2022
Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation International Conference on Machine Learning (ICML), 2022 Wenxiao Wang Alexander Levine Soheil Feizi AAML 183 64 0 05 Feb 2022
Robust Self-Supervised Audio-Visual Speech Recognition Interspeech (Interspeech), 2022 Bowen Shi Wei-Ning Hsu Abdel-rahman Mohamed 257 114 0 05 Jan 2022
Survey on the Convergence of Machine Learning and Blockchain Intelligent Systems with Applications (ISA), 2022 Sheng Ding Chenhui Hu SyDa 195 14 0 04 Jan 2022
VoiceMoji: A Novel On-Device Pipeline for Seamless Emoji Insertion in Dictation IEEE India Conference (INDICON), 2021 Sumit Kumar S. HarichandanaBS Himanshu Arora 85 2 0 22 Dec 2021
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech Rohit Paturi S. Srinivasan Katrin Kirchhoff Daniel Garcia-Romero 197 9 0 10 Dec 2021
Denoised Internal Models: a Brain-Inspired Autoencoder against Adversarial Attacks Machine Intelligence Research (MIR), 2021 Kaiyuan Liu Xingyu Li Yu-Rui Lai Hong Xie Hang Su Jiacheng Wang Chunxu Guo J. Guan Yi Zhou AAML 206 4 0 21 Nov 2021
DeepSteal: Advanced Model Extractions Leveraging Efficient Weight Stealing in Memories Adnan Siraj Rakin Md Hafizul Islam Chowdhuryy Fan Yao Deliang Fan AAML MIACV 140 140 0 08 Nov 2021
GPU-Accelerated Forward-Backward algorithm with Application to Lattice-Free MMI IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 Lucas Ondel Léa-Marie Lam-Yee-Mui M. Kocour Caio Corro L. Burget 107 5 0 22 Oct 2021
User-Initiated Repetition-Based Recovery in Multi-Utterance Dialogue Systems Hoang Long Nguyen Vincent Renkens J. Pelemans Srividya Pranavi Potharaju Anil Kumar Nalamalapu Murat Akbacak 87 3 0 02 Aug 2021
The History of Speech Recognition to the Year 2030 Awni Y. Hannun AI4TS 201 24 0 30 Jul 2021
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus Hamdy Mubarak A. Hussein Shammur A. Chowdhury Ahmed M. Ali 107 60 0 24 Jun 2021
Dialectal Speech Recognition and Translation of Swiss German Speech to Standard German Text: Microsoft's Submission to SwissText 2021 Yuriy Arabskyy Aashish Agarwal S. Dey Oscar Koller 88 12 0 15 Jun 2021