Deep context: end-to-end contextual speech recognition

7 August 2018

Ding Zhao

Papers citing "Deep context: end-to-end contextual speech recognition"


LM-assisted keyword biasing with Aho-Corasick algorithm for Transducer-based ASR Iuliia Thorbecke Juan Zuluaga-Gomez Esaú Villatoro-Tello Andres Carofilis Shashi Kumar P. Motlícek Karthik Pandia A. Ganapathiraju 37 0 0 20 Sep 2024
Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words Kento Nozawa Takashi Masuko Toru Taniguchi 43 1 0 15 Aug 2024
Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation Ruizhe Huang M. Yarmohammadi Sanjeev Khudanpur Dan Povey 43 2 0 14 Jul 2024
An efficient text augmentation approach for contextualized Mandarin speech recognition Naijun Zheng Xucheng Wan Kai Liu Ziqing Du Zhou Huan 42 1 0 14 Jun 2024
Text Injection for Neural Contextual Biasing Zhong Meng Zelin Wu Rohit Prabhavalkar Cal Peyser Weiran Wang Nanxin Chen Tara N. Sainath Bhuvana Ramabhadran 46 3 0 05 Jun 2024
Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition Hainan Xu Zhehuai Chen Fei Jia Boris Ginsburg 41 0 0 04 Apr 2024
Improving ASR Contextual Biasing with Guided Attention Jiyang Tang Kwangyoun Kim Suwon Shon Felix Wu Prashant Sridhar Shinji Watanabe 31 8 0 16 Jan 2024
Promptformer: Prompted Conformer Transducer for ASR Sergio Duarte Torres Arunasish Sen Aman Rana Lukas Drude Alejandro Gomez-Alanis Andreas Schwarz Leif Rädel Volker Leutnant 40 3 0 14 Jan 2024
Retrieve and Copy: Scaling ASR Personalization to Large Catalogs Sai Muralidhar Jayanthi Devang Kulshreshtha Saket Dingliwal S. Ronanki S. Bodapati 38 7 0 14 Nov 2023
Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization Zhihong Lei Ernest Pusateri Shiyi Han Leo Liu Mingbin Xu ... R. Travadi Youyuan Zhang Mirko Hannemann Man-Hung Siu Zhen Huang 23 9 0 16 Oct 2023
SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation Zhehuai Chen He Huang A. Andrusenko Oleksii Hrinchuk Krishna C. Puvvada Jason Chun Lok Li Subhankar Ghosh Jagadeesh Balam Boris Ginsburg LRM 29 51 0 13 Oct 2023
Multilingual Contextual Adapters To Improve Custom Word Recognition In Low-resource Languages Devang Kulshreshtha Saket Dingliwal Brady C. Houston S. Bodapati 14 2 0 03 Jul 2023
Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition Tianyi Xu Zhanheng Yang Kaixun Huang Pengcheng Guo Aoting Zhang Biao Li Changru Chen Chong Li Linfu Xie 22 10 0 01 Jun 2023
External Language Model Integration for Factorized Neural Transducers Michael Levit S. Parthasarathy Cem Aksoylar Mohammad Sadegh Rasooli Shuangyu Chang 29 2 0 26 May 2023
CopyNE: Better Contextual ASR by Copying Named Entities Shilin Zhou Zhenghua Li Yu Hong Mengdi Zhang Zhefeng Wang Baoxing Huai 15 6 0 22 May 2023
Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network Kaixun Huang Aoting Zhang Zhanheng Yang Pengcheng Guo Bingshen Mu Tianyi Xu Linfu Xie 35 16 0 21 May 2023
FunASR: A Fundamental End-to-End Speech Recognition Toolkit Zhifu Gao Zerui Li Jiaming Wang Haoneng Luo Xian Shi ... Yabin Li Lingyun Zuo Zhihao Du Zhangyu Xiao Shiliang Zhang 37 54 0 18 May 2023
Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition Xuandi Fu Kanthashree Mysore Sathyendra Ankur Gandhe Jing Liu Grant P. Strimel Ross McGowan Athanasios Mouchtaris 33 14 0 09 May 2023
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition Saumya Yashmohini Sahai Jing Liu Thejaswi Muniyappa Kanthashree Mysore Sathyendra Anastasios Alexandridis ... Ross McGowan Ariya Rastrow Feng-Ju Chang Athanasios Mouchtaris Siegfried Kunzmann 39 5 0 03 Apr 2023
Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer Zhanheng Yang Sining Sun Xiong Wang Yike Zhang Long Ma Linfu Xie 26 9 0 17 Jan 2023
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results Ao Zhang F. Yu Kaixun Huang Linfu Xie Longbiao Wang Eng Siong Chng Hui Bu Binbin Zhang Wei Chen Xin Xu 32 4 0 03 Nov 2022
End-to-end Spoken Language Understanding with Tree-constrained Pointer Generator Guangzhi Sun C. Zhang P. Woodland 30 8 0 29 Oct 2022
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent? Pradip Pramanick Chayan Sarkar 24 7 0 21 Oct 2022
Helpful Neighbors: Leveraging Neighbors in Geographic Feature Pronunciation Llion Jones R. Sproat Haruko Ishikawa Alexander Gutkin 30 1 0 18 Oct 2022
Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting Saket Dingliwal Monica Sunkara S. Bodapati S. Ronanki Jeffrey J. Farris Katrin Kirchhoff 33 0 0 18 Oct 2022
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model Jennifer Drexler Fox Natalie Delworth KELM 30 18 0 02 Sep 2022
Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages Kaushal Bhogale A. Raman Tahir Javed Sumanth Doddapaneni Anoop Kunchukuttan Pratyush Kumar Mitesh M. Khapra 36 22 0 26 Aug 2022
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition Guangzhi Sun C. Zhang P. Woodland 22 12 0 02 Jul 2022
Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems Jesús Andrés-Ferrer Dario Albesano P. Zhan Paul Vozila 16 6 0 29 Jun 2022
Contextual Adapters for Personalized Speech Recognition in Neural Transducers Kanthashree Mysore Sathyendra Thejaswi Muniyappa Feng-Ju Chang Jing Liu Jinru Su Grant P. Strimel Athanasios Mouchtaris Siegfried Kunzmann 19 75 0 26 May 2022
Minimising Biasing Word Errors for Contextual ASR with the Tree-Constrained Pointer Generator Guangzhi Sun C. Zhang P. Woodland 34 14 0 18 May 2022
Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems Xiaoqiang Wang Yanqing Liu Jinyu Li Veljko Miljanic Sheng Zhao H. Khalil KELM 16 18 0 02 Mar 2022
Context-Aware Transformer Transducer for Speech Recognition Feng-Ju Chang Jing Liu Martin H. Radfar Athanasios Mouchtaris M. Omologo Ariya Rastrow Siegfried Kunzmann 21 79 0 05 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition Jinyu Li VLM 35 363 0 02 Nov 2021
Spell my name: keyword boosted speech recognition Namkyu Jung Geon-min Kim Joon Son Chung 51 13 0 06 Oct 2021
Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition Tsendsuren Munkhdalai K. Sim Angad Chandorkar Fan Gao Mason Chua Trevor Strohman F. Beaufays 32 34 0 05 Oct 2021
Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition Christian Huber Juan Hussain Sebastian Stüker A. Waibel 29 24 0 05 Jul 2021
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion Duc Le Mahaveer Jain Gil Keren Suyoun Kim Yangyang Shi ... Yuan Shangguan Christian Fuegen Ozlem Kalinli Yatharth Saraf M. Seltzer 27 90 0 05 Apr 2021
CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition Minglun Han Linhao Dong Shiyu Zhou Bo Xu 21 21 0 17 Dec 2020
Deep Shallow Fusion for RNN-T Personalization Duc Le Gil Keren Julian Chan Jay Mahadeokar Christian Fuegen M. Seltzer 21 77 0 16 Nov 2020
Class LM and word mapping for contextual biasing in End-to-End ASR Rongqing Huang Ossama Abdel-Hamid Xinwei Li G. Evermann 31 47 0 10 Jul 2020
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition Linhao Dong Cheng Yi Jianzong Wang Shiyu Zhou Shuang Xu X. Jia Bo Xu 36 17 0 20 May 2020
Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model Da-Rong Liu Chunxi Liu Frank Zhang Gabriel Synnaeve Yatharth Saraf Geoffrey Zweig 28 19 0 15 May 2020
Two-Pass End-to-End Speech Recognition Tara N. Sainath Ruoming Pang David Rybach Yanzhang He Rohit Prabhavalkar ... Qiao Liang Trevor Strohman Yonghui Wu Ian McGraw Chung-Cheng Chiu 32 147 0 29 Aug 2019
Listen, Attend, Spell and Adapt: Speaker Adapted Sequence-to-Sequence ASR F. Weninger Jesús Andrés-Ferrer Xinwei Li P. Zhan AI4TS 29 26 0 08 Jul 2019
Acoustic-to-Word Models with Conversational Context Information Suyoun Kim Florian Metze 22 7 0 21 May 2019