ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.02480
  4. Cited By
Deep context: end-to-end contextual speech recognition

Deep context: end-to-end contextual speech recognition

7 August 2018
Golan Pundak
Tara N. Sainath
Rohit Prabhavalkar
Anjuli Kannan
Ding Zhao
ArXiv (abs)PDFHTML

Papers citing "Deep context: end-to-end contextual speech recognition"

50 / 145 papers shown
Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning
Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning
Yangui Fang
Jing Peng
Xu Li
Yu Xi
Chengwei Zhang
Guohui Zhong
Kai Yu
219
10
0
24 Dec 2025
A Neural Model for Contextual Biasing Score Learning and Filtering
A Neural Model for Contextual Biasing Score Learning and Filtering
Wanting Huang
Weiran Wang
138
1
0
27 Oct 2025
PAC: Pronunciation-Aware Contextualized Large Language Model-based Automatic Speech Recognition
PAC: Pronunciation-Aware Contextualized Large Language Model-based Automatic Speech Recognition
Li Fu
Yu Xin
Sunlu Zeng
Lu Fan
Youzheng Wu
Xiaodong He
204
1
0
16 Sep 2025
Efficient Trie-based Biasing using K-step Prediction for Rare Word Recognition
Efficient Trie-based Biasing using K-step Prediction for Rare Word Recognition
Chin Yuen Kwok
J. Yip
83
0
0
11 Sep 2025
Enhancing the Robustness of Contextual ASR to Varying Biasing Information Volumes Through Purified Semantic Correlation Joint Modeling
Enhancing the Robustness of Contextual ASR to Varying Biasing Information Volumes Through Purified Semantic Correlation Joint ModelingIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Yue Gu
Zhihao Du
Ying Shi
Shiliang Zhang
Qian Chen
Jiqing Han
146
1
0
07 Sep 2025
TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition
TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition
Minh N. H. Nguyen
Anh Nguyen Tran
Dung Truong Dinh
Nam Van Vo
233
0
0
07 Sep 2025
Contextualized Token Discrimination for Speech Search Query Correction
Contextualized Token Discrimination for Speech Search Query Correction
Junyu Lu
Di Jiang
Mengze Hong
Victor Junqiu Wei
Qintian Guo
Zhiyang Su
200
3
0
04 Sep 2025
PARCO: Phoneme-Augmented Robust Contextual ASR via Contrastive Entity Disambiguation
PARCO: Phoneme-Augmented Robust Contextual ASR via Contrastive Entity Disambiguation
Jiajun He
Naoki Sawada
Koichi Miyazaki
Tomoki Toda
256
1
0
04 Sep 2025
Generative Annotation for ASR Named Entity Correction
Generative Annotation for ASR Named Entity Correction
Yuanchang Luo
Daimeng Wei
Shaojun Li
Hengchao Shang
Jiaxin Guo
...
Zhanglin Wu
Xiaoyu Chen
Zhiqiang Rao
Jinlong Yang
Hao Yang
193
2
0
28 Aug 2025
Zero-shot Context Biasing with Trie-based Decoding using Synthetic Multi-Pronunciation
Zero-shot Context Biasing with Trie-based Decoding using Synthetic Multi-Pronunciation
Changsong Liu
Yizhou Peng
Eng Siong Chng
219
0
0
25 Aug 2025
H-PRM: A Pluggable Hotword Pre-Retrieval Module for Various Speech Recognition Systems
H-PRM: A Pluggable Hotword Pre-Retrieval Module for Various Speech Recognition Systems
Huangyu Dai
Lingtao Mao
Ben Chen
Zihan Wang
Zihan Liang
Ying Han
Chenyi Lei
Han Li
KELM
153
0
0
22 Aug 2025
TurboBias: Universal ASR Context-Biasing powered by GPU-accelerated Phrase-Boosting Tree
TurboBias: Universal ASR Context-Biasing powered by GPU-accelerated Phrase-Boosting Tree
A. Andrusenko
Vladimir Bataev
Lilit Grigoryan
Vitaly Lavrukhin
Boris Ginsburg
391
2
0
09 Aug 2025
Context Biasing for Pronunciation-Orthography Mismatch in Automatic Speech Recognition
Context Biasing for Pronunciation-Orthography Mismatch in Automatic Speech Recognition
Christian Huber
Alexander Waibel
195
0
0
23 Jun 2025
Improving Speech Recognition of Named Entities in Classroom Speech with LLM Revision and Phonetic-Semantic Context
Improving Speech Recognition of Named Entities in Classroom Speech with LLM Revision and Phonetic-Semantic Context
V. Trinh
Xinlu He
Jacob Whitehill
KELM
398
2
0
12 Jun 2025
OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary
OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary
Yui Sudo
Yusuke Fujita
Atsushi Kojima
Tomoya Mizumoto
Lianbo Liu
243
2
0
11 Jun 2025
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing
Yu Nakagome
Michael Hentschel
283
0
0
02 Jun 2025
PMF-CEC: Phoneme-augmented Multimodal Fusion for Context-aware ASR Error Correction with Error-specific Selective Decoding
PMF-CEC: Phoneme-augmented Multimodal Fusion for Context-aware ASR Error Correction with Error-specific Selective DecodingIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Jiajun He
Tomoki Toda
238
4
0
31 May 2025
Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation
Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation
Zhennan Lin
Kaixun Huang
Wei Ren
Linju Yang
Lei Xie
AI4CE
271
0
0
29 May 2025
BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM
BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM
Xun Gong
Anqi Lv
Zhiming Wang
Huijia Zhu
Y. Qian
276
10
0
25 May 2025
Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction
Reward-Driven Interaction: Enhancing Proactive Dialogue Agents through User Satisfaction Prediction
Wei Shen
Xiaonan He
Chuheng Zhang
Xuyun Zhang
Xiaolong Xu
Wanchun Dou
178
0
0
24 May 2025
AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR
AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR
Chuong Chu
Vu Tuan Dat Pham
Kien Dao
Hoang Nguyen
Quoc Hung Truong
167
2
0
13 Jan 2025
CTC-Assisted LLM-Based Contextual ASR
CTC-Assisted LLM-Based Contextual ASRSpoken Language Technology Workshop (SLT), 2024
Guanrou Yang
Tianhao Shen
Zhifu Gao
Shiliang Zhang
Xie Chen
264
23
0
10 Nov 2024
Optimizing Contextual Speech Recognition Using Vector Quantization for
  Efficient Retrieval
Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient RetrievalIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024
Nikolaos Flemotomos
Roger Hsiao
P. Swietojanski
Takaaki Hori
Dogan Can
Xiaodan Zhuang
564
3
0
01 Nov 2024
Deep CLAS: Deep Contextual Listen, Attend and Spell
Deep CLAS: Deep Contextual Listen, Attend and Spell
Shifu Xiong
Mengzhi Wang
Genshun Wan
Hang Chen
Jianqing Gao
Lirong Dai
222
2
0
26 Sep 2024
Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices
Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices
L. Velikovich
Christopher Li
D. Caseiro
Shankar Kumar
Pat Rondon
Kandarp Joshi
Xavier Velez
KELM
432
2
0
24 Sep 2024
Unifying Global and Near-Context Biasing in a Single Trie Pass
Unifying Global and Near-Context Biasing in a Single Trie PassInternational Conference on Text, Speech and Dialogue (TSD), 2024
Iuliia Thorbecke
Esaú Villatoro-Tello
Juan Zuluaga-Gomez
Shashi Kumar
Sergio Burdisso
...
A. Ganapathiraju
P. Motlícek
Karthik Pandia
Kadri Hacioğlu
Andreas Stolcke
417
0
0
20 Sep 2024
Contextualization of ASR with LLM using phonetic retrieval-based
  augmentation
Contextualization of ASR with LLM using phonetic retrieval-based augmentationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Zhihong Lei
Xingyu Na
Mingbin Xu
Ernest Pusateri
Christophe Van Gysel
Yuanyuan Zhang
Shiyi Han
Zhen Huang
322
13
0
11 Sep 2024
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech
  Recognition
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech RecognitionSpoken Language Technology Workshop (SLT), 2024
Yi-Cheng Wang
Li-Ting Pai
Bi-Cheng Yan
Hsin-Wei Wang
Chi-Han Lin
Berlin Chen
240
3
0
10 Sep 2024
XCB: an effective contextual biasing approach to bias cross-lingual
  phrases in speech recognition
XCB: an effective contextual biasing approach to bias cross-lingual phrases in speech recognition
Xucheng Wan
Naijun Zheng
Kai Liu
Huan Zhou
221
0
0
20 Aug 2024
Enhancing Large Language Model-based Speech Recognition by
  Contextualization for Rare and Ambiguous Words
Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words
Kento Nozawa
Takashi Masuko
Toru Taniguchi
230
3
0
15 Aug 2024
Improving Neural Biasing for Contextual Speech Recognition by Early
  Context Injection and Text Perturbation
Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation
Ruizhe Huang
M. Yarmohammadi
Sanjeev Khudanpur
Dan Povey
289
13
0
14 Jul 2024
Contextualized End-to-end Automatic Speech Recognition with Intermediate
  Biasing Loss
Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss
Muhammad Shakeel
Yui Sudo
Yifan Peng
Shinji Watanabe
AI4CE
324
11
0
23 Jun 2024
An efficient text augmentation approach for contextualized Mandarin
  speech recognition
An efficient text augmentation approach for contextualized Mandarin speech recognitionInterspeech (Interspeech), 2024
Naijun Zheng
Xucheng Wan
Kai Liu
Ziqing Du
Zhou Huan
217
2
0
14 Jun 2024
Fast Context-Biasing for CTC and Transducer ASR models with CTC-based
  Word Spotter
Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter
A. Andrusenko
A. Laptev
Vladimir Bataev
Vitaly Lavrukhin
Boris Ginsburg
357
7
0
11 Jun 2024
Text Injection for Neural Contextual Biasing
Text Injection for Neural Contextual Biasing
Zhong Meng
Zelin Wu
Rohit Prabhavalkar
Cal Peyser
Weiran Wang
Nanxin Chen
Tara N. Sainath
Bhuvana Ramabhadran
360
6
0
05 Jun 2024
Keyword-Guided Adaptation of Automatic Speech Recognition
Keyword-Guided Adaptation of Automatic Speech Recognition
Aviv Shamsian
Aviv Navon
Neta Glazer
Gill Hetz
Joseph Keshet
333
5
0
04 Jun 2024
Contextualized Automatic Speech Recognition with Dynamic Vocabulary
Contextualized Automatic Speech Recognition with Dynamic Vocabulary
Yui Sudo
Yosuke Fukumoto
Muhammad Shakeel
Yifan Peng
Shinji Watanabe
333
11
0
22 May 2024
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context
  Encoding for Non-Streaming ASR
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR
Zelin Wu
Gan Song
Christopher Li
Pat Rondon
Zhong Meng
...
D. Caseiro
Golan Pundak
Tsendsuren Munkhdalai
Angad Chandorkar
Rohit Prabhavalkar
378
5
0
15 Apr 2024
Transducers with Pronunciation-aware Embeddings for Automatic Speech
  Recognition
Transducers with Pronunciation-aware Embeddings for Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Hainan Xu
Zhehuai Chen
Fei Jia
Boris Ginsburg
198
0
0
04 Apr 2024
DANCER: Entity Description Augmented Named Entity Corrector for
  Automatic Speech Recognition
DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
Yi-Cheng Wang
Hsin-Wei Wang
Bi-Cheng Yan
Chi-Han Lin
Berlin Chen
268
3
0
26 Mar 2024
M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual
  Academic Lecture Dataset
M3^33AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
Zhe Chen
Heyang Liu
Wenyi Yu
Guangzhi Sun
Hongcheng Liu
Ji Wu
Chao Zhang
Yu Wang
Yanfeng Wang
VGen
267
5
0
21 Mar 2024
Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn
  Medical Interview
Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
Heyang Liu
Yu Wang
Yanfeng Wang
316
1
0
01 Mar 2024
Representing Online Handwriting for Recognition in Large Vision-Language
  Models
Representing Online Handwriting for Recognition in Large Vision-Language Models
Anastasiia Fadeeva
Philippe Schlattner
Andrii Maksai
Mark Collier
Efi Kokiopoulou
Jesse Berent
C. Musat
323
8
0
23 Feb 2024
Self-consistent context aware conformer transducer for speech
  recognition
Self-consistent context aware conformer transducer for speech recognition
Konstantin Kolokolov
Pavel Pekichev
Karthik Raghunathan
217
1
0
09 Feb 2024
Locality enhanced dynamic biasing and sampling strategies for contextual
  ASR
Locality enhanced dynamic biasing and sampling strategies for contextual ASRAutomatic Speech Recognition & Understanding (ASRU), 2023
Md. Asif Jalal
Pablo Peso Parada
George Pavlidis
Vasileios Moschopoulos
Karthikeyan P. Saravanan
...
Jisi Zhang
Anastasios Drosou
Gil Ho Lee
Jungin Lee
Seokyeong Jung
278
4
0
23 Jan 2024
Contextualized Automatic Speech Recognition with Attention-Based Bias
  Phrase Boosted Beam Search
Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search
Yui Sudo
Muhammad Shakeel
Yosuke Fukumoto
Yifan Peng
Shinji Watanabe
271
20
0
19 Jan 2024
Improving ASR Contextual Biasing with Guided Attention
Improving ASR Contextual Biasing with Guided AttentionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jiyang Tang
Kwangyoun Kim
Suwon Shon
Felix Wu
Prashant Sridhar
Shinji Watanabe
213
26
0
16 Jan 2024
Promptformer: Prompted Conformer Transducer for ASR
Promptformer: Prompted Conformer Transducer for ASRIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Sergio Duarte Torres
Arunasish Sen
Aman Rana
Lukas Drude
Alejandro Gomez-Alanis
Andreas Schwarz
Leif Rädel
Volker Leutnant
316
3
0
14 Jan 2024
LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition
LCB-net: Long-Context Biasing for Audio-Visual Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Fan Yu
Haoxu Wang
Xian Shi
Shiliang Zhang
260
6
0
12 Jan 2024
High-precision Voice Search Query Correction via Retrievable Speech-text
  Embedings
High-precision Voice Search Query Correction via Retrievable Speech-text Embedings
Christopher Li
Gary Wang
Kyle Kastner
Heng Su
Allen Chen
...
Zelin Wu
L. Velikovich
Pat Rondon
D. Caseiro
Petar S. Aleksic
201
2
0
08 Jan 2024
123
Next
Page 1 of 3