Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2201.12806
Cited By
v1
v2 (latest)
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
30 January 2022
Minglun Han
Linhao Dong
Zhenlin Liang
Meng Cai
Shiyu Zhou
Zejun Ma
Bo Xu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection"
38 / 38 papers shown
A Neural Model for Contextual Biasing Score Learning and Filtering
Wanting Huang
Weiran Wang
136
1
0
27 Oct 2025
Retrieval Augmented Generation based context discovery for ASR
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Dimitrios Siskos
Stavros Papadopoulos
Pablo Peso Parada
Jisi Zhang
Karthikeyan P. Saravanan
Anastasios Drosou
RALM
281
1
0
23 Sep 2025
Enhancing the Robustness of Contextual ASR to Varying Biasing Information Volumes Through Purified Semantic Correlation Joint Modeling
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Yue Gu
Zhihao Du
Ying Shi
Shiliang Zhang
Qian Chen
Jiqing Han
146
1
0
07 Sep 2025
New Insights into Optimal Alignment of Acoustic and Linguistic Representations for Knowledge Transfer in ASR
Xugang Lu
Peng Shen
Yu Tsao
187
0
0
06 Sep 2025
PARCO: Phoneme-Augmented Robust Contextual ASR via Contrastive Entity Disambiguation
Jiajun He
Naoki Sawada
Koichi Miyazaki
Tomoki Toda
248
0
0
04 Sep 2025
H-PRM: A Pluggable Hotword Pre-Retrieval Module for Various Speech Recognition Systems
Huangyu Dai
Lingtao Mao
Ben Chen
Zihan Wang
Zihan Liang
Ying Han
Chenyi Lei
Han Li
KELM
147
0
0
22 Aug 2025
Locate-and-Focus: Enhancing Terminology Translation in Speech Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Suhang Wu
Jialong Tang
Chengyi Yang
Pei Zhang
Baosong Yang
Junhui Li
Junfeng Yao
Min Zhang
Jinsong Su
179
3
0
24 Jul 2025
Context Biasing for Pronunciation-Orthography Mismatch in Automatic Speech Recognition
Christian Huber
Alexander Waibel
182
0
0
23 Jun 2025
Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation
Zhennan Lin
Kaixun Huang
Wei Ren
Linju Yang
Lei Xie
AI4CE
268
0
0
29 May 2025
Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR
Xugang Lu
Peng Shen
Yu Tsao
Hisashi Kawai
OT
344
0
0
19 May 2025
Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2024
Nikolaos Flemotomos
Roger Hsiao
P. Swietojanski
Takaaki Hori
Dogan Can
Xiaodan Zhuang
558
3
0
01 Nov 2024
Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss
Muhammad Shakeel
Yui Sudo
Yifan Peng
Shinji Watanabe
AI4CE
316
11
0
23 Jun 2024
An efficient text augmentation approach for contextualized Mandarin speech recognition
Interspeech (Interspeech), 2024
Naijun Zheng
Xucheng Wan
Kai Liu
Ziqing Du
Zhou Huan
217
2
0
14 Jun 2024
Contextualized Automatic Speech Recognition with Dynamic Vocabulary
Yui Sudo
Yosuke Fukumoto
Muhammad Shakeel
Yifan Peng
Shinji Watanabe
330
11
0
22 May 2024
Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR
Zelin Wu
Gan Song
Christopher Li
Pat Rondon
Zhong Meng
...
D. Caseiro
Golan Pundak
Tsendsuren Munkhdalai
Angad Chandorkar
Rohit Prabhavalkar
374
5
0
15 Apr 2024
Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview
Heyang Liu
Yu Wang
Yanfeng Wang
313
1
0
01 Mar 2024
MM-LLMs: Recent Advances in MultiModal Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Duzhen Zhang
Yahan Yu
Jiahua Dong
Chenxing Li
Dan Su
Chenhui Chu
Dong Yu
OffRL
LRM
715
389
0
24 Jan 2024
Locality enhanced dynamic biasing and sampling strategies for contextual ASR
Automatic Speech Recognition & Understanding (ASRU), 2023
Md. Asif Jalal
Pablo Peso Parada
George Pavlidis
Vasileios Moschopoulos
Karthikeyan P. Saravanan
...
Jisi Zhang
Anastasios Drosou
Gil Ho Lee
Jungin Lee
Seokyeong Jung
276
4
0
23 Jan 2024
Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search
Yui Sudo
Muhammad Shakeel
Yosuke Fukumoto
Yifan Peng
Shinji Watanabe
268
19
0
19 Jan 2024
LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Fan Yu
Haoxu Wang
Xian Shi
Shiliang Zhang
257
5
0
12 Jan 2024
Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring
Ankitha Sudarshan
Vinay Samuel
Parth Patwa
Ibtihel Amara
Vasu Sharma
451
3
0
14 Oct 2023
ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction
Automatic Speech Recognition & Understanding (ASRU), 2023
Jiajun He
Zekun Yang
Tomoki Toda
223
11
0
08 Oct 2023
Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition
Automatic Speech Recognition & Understanding (ASRU), 2023
Kaixun Huang
Aoting Zhang
Binbin Zhang
Tianyi Xu
Xingchen Song
Lei Xie
214
5
0
07 Oct 2023
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Interspeech (Interspeech), 2023
Weiran Wang
Zelin Wu
D. Caseiro
Tsendsuren Munkhdalai
K. Sim
...
Rohit Prabhavalkar
Zhong Meng
Ding Zhao
Tara N. Sainath
P. M. Mengibar
355
12
0
29 Sep 2023
Cross-modal Alignment with Optimal Transport for CTC-based ASR
Automatic Speech Recognition & Understanding (ASRU), 2023
Xugang Lu
Peng Shen
Yu Tsao
Hisashi Kawai
329
8
0
24 Sep 2023
A Multitask Training Approach to Enhance Whisper with Contextual Biasing and Open-Vocabulary Keyword Spotting
Interspeech (Interspeech), 2023
Yuang Li
Min Zhang
Yan Yu
Yinglu Li
Xiaosong Qiao
Mengxin Ren
Miaomiao Ma
Daimeng Wei
Shimin Tao
Hao Yang
301
10
0
18 Sep 2023
SeACo-Paraformer: A Non-Autoregressive ASR System with Flexible and Effective Hotword Customization Ability
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xian Shi
Yexin Yang
Zerui Li
Yanni Chen
Zhifu Gao
Shiliang Zhang
357
24
0
07 Aug 2023
N-gram Boosting: Improving Contextual Biasing with Normalized N-gram Targets
Wang Yau Li
Shreekantha Nadig
K. Chang
Zafarullah Mahmood
Riqiang Wang
Simon Vandieken
Jonas Robertson
Frederic Mailhot
237
0
0
04 Aug 2023
VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ziyi Ni
Minglun Han
Feilong Chen
Linghui Meng
Jing Shi
Shuang Xu
Bo Xu
236
4
0
31 May 2023
CopyNE: Better Contextual ASR by Copying Named Entities
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shilin Zhou
Zhenghua Li
Yu Hong
Hao Fei
Zhefeng Wang
Baoxing Huai
334
15
0
22 May 2023
Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Xuandi Fu
Kanthashree Mysore Sathyendra
Ankur Gandhe
Jing Liu
Grant P. Strimel
Ross McGowan
Athanasios Mouchtaris
456
24
0
09 May 2023
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Feilong Chen
Minglun Han
Haozhi Zhao
Qingyang Zhang
Jing Shi
Shuang Xu
Bo Xu
MLLM
421
158
0
07 May 2023
CB-Conformer: Contextual biasing Conformer for biased word recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Yaoxun Xu
Baiji Liu
Qiaochu Huang and
Xingcheng Song
Zhiyong Wu
Shiyin Kang
Helen Meng
317
19
0
19 Apr 2023
Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Zefa Hu
Xiuyi Chen
Hao Wu
Minglun Han
Ziyi Ni
Jing Shi
Shuang Xu
Bo Xu
178
8
0
02 Mar 2023
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2023
Minglun Han
Qingyu Wang
Tielin Zhang
Yi Wang
Duzhen Zhang
Bo Xu
205
38
0
02 Feb 2023
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
Interspeech (Interspeech), 2023
Minglun Han
Feilong Chen
Jing Shi
Shuang Xu
Bo Xu
VLM
257
17
0
30 Jan 2023
Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer
Interspeech (Interspeech), 2023
Zhanheng Yang
Sining Sun
Xiong Wang
Yike Zhang
Long Ma
Linfu Xie
272
16
0
17 Jan 2023
Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition
Interspeech (Interspeech), 2022
Guangzhi Sun
Chuxu Zhang
P. Woodland
190
19
0
02 Jul 2022
1
Page 1 of 1