2310.09988
Cited By
Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization
16 October 2023
Zhihong Lei
Ernest Pusateri
Shiyi Han
Leo Liu
Mingbin Xu
Tim Ng
R. Travadi
Youyuan Zhang
Mirko Hannemann
Man-Hung Siu
Zhen Huang
Papers citing
"Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization"
10 / 10 papers shown
Title
ValSub: Subsampling Validation Data to Mitigate Forgetting during ASR Personalization
Haaris Mehmood
Karthikeyan P. Saravanan
Pablo Peso Parada
David Tuckey
Mete Ozay
Gil Ho Lee
Jungin Lee
Seokyeong Jung
52
0
0
12 Mar 2025
Contextualization of ASR with LLM using phonetic retrieval-based augmentation
Zhihong Lei
Xingyu Na
Mingbin Xu
Ernest Pusateri
Christophe Van Gysel
Yuanyuan Zhang
Shiyi Han
Zhen Huang
28
2
0
11 Sep 2024
Retrieval Augmented Correction of Named Entity Speech Recognition Errors
Ernest Pusateri
Anmol Walia
Anirudh Kashi
Bortik Bandyopadhyay
Nadia Hyder
Sayantan Mahinder
R. Anantha
Daben Liu
Sashank Gondala
RALM
3DV
26
2
0
09 Sep 2024
Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation
Ruizhe Huang
M. Yarmohammadi
Sanjeev Khudanpur
Dan Povey
33
2
0
14 Jul 2024
New Solutions on LLM Acceleration, Optimization, and Application
Yingbing Huang
Lily Jiaxin Wan
Hanchen Ye
Manvi Jha
Jinghua Wang
Yuhong Li
Xiaofan Zhang
Deming Chen
42
12
0
16 Jun 2024
Enhancing CTC-based speech recognition with diverse modeling units
Shiyi Han
Zhihong Lei
Mingbin Xu
Xingyu Na
Zhen Huang
33
0
0
05 Jun 2024
Conformer-Based Speech Recognition On Extreme Edge-Computing Devices
Mingbin Xu
Alex Jin
Sicheng Wang
Mu Su
Tim Ng
...
Shiyi Han
Zhihong Lei
Yaqiao Deng
Zhen Huang
Mahesh Krishnamoorthy
17
4
0
16 Dec 2023
Acoustic Model Fusion for End-to-end Speech Recognition
Zhihong Lei
Mingbin Xu
Shiyi Han
Leo Liu
Zhen Huang
...
Yuanyuan Zhang
Ernest Pusateri
Mirko Hannemann
Yaqiao Deng
Man-Hung Siu
11
5
0
10 Oct 2023
Scaling Speech Technology to 1,000+ Languages
Vineel Pratap
Andros Tjandra
Bowen Shi
Paden Tomasello
Arun Babu
...
Yossi Adi
Xiaohui Zhang
Wei-Ning Hsu
Alexis Conneau
Michael Auli
VLM
77
298
0
22 May 2023
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages
Yu Zhang
Wei Han
James Qin
Yongqiang Wang
Ankur Bapna
...
Pedro J. Moreno
Chung-Cheng Chiu
J. Schalkwyk
Françoise Beaufays
Yonghui Wu
VLM
79
252
0
02 Mar 2023