v1v2 (latest)
Analysis about Theoretical Foundations for Method to Enhancing ASR
Performance using OCR Word Frequency Differences
IEEE International Conference on Consumer Electronics (ICCE), 2024
Abstract
As interest in large language models (LLMs) grows, the importance of accuracy in automatic speech recognition (ASR) has become more pronounced. This is particularly true for lectures that include specialized terminology, where the success rate of traditional ASR models tends to be low, posing a challenging problem. A method to improve ASR performance for specialized terminology using the word frequency difference approach has been proposed. Through experiments and data analysis, we investigate whether this proposal effectively addresses the issue. Additionally, we introduce the power law as the theoretical foundation for the relative frequency
View on arXivComments on this paper
