An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution

11 April 2024

Berlin Chen

Abstract

Automated speaking assessment (ASA) typically involves automatic speech recognition (ASR) and hand-crafted feature extraction from the ASR transcript of a learner's speech. Recently, self-supervised learning (SSL) has shown stellar performance compared to traditional methods. However, SSL-based ASA systems are faced with at least three data-related challenges: limited annotated data, uneven distribution of learner proficiency levels and non-uniform score intervals between different CEFR proficiency levels. To address these challenges, we explore the use of two novel modeling strategies: metric-based classification and loss reweighting, leveraging distinct SSL-based embedding features. Extensive experimental results on the ICNALE benchmark dataset suggest that our approach can outperform existing strong baselines by a sizable margin, achieving a significant improvement of more than 10% in CEFR prediction accuracy.

View on arXiv

@article{lo2025_2404.07575,
  title={ An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution },
  author={ Tien-Hong Lo and Fu-An Chao and Tzu-I Wu and Yao-Ting Sung and Berlin Chen },
  journal={arXiv preprint arXiv:2404.07575},
  year={ 2025 }
}

Comments on this paper