SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS PredictionInternational Conference on Signal Processing and Communications (ICSPC), 2024 |
Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and TextIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 |
A Study on Incorporating Whisper for Robust Speech AssessmentIEEE International Conference on Multimedia and Expo (ICME), 2023 |
RAMP: Retrieval-Augmented MOS Prediction via Confidence-based Dynamic
WeightingInterspeech (Interspeech), 2023 |
MultiPA: A Multi-task Speech Pronunciation Assessment Model for Open
Response ScenariosInterspeech (Interspeech), 2023 |
BASPRO: a balanced script producer for speech corpus collection based on
the genetic algorithmAPSIPA Transactions on Signal and Information Processing (TASIP), 2022 |