PESTalk: Speech-Driven 3D Facial Animation with Personalized Emotional StylesACM Multimedia (ACM MM), 2025 |
MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024 |
Understanding Deep Contrastive Learning via Coordinate-wise OptimizationNeural Information Processing Systems (NeurIPS), 2022 |
Learning spectro-temporal representations of complex sounds with
parameterized neural networksJournal of the Acoustical Society of America (JASA), 2021 |
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker
Verification ChallengeInterspeech (Interspeech), 2020 |