
Title |
|---|
![]() PMF-CEC: Phoneme-augmented Multimodal Fusion for Context-aware ASR Error Correction with Error-specific Selective DecodingIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025 |
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLMAnnual Meeting of the Association for Computational Linguistics (ACL), 2025 |
![]() CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot
Text-to-SpeechInternational Conference on Learning Representations (ICLR), 2024 |
![]() PhoBERT: Pre-trained language models for VietnameseFindings (Findings), 2020 |