Multimodal Perception System for Real Open EnvironmentInternational Conference on Signal Processing Systems (ICSPS), 2024 Yuyang Sha |
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band
Generation and Inverse Short-Time Fourier TransformIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022 |
Parallel Synthesis for Autoregressive Speech GenerationIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022 |