DiffProsody: Diffusion-based Latent Prosody Generation for Expressive
Speech Synthesis with Prosody Conditional Adversarial Training
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2023 |
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for
Speech Synthesis based on Disentanglement between Prosody and Timbre
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2022 |