iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP), 2022
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Interspeech (Interspeech), 2022