HierSpeech++: Bridging the Gap between Semantic and Acoustic
Representation of Speech by Hierarchical Variational Inference for Zero-shot
Speech SynthesisIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023 |
Emotion Intensity and its Control for Emotional Voice ConversionIEEE Transactions on Affective Computing (IEEE TAC), 2022 |
Non-autoregressive sequence-to-sequence voice conversionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021 |
Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence
ModelingIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020 |
Nonparallel Voice Conversion with Augmented Classifier Star Generative
Adversarial NetworksIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020 |
Pretraining Techniques for Sequence-to-Sequence Voice ConversionIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2020 |