v1v2v3v4 (latest)

DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment

IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2024

16 January 2024

ArXiv (abs)PDF HTML Github (385★)

Papers citing "DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment"

6 / 6 papers shown

NaturalVoices: A Large-Scale, Spontaneous and Emotional Podcast Dataset for Voice Conversion

Zongyang Du

Shreeram Suresh Chandra

201

31 Oct 2025

Emotional Styles Hide in Deep Speaker Embeddings: Disentangle Deep Speaker Embeddings for Speaker Clustering

233

27 Sep 2025

DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-SpeechInterspeech (Interspeech), 2025

246

26 May 2025

JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

140

10 Jan 2025

ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial TrainingIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

356

08 Jan 2025

EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical VectorIEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2024

519

04 Nov 2024