VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech

3 November 2020

Haizhou Li

Papers citing "VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech"

24 / 24 papers shown

Textless and Non-Parallel Speech-to-Speech Emotion Style Transfer

Soumya Dutta

Avni Jain

Sriram Ganapathy

317

23 May 2025

EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models

456

14 Mar 2025

A Review of Human Emotion Synthesis Based on Generative Technology

...

318

10 Dec 2024

Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Xin Jing

Kun Zhou

Andreas Triantafyllopoulos

Björn W. Schuller

DiffM

251

10 Sep 2024

Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis

Nguyen Trung Hieu

Bin Ma

261

04 Jun 2024

Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion ModelThe Speaker and Language Recognition Workshop (Odyssey), 2024

294

02 May 2024

Fine-Grained Quantitative Emotion Editing for Speech Generation

Sho Inoue

Kun Zhou

Shuai Wang

Haizhou Li

277

04 Mar 2024

DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text AlignmentIEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2024

740

16 Jan 2024

Zero Shot Audio to Audio Emotion Transfer With Speaker DisentanglementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Soumya Dutta

Sriram Ganapathy

230

09 Jan 2024

Attention-based Interactive Disentangling Network for Instance-level Emotional Voice ConversionInterspeech (Interspeech), 2023

Yun Chen

Lingxiao Yang

Qi Chen

Jianhuang Lai

Xiaohua Xie

176

29 Dec 2023

In-the-wild Speech Emotion Conversion Using Disentangled Self-Supervised Representations and Neural Vocoder-based Resynthesis

N. Prabhu

N. Lehmann-Willenbrock

Timo Gerkmann

231

02 Jun 2023

Privacy in Speech Technology

Tomas Bäckström

449

09 May 2023

EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face AnimationIEEE International Conference on Computer Vision (ICCV), 2023

Jun He

498

188

20 Mar 2023

Mixed-EVC: Mixed Emotion Synthesis and Control in Voice ConversionThe Speaker and Language Recognition Workshop (Odyssey), 2022

Haizhou Li

355

25 Oct 2022

Speech Synthesis with Mixed EmotionsIEEE Transactions on Affective Computing (IEEE TAC), 2022

Haizhou Li

368

11 Aug 2022

SpeechSplit 2.0: Unsupervised speech disentanglement for voice conversion Without tuning autoencoder BottlenecksIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

Kaizhi Qian

304

26 Mar 2022

Emotion Intensity and its Control for Emotional Voice ConversionIEEE Transactions on Affective Computing (IEEE TAC), 2022

Kun Zhou

Berrak Sisman

R. Rana

Björn W. Schuller

Haizhou Li

421

10 Jan 2022

How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition

Dong Wang

145

24 Nov 2021

Textless Speech Emotion Conversion using Discrete and Decomposed RepresentationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Yossi Adi

392

14 Nov 2021

Disentanglement of Emotional Style and Speaker Identity for Expressive Voice ConversionInterspeech (Interspeech), 2021

Zongyang Du

Berrak Sisman

Kun Zhou

Haizhou Li

297

20 Oct 2021

Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style TransferAutomatic Speech Recognition & Understanding (ASRU), 2021

Zongyang Du

Berrak Sisman

Kun Zhou

Haizhou Li

331

08 Jul 2021

Global Rhythm Style Transfer Without Text Transcriptions

Kaizhi Qian

Yang Zhang

Shiyu Chang

Jinjun Xiong

Chuang Gan

David D. Cox

M. Hasegawa-Johnson

275

16 Jun 2021

Emotional Voice Conversion: Theory, Databases and ESDSpeech Communication (Speech Commun.), 2021

Kun Zhou

Berrak Sisman

Rui Liu

Haizhou Li

531

264

31 May 2021

Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence TrainingInterspeech (Interspeech), 2021

Kun Zhou

Berrak Sisman

Haizhou Li

407

31 Mar 2021