ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.02314
  4. Cited By
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in
  Speech

VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech

3 November 2020
Kun Zhou
Berrak Sisman
Haizhou Li
    DRL
ArXiv (abs)PDFHTML

Papers citing "VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech"

24 / 24 papers shown
Textless and Non-Parallel Speech-to-Speech Emotion Style Transfer
Textless and Non-Parallel Speech-to-Speech Emotion Style Transfer
Soumya Dutta
Avni Jain
Sriram Ganapathy
317
0
0
23 May 2025
EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models
EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models
Yixuan Zhang
Qing Chang
Yuxi Wang
Guang Chen
Zhenru Zhang
Junran Peng
456
1
0
14 Mar 2025
A Review of Human Emotion Synthesis Based on Generative Technology
A Review of Human Emotion Synthesis Based on Generative Technology
Fei Ma
Yongqian Li
Yifan Xie
Y. He
Yujiao Shi
...
Z. Liu
Wei Yao
Fuji Ren
Fei Richard Yu
Shiguang Ni
318
15
0
10 Dec 2024
Enhancing Emotional Text-to-Speech Controllability with Natural Language
  Guidance through Contrastive Learning and Diffusion Models
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Xin Jing
Kun Zhou
Andreas Triantafyllopoulos
Björn W. Schuller
DiffM
251
8
0
10 Sep 2024
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis
Kun Zhou
Shengkui Zhao
Yukun Ma
Chong Zhang
Hao Wang
Dianwen Ng
Chongjia Ni
Nguyen Trung Hieu
J. Yip
Bin Ma
261
6
0
04 Jun 2024
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a
  Conditional Diffusion Model
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion ModelThe Speaker and Language Recognition Workshop (Odyssey), 2024
Zongyang Du
Junchen Lu
Kun Zhou
Lakshmish Kaushik
Berrak Sisman
294
7
0
02 May 2024
Fine-Grained Quantitative Emotion Editing for Speech Generation
Fine-Grained Quantitative Emotion Editing for Speech Generation
Sho Inoue
Kun Zhou
Shuai Wang
Haizhou Li
277
5
0
04 Mar 2024
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text AlignmentIEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2024
Hyoung-Seok Oh
Sang-Hoon Lee
Deok-Hyun Cho
Seong-Whan Lee
740
1
0
16 Jan 2024
Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
Zero Shot Audio to Audio Emotion Transfer With Speaker DisentanglementIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Soumya Dutta
Sriram Ganapathy
230
8
0
09 Jan 2024
Attention-based Interactive Disentangling Network for Instance-level
  Emotional Voice Conversion
Attention-based Interactive Disentangling Network for Instance-level Emotional Voice ConversionInterspeech (Interspeech), 2023
Yun Chen
Lingxiao Yang
Qi Chen
Jianhuang Lai
Xiaohua Xie
176
7
0
29 Dec 2023
In-the-wild Speech Emotion Conversion Using Disentangled Self-Supervised
  Representations and Neural Vocoder-based Resynthesis
In-the-wild Speech Emotion Conversion Using Disentangled Self-Supervised Representations and Neural Vocoder-based Resynthesis
N. Prabhu
N. Lehmann-Willenbrock
Timo Gerkmann
231
4
0
02 Jun 2023
Privacy in Speech Technology
Privacy in Speech Technology
Tomas Bäckström
449
11
0
09 May 2023
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face AnimationIEEE International Conference on Computer Vision (ICCV), 2023
Ziqiao Peng
Hao Wu
Zhenbo Song
Hao-Xuan Xu
Xiangyu Zhu
Jun He
Hongyan Liu
Zhaoxin Fan
CVBM
498
188
0
20 Mar 2023
Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion
Mixed-EVC: Mixed Emotion Synthesis and Control in Voice ConversionThe Speaker and Language Recognition Workshop (Odyssey), 2022
Kun Zhou
Berrak Sisman
John H. L. Hansen
Bin Ma
Haizhou Li
355
6
0
25 Oct 2022
Speech Synthesis with Mixed Emotions
Speech Synthesis with Mixed EmotionsIEEE Transactions on Affective Computing (IEEE TAC), 2022
Kun Zhou
Berrak Sisman
R. Rana
B.W.Schuller
Haizhou Li
368
67
0
11 Aug 2022
SpeechSplit 2.0: Unsupervised speech disentanglement for voice
  conversion Without tuning autoencoder Bottlenecks
SpeechSplit 2.0: Unsupervised speech disentanglement for voice conversion Without tuning autoencoder BottlenecksIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chak Ho Chan
Kaizhi Qian
Yang Zhang
M. Hasegawa-Johnson
DRL
304
56
0
26 Mar 2022
Emotion Intensity and its Control for Emotional Voice Conversion
Emotion Intensity and its Control for Emotional Voice ConversionIEEE Transactions on Affective Computing (IEEE TAC), 2022
Kun Zhou
Berrak Sisman
R. Rana
Björn W. Schuller
Haizhou Li
421
82
0
10 Jan 2022
How Speech is Recognized to Be Emotional - A Study Based on Information
  Decomposition
How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Haoran Sun
Lantian Li
Tianshi Zheng
Dong Wang
CVBM
145
0
0
24 Nov 2021
Textless Speech Emotion Conversion using Discrete and Decomposed
  Representations
Textless Speech Emotion Conversion using Discrete and Decomposed RepresentationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Felix Kreuk
Adam Polyak
Jade Copet
Eugene Kharitonov
Tu Nguyen
M. Rivière
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
Yossi Adi
392
47
0
14 Nov 2021
Disentanglement of Emotional Style and Speaker Identity for Expressive
  Voice Conversion
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice ConversionInterspeech (Interspeech), 2021
Zongyang Du
Berrak Sisman
Kun Zhou
Haizhou Li
297
34
0
20 Oct 2021
Expressive Voice Conversion: A Joint Framework for Speaker Identity and
  Emotional Style Transfer
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style TransferAutomatic Speech Recognition & Understanding (ASRU), 2021
Zongyang Du
Berrak Sisman
Kun Zhou
Haizhou Li
331
25
0
08 Jul 2021
Global Rhythm Style Transfer Without Text Transcriptions
Global Rhythm Style Transfer Without Text Transcriptions
Kaizhi Qian
Yang Zhang
Shiyu Chang
Jinjun Xiong
Chuang Gan
David D. Cox
M. Hasegawa-Johnson
275
21
0
16 Jun 2021
Emotional Voice Conversion: Theory, Databases and ESD
Emotional Voice Conversion: Theory, Databases and ESDSpeech Communication (Speech Commun.), 2021
Kun Zhou
Berrak Sisman
Rui Liu
Haizhou Li
531
264
0
31 May 2021
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech:
  Two-stage Sequence-to-Sequence Training
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence TrainingInterspeech (Interspeech), 2021
Kun Zhou
Berrak Sisman
Haizhou Li
407
35
0
31 Mar 2021
1
Page 1 of 1