Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.02314
Cited By
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech
3 November 2020
Kun Zhou
Berrak Sisman
Haizhou Li
DRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech"
24 / 24 papers shown
Textless and Non-Parallel Speech-to-Speech Emotion Style Transfer
Soumya Dutta
Avni Jain
Sriram Ganapathy
317
0
0
23 May 2025
EmoDiffusion: Enhancing Emotional 3D Facial Animation with Latent Diffusion Models
Yixuan Zhang
Qing Chang
Yuxi Wang
Guang Chen
Zhenru Zhang
Junran Peng
456
1
0
14 Mar 2025
A Review of Human Emotion Synthesis Based on Generative Technology
Fei Ma
Yongqian Li
Yifan Xie
Y. He
Yujiao Shi
...
Z. Liu
Wei Yao
Fuji Ren
Fei Richard Yu
Shiguang Ni
318
15
0
10 Dec 2024
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Xin Jing
Kun Zhou
Andreas Triantafyllopoulos
Björn W. Schuller
DiffM
251
8
0
10 Sep 2024
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis
Kun Zhou
Shengkui Zhao
Yukun Ma
Chong Zhang
Hao Wang
Dianwen Ng
Chongjia Ni
Nguyen Trung Hieu
J. Yip
Bin Ma
261
6
0
04 Jun 2024
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model
The Speaker and Language Recognition Workshop (Odyssey), 2024
Zongyang Du
Junchen Lu
Kun Zhou
Lakshmish Kaushik
Berrak Sisman
294
7
0
02 May 2024
Fine-Grained Quantitative Emotion Editing for Speech Generation
Sho Inoue
Kun Zhou
Shuai Wang
Haizhou Li
277
5
0
04 Mar 2024
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment
IEEE Transactions on Affective Computing (IEEE Trans. Affective Comput.), 2024
Hyoung-Seok Oh
Sang-Hoon Lee
Deok-Hyun Cho
Seong-Whan Lee
740
1
0
16 Jan 2024
Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Soumya Dutta
Sriram Ganapathy
230
8
0
09 Jan 2024
Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion
Interspeech (Interspeech), 2023
Yun Chen
Lingxiao Yang
Qi Chen
Jianhuang Lai
Xiaohua Xie
176
7
0
29 Dec 2023
In-the-wild Speech Emotion Conversion Using Disentangled Self-Supervised Representations and Neural Vocoder-based Resynthesis
N. Prabhu
N. Lehmann-Willenbrock
Timo Gerkmann
231
4
0
02 Jun 2023
Privacy in Speech Technology
Tomas Bäckström
449
11
0
09 May 2023
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
IEEE International Conference on Computer Vision (ICCV), 2023
Ziqiao Peng
Hao Wu
Zhenbo Song
Hao-Xuan Xu
Xiangyu Zhu
Jun He
Hongyan Liu
Zhaoxin Fan
CVBM
498
188
0
20 Mar 2023
Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion
The Speaker and Language Recognition Workshop (Odyssey), 2022
Kun Zhou
Berrak Sisman
John H. L. Hansen
Bin Ma
Haizhou Li
355
6
0
25 Oct 2022
Speech Synthesis with Mixed Emotions
IEEE Transactions on Affective Computing (IEEE TAC), 2022
Kun Zhou
Berrak Sisman
R. Rana
B.W.Schuller
Haizhou Li
368
67
0
11 Aug 2022
SpeechSplit 2.0: Unsupervised speech disentanglement for voice conversion Without tuning autoencoder Bottlenecks
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Chak Ho Chan
Kaizhi Qian
Yang Zhang
M. Hasegawa-Johnson
DRL
304
56
0
26 Mar 2022
Emotion Intensity and its Control for Emotional Voice Conversion
IEEE Transactions on Affective Computing (IEEE TAC), 2022
Kun Zhou
Berrak Sisman
R. Rana
Björn W. Schuller
Haizhou Li
421
82
0
10 Jan 2022
How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition
Haoran Sun
Lantian Li
Tianshi Zheng
Dong Wang
CVBM
145
0
0
24 Nov 2021
Textless Speech Emotion Conversion using Discrete and Decomposed Representations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Felix Kreuk
Adam Polyak
Jade Copet
Eugene Kharitonov
Tu Nguyen
M. Rivière
Wei-Ning Hsu
Abdel-rahman Mohamed
Emmanuel Dupoux
Yossi Adi
392
47
0
14 Nov 2021
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion
Interspeech (Interspeech), 2021
Zongyang Du
Berrak Sisman
Kun Zhou
Haizhou Li
297
34
0
20 Oct 2021
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer
Automatic Speech Recognition & Understanding (ASRU), 2021
Zongyang Du
Berrak Sisman
Kun Zhou
Haizhou Li
331
25
0
08 Jul 2021
Global Rhythm Style Transfer Without Text Transcriptions
Kaizhi Qian
Yang Zhang
Shiyu Chang
Jinjun Xiong
Chuang Gan
David D. Cox
M. Hasegawa-Johnson
275
21
0
16 Jun 2021
Emotional Voice Conversion: Theory, Databases and ESD
Speech Communication (Speech Commun.), 2021
Kun Zhou
Berrak Sisman
Rui Liu
Haizhou Li
531
264
0
31 May 2021
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training
Interspeech (Interspeech), 2021
Kun Zhou
Berrak Sisman
Haizhou Li
407
35
0
31 Mar 2021
1
Page 1 of 1