Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.03943
Cited By
v1
v2
v3
v4 (latest)
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement
8 November 2020
Daxin Tan
Tan Lee
Re-assign community
ArXiv (abs)
PDF
HTML
Github
Papers citing
"Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement"
13 / 13 papers shown
CorrTalk: Correlation Between Hierarchical Speech and Facial Activity Variances for 3D Animation
Zhaojie Chu
K. Guo
Xiaofen Xing
Yilin Lan
Bolun Cai
Xiangmin Xu
309
13
0
17 Oct 2023
HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS
Automatic Speech Recognition & Understanding (ASRU), 2023
Dake Guo
Xinfa Zhu
Liumeng Xue
Tao Li
Yuanjun Lv
Yuepeng Jiang
Linfu Xie
258
4
0
25 Sep 2023
MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style Modeling
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Zhichao Wang
Xinsheng Wang
Qicong Xie
Tao Li
Linfu Xie
Qiao Tian
Yuping Wang
440
7
0
03 Sep 2023
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Shunwei Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Xixin Wu
Shiyin Kang
Helen Meng
253
10
0
29 Jul 2023
Controllable speech synthesis by learning discrete phoneme-level prosodic representations
Speech Communication (Speech Commun.), 2022
Nikolaos Ellinas
Myrsini Christidou
Alexandra Vioni
June Sig Sung
Aimilios Chalamandaris
Pirros Tsiakoulis
P. Mastorocostas
194
10
0
29 Nov 2022
Speech Synthesis with Mixed Emotions
IEEE Transactions on Affective Computing (IEEE TAC), 2022
Kun Zhou
Berrak Sisman
R. Rana
B.W.Schuller
Haizhou Li
373
67
0
11 Aug 2022
Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis
Interspeech (Interspeech), 2022
Shunwei Lei
Yixuan Zhou
Liyang Chen
Jiankun Hu
Zhiyong Wu
Shiyin Kang
Helen Meng
241
13
0
06 Apr 2022
On incorporating social speaker characteristics in synthetic speech
S. Rallabandi
Sebastian Möller
224
0
0
03 Apr 2022
MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Yinjiao Lei
Shan Yang
Xinsheng Wang
Lei Xie
243
99
0
17 Jan 2022
Emotion Intensity and its Control for Emotional Voice Conversion
IEEE Transactions on Affective Computing (IEEE TAC), 2022
Kun Zhou
Berrak Sisman
R. Rana
Björn W. Schuller
Haizhou Li
441
84
0
10 Jan 2022
Fine-grained style control in Transformer-based Text-to-speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Li-Wei Chen
Alexander I. Rudnicky
389
41
0
12 Oct 2021
Applying the Information Bottleneck Principle to Prosodic Representation Learning
Interspeech (Interspeech), 2021
Guangyan Zhang
Ying Qin
Daxin Tan
Tan Lee
282
5
0
05 Aug 2021
CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Daxin Tan
Hingpang Huang
Guangyan Zhang
Tan Lee
446
6
0
08 Mar 2021
1
Page 1 of 1