Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement

8 November 2020
Daxin Tan
Tan Lee

Papers citing "Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement"

13 / 13 papers shown
CorrTalk: Correlation Between Hierarchical Speech and Facial Activity Variances for 3D Animation
Zhaojie Chu
K. Guo
Xiaofen Xing
Yilin Lan
Bolun Cai
Xiangmin Xu
309
13
0
17 Oct 2023
HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS
Automatic Speech Recognition & Understanding (ASRU), 2023
Dake Guo
Xinfa Zhu
Liumeng Xue
Tao Li
Yuanjun Lv
Yuepeng Jiang
Linfu Xie
258
4
0
25 Sep 2023
MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style Modeling
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Zhichao Wang
Xinsheng Wang
Qicong Xie
Tao Li
Linfu Xie
Qiao Tian
Yuping Wang
440
7
0
03 Sep 2023
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023
Shunwei Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Xixin Wu
Shiyin Kang
Helen Meng
253
10
0
29 Jul 2023
Controllable speech synthesis by learning discrete phoneme-level prosodic representations
Speech Communication (Speech Commun.), 2022
Nikolaos Ellinas
Myrsini Christidou
Alexandra Vioni
June Sig Sung
Aimilios Chalamandaris
Pirros Tsiakoulis
P. Mastorocostas
194
10
0
29 Nov 2022
Speech Synthesis with Mixed Emotions
IEEE Transactions on Affective Computing (IEEE TAC), 2022
Kun Zhou
Berrak Sisman
R. Rana
B. W. Schuller
Haizhou Li
373
67
0
11 Aug 2022
Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis
Interspeech (Interspeech), 2022
Shunwei Lei
Yixuan Zhou
Liyang Chen
Jiankun Hu
Zhiyong Wu
Shiyin Kang
Helen Meng
241
13
0
06 Apr 2022
On incorporating social speaker characteristics in synthetic speech
S. Rallabandi
Sebastian Möller
224
0
0
03 Apr 2022
MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis
IEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Yinjiao Lei
Shan Yang
Xinsheng Wang
Lei Xie
243
99
0
17 Jan 2022
Emotion Intensity and its Control for Emotional Voice Conversion
IEEE Transactions on Affective Computing (IEEE TAC), 2022
Kun Zhou
Berrak Sisman
R. Rana
Björn W. Schuller
Haizhou Li
441
84
0
10 Jan 2022
Fine-grained style control in Transformer-based Text-to-speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Li-Wei Chen
Alexander I. Rudnicky
389
41
0
12 Oct 2021
Applying the Information Bottleneck Principle to Prosodic Representation Learning
Interspeech (Interspeech), 2021
Guangyan Zhang
Ying Qin
Daxin Tan
Tan Lee
282
5
0
05 Aug 2021
CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge
Daxin Tan
Hingpang Huang
Guangyan Zhang
Tan Lee
446
6
0
08 Mar 2021