v1v2v3v4 (latest)

Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement

8 November 2020

Daxin Tan

Tan Lee

ArXiv (abs)PDF HTML Github

Papers citing "Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement"

13 / 13 papers shown

CorrTalk: Correlation Between Hierarchical Speech and Facial Activity Variances for 3D Animation

309

17 Oct 2023

HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTSAutomatic Speech Recognition & Understanding (ASRU), 2023

258

25 Sep 2023

MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style ModelingIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

440

03 Sep 2023

MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech SynthesisIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2023

Yixuan Zhou

Zhiyong Wu

Shiyin Kang

253

29 Jul 2023

Controllable speech synthesis by learning discrete phoneme-level prosodic representationsSpeech Communication (Speech Commun.), 2022

Aimilios Chalamandaris

Pirros Tsiakoulis

P. Mastorocostas

194

29 Nov 2022

Speech Synthesis with Mixed EmotionsIEEE Transactions on Affective Computing (IEEE TAC), 2022

Haizhou Li

373

11 Aug 2022

Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech SynthesisInterspeech (Interspeech), 2022

Shunwei Lei

Yixuan Zhou

Liyang Chen

Jiankun Hu

Zhiyong Wu

Shiyin Kang

Helen Meng

241

06 Apr 2022

On incorporating social speaker characteristics in synthetic speech

S. Rallabandi

Sebastian Möller

224

03 Apr 2022

MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesisIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Yinjiao Lei

Shan Yang

Xinsheng Wang

Lei Xie

243

17 Jan 2022

Emotion Intensity and its Control for Emotional Voice ConversionIEEE Transactions on Affective Computing (IEEE TAC), 2022

Kun Zhou

Berrak Sisman

R. Rana

Björn W. Schuller

Haizhou Li

441

10 Jan 2022

Fine-grained style control in Transformer-based Text-to-speech SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Li-Wei Chen

Alexander I. Rudnicky

389

12 Oct 2021

Applying the Information Bottleneck Principle to Prosodic Representation LearningInterspeech (Interspeech), 2021

282

05 Aug 2021

CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge

446

08 Mar 2021