Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis

30 July 2018

Jaime Lorenzo-Trueba

Xin Wang

Junichi Yamagishi

Papers citing "Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis"

Title | Authors | Tag | Counts | Date
--- | --- | --- | --- | ---
Multi-objective Deep Data Generation with Correlated Property Control | Shiyu Wang, Xiaojie Guo, Xuanyang Lin, Bo Pan, Yuanqi Du, ..., S. Alkhalifa, K. Minbiole, Bill Wuest, Amarda Shehu, Liang Zhao | AI4CE | 54 14 0 | 01 Oct 2022
Controllable Data Generation by Deep Learning: A Review | Shiyu Wang, Yuanqi Du, Xiaojie Guo, Bo Pan, Zhaohui Qin, Liang Zhao | - | 33 28 0 | 19 Jul 2022
The Sillwood Technologies System for the VoiceMOS Challenge 2022 | Jiameng Gao | - | 30 0 0 | 08 Apr 2022
Autoregressive Co-Training for Learning Discrete Speech Representations | Sung-Lin Yeh, Hao Tang | SSL | 27 6 0 | 29 Mar 2022
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models | Jen-Hao Rick Chang, A. Shrivastava, H. Koppula, Xiaoshuai Zhang, Oncel Tuzel | DiffM | 51 16 0 | 06 Oct 2021
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis | D. Mohan, Qinmin Hu, Tian Huey Teh, Alexandra Torresquintero, C. Wallis, Marlene Staib, Lorenzo Foglianti, Jiameng Gao, Simon King | - | 25 16 0 | 15 Jun 2021
Fine-grained Emotion Strength Transfer, Control and Prediction for Emotional Speech Synthesis | Yinjiao Lei, Shan Yang, Lei Xie | - | 27 55 0 | 17 Nov 2020
Deep generative models for musical audio synthesis | M. Huzaifah, L. Wyse | - | 27 20 0 | 10 Jun 2020
DiscreTalk: Text-to-Speech as a Machine Translation Problem | Tomoki Hayashi, Shinji Watanabe | - | 27 32 0 | 12 May 2020
Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis | Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuanbin Cao, Heiga Zen, Yonghui Wu | - | 16 130 0 | 06 Feb 2020
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach | Noé Tits | - | 16 10 0 | 05 Jul 2019
Using generative modelling to produce varied intonation for speech synthesis | Zack Hodari, O. Watts, Simon King | - | 29 29 0 | 10 Jun 2019
Quantization-Based Regularization for Autoencoders | Hanwei Wu, M. Flierl | DRL | 11 2 0 | 27 May 2019
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network | V. Wan, Chun-an Chan, Tom Kenter, Jakub Vít, R. Clark | - | 19 75 0 | 17 May 2019
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis | Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, ..., Z. Chen, Patrick Nguyen, Ruoming Pang, Ignacio López Moreno, Yonghui Wu | - | 207 820 0 | 12 Jun 2018