Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1807.11470
Cited By
Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis
30 July 2018
G. Henter
Jaime Lorenzo-Trueba
Xin Wang
Junichi Yamagishi
DRL
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis"
15 / 15 papers shown
Title
Multi-objective Deep Data Generation with Correlated Property Control
Shiyu Wang
Xiaojie Guo
Xuanyang Lin
Bo Pan
Yuanqi Du
...
S. Alkhalifa
K. Minbiole
Bill Wuest
Amarda Shehu
Liang Zhao
AI4CE
54
14
0
01 Oct 2022
Controllable Data Generation by Deep Learning: A Review
Shiyu Wang
Yuanqi Du
Xiaojie Guo
Bo Pan
Zhaohui Qin
Liang Zhao
33
28
0
19 Jul 2022
The Sillwood Technologies System for the VoiceMOS Challenge 2022
Jiameng Gao
30
0
0
08 Apr 2022
Autoregressive Co-Training for Learning Discrete Speech Representations
Sung-Lin Yeh
Hao Tang
SSL
27
6
0
29 Mar 2022
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models
Jen-Hao Rick Chang
A. Shrivastava
H. Koppula
Xiaoshuai Zhang
Oncel Tuzel
DiffM
51
16
0
06 Oct 2021
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
D. Mohan
Qinmin Hu
Tian Huey Teh
Alexandra Torresquintero
C. Wallis
Marlene Staib
Lorenzo Foglianti
Jiameng Gao
Simon King
25
16
0
15 Jun 2021
Fine-grained Emotion Strength Transfer, Control and Prediction for Emotional Speech Synthesis
Yinjiao Lei
Shan Yang
Lei Xie
27
55
0
17 Nov 2020
Deep generative models for musical audio synthesis
M. Huzaifah
L. Wyse
27
20
0
10 Jun 2020
DiscreTalk: Text-to-Speech as a Machine Translation Problem
Tomoki Hayashi
Shinji Watanabe
27
32
0
12 May 2020
Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis
Guangzhi Sun
Yu Zhang
Ron J. Weiss
Yuanbin Cao
Heiga Zen
Yonghui Wu
16
130
0
06 Feb 2020
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach
Noé Tits
16
10
0
05 Jul 2019
Using generative modelling to produce varied intonation for speech synthesis
Zack Hodari
O. Watts
Simon King
29
29
0
10 Jun 2019
Quantization-Based Regularization for Autoencoders
Hanwei Wu
M. Flierl
DRL
11
2
0
27 May 2019
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network
V. Wan
Chun-an Chan
Tom Kenter
Jakub Vít
R. Clark
19
75
0
17 May 2019
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Ye Jia
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
...
Z. Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
Yonghui Wu
207
820
0
12 Jun 2018
1