Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2103.09474
Cited By
v1
v2
v3
v4 (latest)
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech
Interspeech (Interspeech), 2021
17 March 2021
Keon Lee
Kyumin Park
Daeyoung Kim
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech"
13 / 13 papers shown
Textless and Non-Parallel Speech-to-Speech Emotion Style Transfer
Soumya Dutta
Avni Jain
Sriram Ganapathy
316
0
0
23 May 2025
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Yu Zhang
Ziyue Jiang
Ruiqi Li
Changhao Pan
Jinzheng He
Rongjie Huang
Chuxin Wang
Zhou Zhao
DiffM
VLM
621
25
0
24 Sep 2024
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis
AAAI Conference on Artificial Intelligence (AAAI), 2023
Yu Zhang
Rongjie Huang
Ruiqi Li
Jinzheng He
Yan Xia
Feiyang Chen
Xinyu Duan
Baoxing Huai
Zhou Zhao
VLM
559
43
0
17 Dec 2023
Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Chunyu Qiang
Peng Yang
Hao Che
Ying Zhang
Xiaorui Wang
Zhong-ming Wang
252
12
0
14 Mar 2023
Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Chunyu Qiang
Peng Yang
Hao Che
Xiaorui Wang
Zhongyuan Wang
BDL
216
9
0
13 Dec 2022
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS
Interspeech (Interspeech), 2022
Dongchao Yang
Songxiang Liu
Jianwei Yu
Helin Wang
Chao Weng
Yuexian Zou
DiffM
VLM
217
22
0
04 Nov 2022
RedPen: Region- and Reason-Annotated Dataset of Unnatural Speech
Kyumin Park
Keon Lee
Daeyoung Kim
Luan Tuyen Chau
182
0
0
26 Oct 2022
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation
Interspeech (Interspeech), 2022
Giulia Comini
Goeric Huybrechts
M. Ribeiro
Adam Gabry's
Jaime Lorenzo-Trueba
195
7
0
29 Jul 2022
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech
Neural Information Processing Systems (NeurIPS), 2022
Rongjie Huang
Yi Ren
Jinglin Liu
Chenye Cui
Zhou Zhao
OODD
VLM
334
36
0
15 May 2022
Fine-grained Noise Control for Multispeaker Speech Synthesis
Interspeech (Interspeech), 2022
Karolos Nikitaras
G. Vamvoukakis
Nikolaos Ellinas
Konstantinos Klapsas
K. Markopoulos
S. Raptis
June Sig Sung
Gunu Jho
Aimilios Chalamandaris
Pirros Tsiakoulis
230
5
0
11 Apr 2022
Cross-speaker style transfer for text-to-speech using data augmentation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
M. Ribeiro
Julian Roth
Giulia Comini
Goeric Huybrechts
Adam Gabry's
Jaime Lorenzo-Trueba
207
28
0
10 Feb 2022
Daft-Exprt: Cross-Speaker Prosody Transfer on Any Text for Expressive Speech Synthesis
Interspeech (Interspeech), 2021
Julian Zaïdi
Hugo Seuté
Benjamin van Niekerk
M. Carbonneau
151
30
0
04 Aug 2021
A Survey on Neural Speech Synthesis
Xu Tan
Tao Qin
Frank Soong
Tie-Yan Liu
AI4TS
443
446
0
29 Jun 2021
1
Page 1 of 1