STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech

Interspeech (Interspeech), 2021

17 March 2021

Papers citing "STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech"

Textless and Non-Parallel Speech-to-Speech Emotion Style Transfer

Soumya Dutta

Avni Jain

Sriram Ganapathy

316

23 May 2025

TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Changhao Pan

Rongjie Huang

Chuxin Wang

Zhou Zhao

DiffM VLM

621

24 Sep 2024

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis

AAAI Conference on Artificial Intelligence (AAAI), 2023

559

17 Dec 2023

Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

252

14 Mar 2023

Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis

International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022

216

13 Dec 2022

NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS

Interspeech (Interspeech), 2022

Dongchao Yang

Helin Wang

217

04 Nov 2022

RedPen: Region- and Reason-Annotated Dataset of Unnatural Speech

182

26 Oct 2022

Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation

Interspeech (Interspeech), 2022

195

29 Jul 2022

GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech

Neural Information Processing Systems (NeurIPS), 2022

Rongjie Huang

Zhou Zhao

334

15 May 2022

Fine-grained Noise Control for Multispeaker Speech Synthesis

Interspeech (Interspeech), 2022

Aimilios Chalamandaris

Pirros Tsiakoulis

230

11 Apr 2022

Cross-speaker style transfer for text-to-speech using data augmentation

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

207

10 Feb 2022

Daft-Exprt: Cross-Speaker Prosody Transfer on Any Text for Expressive Speech Synthesis

Interspeech (Interspeech), 2021

151

04 Aug 2021

A Survey on Neural Speech Synthesis

Xu Tan

443

446

29 Jun 2021