Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

9 March 2020

Papers citing "Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis"

18 / 18 papers shown

Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement

122

02 Oct 2025

Progressive Facial Granularity Aggregation with Bilateral Attribute-based Enhancement for Face-to-Speech SynthesisConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

394

09 Sep 2025

Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker RepresentationsAAAI Conference on Artificial Intelligence (AAAI), 2024

Yejin Jeon

Yunsu Kim

Gary Geunbae Lee

301

04 Jan 2024

DC CoMix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with MixerInterspeech (Interspeech), 2023

Yerin Choi

M. Koo

439

31 May 2023

Transformers in Speech Processing: A Survey

515

21 Mar 2023

Learning from Multiple Sources for Data-to-Text and Text-to-DataInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

238

22 Feb 2023

Self-supervised Context-aware Style Representation for Expressive Speech SynthesisInterspeech (Interspeech), 2022

395

25 Jun 2022

MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesisIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022

Yinjiao Lei

Shan Yang

Xinsheng Wang

Lei Xie

239

17 Jan 2022

Synt++: Utilizing Imperfect Synthetic Data to Improve Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Ting-Yao Hu

Mohammadreza Armandpour

279

21 Oct 2021

Fine-grained style control in Transformer-based Text-to-speech SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Li-Wei Chen

Alexander I. Rudnicky

378

12 Oct 2021

Using multiple reference audios and style embedding constraints for speech synthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021

Longbiao Wang

197

09 Oct 2021

Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models

299

06 Oct 2021

Information Sieve: Content Leakage Reduction in End-to-End Prosody For Expressive Speech Synthesis

Xudong Dai

Cheng Gong

Longbiao Wang

Kaili Zhang

178

04 Aug 2021

VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice ConversionInterspeech (Interspeech), 2021

228

178

18 Jun 2021

Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement

Daxin Tan

Tan Lee

446

08 Nov 2020

Paralinguistic Privacy Protection at the Edge

Ranya Aloufi

Hamed Haddadi

David E. Boyle

371

04 Nov 2020

Unsupervised Learning of Disentangled Speech Content and Style RepresentationInterspeech (Interspeech), 2020

295

24 Oct 2020

Privacy-preserving Voice Analysis via Disentangled Representations

Ranya Aloufi

Hamed Haddadi

David E. Boyle

DRL

365

29 Jul 2020