ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.06227
  4. Cited By
Unsupervised Style and Content Separation by Minimizing Mutual
  Information for Speech Synthesis

Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
9 March 2020
Ting-Yao Hu
A. Shrivastava
Oncel Tuzel
C. Dhir
ArXiv (abs)PDFHTML

Papers citing "Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis"

18 / 18 papers shown
Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement
Emotional Text-To-Speech Based on Mutual-Information-Guided Emotion-Timbre Disentanglement
Jianing Yang
Sheng Li
Takahiro Shinozaki
Yuki Saito
Hiroshi Saruwatari
122
0
0
02 Oct 2025
Progressive Facial Granularity Aggregation with Bilateral Attribute-based Enhancement for Face-to-Speech Synthesis
Progressive Facial Granularity Aggregation with Bilateral Attribute-based Enhancement for Face-to-Speech SynthesisConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Yejin Jeon
Youngjae Kim
Jihyun Lee
Hyounghun Kim
G. G. Lee
CVBM
394
0
0
09 Sep 2025
Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker
  Representations
Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker RepresentationsAAAI Conference on Artificial Intelligence (AAAI), 2024
Yejin Jeon
Yunsu Kim
Gary Geunbae Lee
301
7
0
04 Jan 2024
DC CoMix TTS: An End-to-End Expressive TTS with Discrete Code
  Collaborated with Mixer
DC CoMix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with MixerInterspeech (Interspeech), 2023
Yerin Choi
M. Koo
439
1
0
31 May 2023
Transformers in Speech Processing: A Survey
Transformers in Speech Processing: A Survey
S. Latif
Aun Zaidi
Heriberto Cuayáhuitl
Fahad Shamshad
Moazzam Shoukat
Muhammad Usama
Junaid Qadir
515
76
0
21 Mar 2023
Learning from Multiple Sources for Data-to-Text and Text-to-Data
Learning from Multiple Sources for Data-to-Text and Text-to-DataInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
S. Duong
Alberto Lumbreras
Mike Gartrell
Patrick Gallinari
238
3
0
22 Feb 2023
Self-supervised Context-aware Style Representation for Expressive Speech
  Synthesis
Self-supervised Context-aware Style Representation for Expressive Speech SynthesisInterspeech (Interspeech), 2022
Yihan Wu
Xi Wang
Xi Wang
Lei He
Ruihua Song
J. Nie
395
17
0
25 Jun 2022
MsEmoTTS: Multi-scale emotion transfer, prediction, and control for
  emotional speech synthesis
MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesisIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2022
Yinjiao Lei
Shan Yang
Xinsheng Wang
Lei Xie
239
99
0
17 Jan 2022
Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition
Synt++: Utilizing Imperfect Synthetic Data to Improve Speech RecognitionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ting-Yao Hu
Mohammadreza Armandpour
A. Shrivastava
Jen-Hao Rick Chang
H. Koppula
Oncel Tuzel
SyDa
279
49
0
21 Oct 2021
Fine-grained style control in Transformer-based Text-to-speech Synthesis
Fine-grained style control in Transformer-based Text-to-speech SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Li-Wei Chen
Alexander I. Rudnicky
378
40
0
12 Oct 2021
Using multiple reference audios and style embedding constraints for
  speech synthesis
Using multiple reference audios and style embedding constraints for speech synthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Cheng Gong
Longbiao Wang
Zhenhua Ling
Ju Zhang
Jianwu Dang
197
7
0
09 Oct 2021
Style Equalization: Unsupervised Learning of Controllable Generative
  Sequence Models
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models
Jen-Hao Rick Chang
A. Shrivastava
H. Koppula
Xiaoshuai Zhang
Oncel Tuzel
DiffM
299
20
0
06 Oct 2021
Information Sieve: Content Leakage Reduction in End-to-End Prosody For
  Expressive Speech Synthesis
Information Sieve: Content Leakage Reduction in End-to-End Prosody For Expressive Speech Synthesis
Xudong Dai
Cheng Gong
Longbiao Wang
Kaili Zhang
178
2
0
04 Aug 2021
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised
  Speech Representation Disentanglement for One-shot Voice Conversion
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice ConversionInterspeech (Interspeech), 2021
Disong Wang
Liqun Deng
Y. Yeung
Xiao Chen
Xunying Liu
Helen Meng
DRL
228
178
0
18 Jun 2021
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech
  Synthesis via Phone-Level Content-Style Disentanglement
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement
Daxin Tan
Tan Lee
446
22
0
08 Nov 2020
Paralinguistic Privacy Protection at the Edge
Paralinguistic Privacy Protection at the Edge
Ranya Aloufi
Hamed Haddadi
David E. Boyle
371
18
0
04 Nov 2020
Unsupervised Learning of Disentangled Speech Content and Style
  Representation
Unsupervised Learning of Disentangled Speech Content and Style RepresentationInterspeech (Interspeech), 2020
Andros Tjandra
Ruoming Pang
Yu Zhang
Shigeki Karita
BDLDRL
295
21
0
24 Oct 2020
Privacy-preserving Voice Analysis via Disentangled Representations
Privacy-preserving Voice Analysis via Disentangled Representations
Ranya Aloufi
Hamed Haddadi
David E. Boyle
DRL
365
65
0
29 Jul 2020
1
Page 1 of 1