ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2505.15004
28
0
v1v2 (latest)

EASY: Emotion-aware Speaker Anonymization via Factorized Distillation

21 May 2025
Jixun Yao
Hexin Liu
Eng Siong Chng
Lei Xie
ArXiv (abs)PDFHTML
Main:4 Pages
1 Figures
Bibliography:1 Pages
3 Tables
Abstract

Emotion plays a significant role in speech interaction, conveyed through tone, pitch, and rhythm, enabling the expression of feelings and intentions beyond words to create a more personalized experience. However, most existing speaker anonymization systems employ parallel disentanglement methods, which only separate speech into linguistic content and speaker identity, often neglecting the preservation of the original emotional state. In this study, we introduce EASY, an emotion-aware speaker anonymization framework. EASY employs a novel sequential disentanglement process to disentangle speaker identity, linguistic content, and emotional representation, modeling each speech attribute in distinct subspaces through a factorized distillation approach. By independently constraining speaker identity and emotional representation, EASY minimizes information leakage, enhancing privacy protection while preserving original linguistic content and emotional state. Experimental results on the VoicePrivacy Challenge official datasets demonstrate that our proposed approach outperforms all baseline systems, effectively protecting speaker privacy while maintaining linguistic content and emotional state.

View on arXiv
@article{yao2025_2505.15004,
  title={ EASY: Emotion-aware Speaker Anonymization via Factorized Distillation },
  author={ Jixun Yao and Hexin Liu and Eng Siong Chng and Lei Xie },
  journal={arXiv preprint arXiv:2505.15004},
  year={ 2025 }
}
Comments on this paper