ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.17269
29
0

Towards Generalized and Training-Free Text-Guided Semantic Manipulation

24 April 2025
Yu Hong
Xiao Cai
Pengpeng Zeng
Shuai Zhang
Jingkuan Song
Lianli Gao
H. Shen
    DiffM
ArXivPDFHTML
Abstract

Text-guided semantic manipulation refers to semantically editing an image generated from a source prompt to match a target prompt, enabling the desired semantic changes (e.g., addition, removal, and style transfer) while preserving irrelevant contents. With the powerful generative capabilities of the diffusion model, the task has shown the potential to generate high-fidelity visual content. Nevertheless, existing methods either typically require time-consuming fine-tuning (inefficient), fail to accomplish multiple semantic manipulations (poorly extensible), and/or lack support for different modality tasks (limited generalizability). Upon further investigation, we find that the geometric properties of noises in the diffusion model are strongly correlated with the semantic changes. Motivated by this, we propose a novel GTF\textit{GTF}GTF for text-guided semantic manipulation, which has the following attractive capabilities: 1) Generalized\textbf{Generalized}Generalized: our GTF\textit{GTF}GTF supports multiple semantic manipulations (e.g., addition, removal, and style transfer) and can be seamlessly integrated into all diffusion-based methods (i.e., Plug-and-play) across different modalities (i.e., modality-agnostic); and 2) Training-free\textbf{Training-free}Training-free: GTF\textit{GTF}GTF produces high-fidelity results via simply controlling the geometric relationship between noises without tuning or optimization. Our extensive experiments demonstrate the efficacy of our approach, highlighting its potential to advance the state-of-the-art in semantics manipulation.

View on arXiv
@article{hong2025_2504.17269,
  title={ Towards Generalized and Training-Free Text-Guided Semantic Manipulation },
  author={ Yu Hong and Xiao Cai and Pengpeng Zeng and Shuai Zhang and Jingkuan Song and Lianli Gao and Heng Tao Shen },
  journal={arXiv preprint arXiv:2504.17269},
  year={ 2025 }
}
Comments on this paper