ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.16491
21
7

Z∗Z^*Z∗: Zero-shot Style Transfer via Attention Rearrangement

25 November 2023
Yingying Deng
Xiangyu He
Fan Tang
Weiming Dong
    DiffM
ArXivPDFHTML
Abstract

Despite the remarkable progress in image style transfer, formulating style in the context of art is inherently subjective and challenging. In contrast to existing learning/tuning methods, this study shows that vanilla diffusion models can directly extract style information and seamlessly integrate the generative prior into the content image without retraining. Specifically, we adopt dual denoising paths to represent content/style references in latent space and then guide the content image denoising process with style latent codes. We further reveal that the cross-attention mechanism in latent diffusion models tends to blend the content and style images, resulting in stylized outputs that deviate from the original content image. To overcome this limitation, we introduce a cross-attention rearrangement strategy. Through theoretical analysis and experiments, we demonstrate the effectiveness and superiority of the diffusion-based Z‾\underline{Z}Z​ero-shot S‾\underline{S}S​tyle T‾\underline{T}T​ransfer via A‾\underline{A}A​ttention R‾\underline{R}R​earrangement, Z-STAR.

View on arXiv
Comments on this paper