U-StyDiT: Ultra-high Quality Artistic Style Transfer Using Diffusion Transformers

11 March 2025
Zhanjie Zhang
Ao Ma
Ke Cao
Jing Wang
Shanyuan Liu
Yuhang Ma
Bo Cheng
Dawei Leng
Yuhui Yin
Abstract

Ultra-high quality artistic style transfer refers to repainting an ultra-high quality content image using the style information learned from a style image. Existing artistic style transfer methods fall into two categories: style reconstruction-based approaches and content-style disentanglement-based approaches. Although these methods can generate artistic stylized images, their outputs still exhibit obvious artifacts and disharmonious patterns, which prevent them from reaching ultra-high quality. To address these issues, we propose U-StyDiT, a novel artistic style transfer method built on transformer-based diffusion (DiT) that learns content-style disentanglement to generate ultra-high quality artistic stylized images. Specifically, we first design a Multi-view Style Modulator (MSM) that learns style information from a style image from both local and global perspectives and conditions U-StyDiT to generate stylized images with the learned style information. Then, we introduce a StyDiT Block to learn content and style conditions simultaneously from a style image. Additionally, we propose an ultra-high quality artistic image dataset, Aes4M, comprising 10 categories, each containing 400,000 style images. This dataset addresses a limitation of existing style transfer methods, which cannot produce high-quality artistic stylized images because of the limited size and image quality of existing datasets. Finally, extensive qualitative and quantitative experiments validate that U-StyDiT creates higher quality stylized images than state-of-the-art artistic style transfer methods. To our knowledge, our method is the first to address the generation of ultra-high quality stylized images using transformer-based diffusion.
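
As a quick sanity check on the dataset figures quoted in the abstract, the short sketch below multiplies the stated number of categories by the stated images per category. The variable and function names are illustrative assumptions, not identifiers from the paper, and reading the "4M" in Aes4M as roughly four million images is an inference from these numbers rather than something the abstract states.

```python
# Figures quoted in the abstract; names are illustrative, not from the paper.
NUM_CATEGORIES = 10
IMAGES_PER_CATEGORY = 400_000


def total_images(num_categories: int, images_per_category: int) -> int:
    """Return the dataset size implied by equally sized categories."""
    return num_categories * images_per_category


total = total_images(NUM_CATEGORIES, IMAGES_PER_CATEGORY)
print(f"{total:,} images (~{total / 1_000_000:g}M)")  # prints "4,000,000 images (~4M)"
```

The product, 4,000,000 images, is consistent with the dataset's name under that reading.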

@article{zhang2025_2503.08157,
  title={U-StyDiT: Ultra-high Quality Artistic Style Transfer Using Diffusion Transformers},
  author={Zhanjie Zhang and Ao Ma and Ke Cao and Jing Wang and Shanyuan Liu and Yuhang Ma and Bo Cheng and Dawei Leng and Yuhui Yin},
  journal={arXiv preprint arXiv:2503.08157},
  year={2025}
}