ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01952
  4. Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
ArXivPDFHTML

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,616 papers shown
Title
FreeTuner: Any Subject in Any Style with Training-free Diffusion
FreeTuner: Any Subject in Any Style with Training-free Diffusion
Youcan Xu
Zhen Wang
Jun Xiao
Wei Liu
Long Chen
DiffM
36
9
0
23 May 2024
Perceptual Fairness in Image Restoration
Perceptual Fairness in Image Restoration
Guy Ohayon
Michael Elad
T. Michaeli
SupR
43
1
0
22 May 2024
MotionCraft: Physics-based Zero-Shot Video Generation
MotionCraft: Physics-based Zero-Shot Video Generation
L. S. Aira
Antonio Montanaro
Emanuele Aiello
D. Valsesia
E. Magli
DiffM
VGen
26
9
0
22 May 2024
An Empirical Study and Analysis of Text-to-Image Generation Using Large
  Language Model-Powered Textual Representation
An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation
Zhiyu Tan
Mengping Yang
Luozheng Qin
Hao Yang
Ye Qian
Qiang-feng Zhou
Cheng Zhang
Hao Li
65
3
0
21 May 2024
DisenStudio: Customized Multi-subject Text-to-Video Generation with
  Disentangled Spatial Control
DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control
Hong Chen
Xin Wang
Yipeng Zhang
Yuwei Zhou
Zeyang Zhang
Siao Tang
Wenwu Zhu
VGen
DiffM
41
9
0
21 May 2024
Diffusion for World Modeling: Visual Details Matter in Atari
Diffusion for World Modeling: Visual Details Matter in Atari
Eloi Alonso
Adam Jelley
Vincent Micheli
Anssi Kanervisto
Amos Storkey
Tim Pearce
Franccois Fleuret
46
40
0
20 May 2024
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise
  Attention
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
Peng Li
Yuan-Bin Liu
Xiaoxiao Long
Feihu Zhang
Cheng Lin
...
Wenhan Luo
Ping Tan
Wenping Wang
Qi-fei Liu
Yi-Ting Guo
VGen
77
40
0
19 May 2024
On the Trajectory Regularity of ODE-based Diffusion Sampling
On the Trajectory Regularity of ODE-based Diffusion Sampling
Defang Chen
Zhenyu Zhou
Can Wang
Chunhua Shen
Siwei Lyu
35
14
0
18 May 2024
Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion
Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion
Zeyu Zhang
Yiran Wang
Biao Wu
Shuo Chen
Zhiyuan Zhang
Shiya Huang
Wenbo Zhang
Meng Fang
Ling-Hao Chen
Yang Zhao
VGen
38
6
0
18 May 2024
Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory
  Score Matching
Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching
Xingyu Miao
Haoran Duan
Varun Ojha
Jun Song
Tejal Shah
Yang Long
R. Ranjan
27
3
0
18 May 2024
Generative AI for 2D Character Animation
Generative AI for 2D Character Animation
Jaime Guajardo
Ozgun Y. Bursalioglu
Dan B. Goldman
VGen
14
3
0
17 May 2024
LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with
  T-Diffusion
LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion
Tong Chen
Qingcheng Lyu
Long Bai
Erjian Guo
Huxin Gao
Xiaoxiao Yang
Hongliang Ren
Luping Zhou
MedIm
35
6
0
17 May 2024
VirtualModel: Generating Object-ID-retentive Human-object Interaction
  Image by Diffusion Model for E-commerce Marketing
VirtualModel: Generating Object-ID-retentive Human-object Interaction Image by Diffusion Model for E-commerce Marketing
Binghui Chen
Chongyang Zhong
Wangmeng Xiang
Yifeng Geng
Xuansong Xie
DiffM
28
6
0
16 May 2024
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with
  Fine-Grained Chinese Understanding
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Zhimin Li
Jianwei Zhang
Qin Lin
Jiangfeng Xiong
Yanxin Long
...
Wei Liu
Dingyong Wang
Yong Yang
Jie Jiang
Qinglin Lu
ViT
46
91
0
14 May 2024
Compositional Text-to-Image Generation with Dense Blob Representations
Compositional Text-to-Image Generation with Dense Blob Representations
Weili Nie
Sifei Liu
Morteza Mardani
Chao Liu
Benjamin Eckart
Arash Vahdat
DiffM
80
17
0
14 May 2024
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator
Hanshu Yan
Xingchao Liu
Jiachun Pan
Jun Hao Liew
Qiang Liu
Jiashi Feng
42
40
0
13 May 2024
Erasing Concepts from Text-to-Image Diffusion Models with Few-shot
  Unlearning
Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning
Masane Fuchi
Tomohiro Takagi
DiffM
VLM
50
13
0
12 May 2024
Training-free Subject-Enhanced Attention Guidance for Compositional
  Text-to-image Generation
Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation
Shengyuan Liu
Bo Wang
Ye Ma
Te Yang
Xipeng Cao
Quan Chen
Han Li
Di Dong
Peng Jiang
EGVM
44
2
0
11 May 2024
Disrupting Style Mimicry Attacks on Video Imagery
Disrupting Style Mimicry Attacks on Video Imagery
Josephine Passananti
Stanley Wu
Shawn Shan
Haitao Zheng
Ben Y. Zhao
AAML
25
4
0
11 May 2024
Distilling Diffusion Models into Conditional GANs
Distilling Diffusion Models into Conditional GANs
Minguk Kang
Richard Zhang
Connelly Barnes
Sylvain Paris
Suha Kwak
Jaesik Park
Eli Shechtman
Jun-Yan Zhu
Taesung Park
40
36
0
09 May 2024
Lumina-T2X: Transforming Text into Any Modality, Resolution, and
  Duration via Flow-based Large Diffusion Transformers
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Peng Gao
Le Zhuo
Ziyi Lin
Ruoyi Du
Xu Luo
...
Weicai Ye
He Tong
Jingwen He
Yu Qiao
Hongsheng Li
VGen
37
83
0
09 May 2024
MasterWeaver: Taming Editability and Face Identity for Personalized
  Text-to-Image Generation
MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation
Yuxiang Wei
Zhilong Ji
Jinfeng Bai
Hongzhi Zhang
Lei Zhang
W. Zuo
DiffM
49
0
0
09 May 2024
Attention-Driven Training-Free Efficiency Enhancement of Diffusion
  Models
Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models
Hongjie Wang
Difan Liu
Yan Kang
Yijun Li
Zhe Lin
N. Jha
Yuchen Liu
29
12
0
08 May 2024
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion
Zehan Wang
Ziang Zhang
Xize Cheng
Rongjie Huang
Luping Liu
...
Haifeng Huang
Yang Zhao
Tao Jin
Peng Gao
Zhou Zhao
31
8
0
08 May 2024
Towards Geographic Inclusion in the Evaluation of Text-to-Image Models
Towards Geographic Inclusion in the Evaluation of Text-to-Image Models
Melissa Hall
Samuel J. Bell
Candace Ross
Adina Williams
M. Drozdzal
Adriana Romero Soriano
EGVM
33
4
0
07 May 2024
Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion
  Transformer
Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
Zhuoyi Yang
Heyang Jiang
Wenyi Hong
Jiayan Teng
Wendi Zheng
Yuxiao Dong
Ming Ding
Jie Tang
SupR
30
5
0
07 May 2024
Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your
  Diffusion Model
Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model
Joo Young Choi
Jaesung R. Park
Inkyu Park
Jaewoong Cho
Albert No
Ernest K. Ryu
AI4CE
35
4
0
07 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World
  Models and Beyond
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGen
LM&Ro
84
36
0
06 May 2024
Video Diffusion Models: A Survey
Video Diffusion Models: A Survey
Andrew Melnik
Michal Ljubljanac
Cong Lu
Qi Yan
Weiming Ren
Helge J. Ritter
VGen
71
12
0
06 May 2024
Customizing Text-to-Image Models with a Single Image Pair
Customizing Text-to-Image Models with a Single Image Pair
Maxwell Jones
Sheng-Yu Wang
Nupur Kumari
David Bau
Jun-Yan Zhu
DiffM
25
19
0
02 May 2024
LocInv: Localization-aware Inversion for Text-Guided Image Editing
LocInv: Localization-aware Inversion for Text-Guided Image Editing
Chuanming Tang
Kai Wang
Fei Yang
J. Weijer
DiffM
39
3
0
02 May 2024
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video
  Generation
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Yupeng Zhou
Daquan Zhou
Ming-Ming Cheng
Jiashi Feng
Qibin Hou
DiffM
VGen
40
88
0
02 May 2024
DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines
DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines
Ye Tian
Zhen Jia
Ziyue Luo
Yida Wang
Chuan Wu
AI4CE
23
2
0
02 May 2024
On Mechanistic Knowledge Localization in Text-to-Image Generative Models
On Mechanistic Knowledge Localization in Text-to-Image Generative Models
Samyadeep Basu
Keivan Rezaei
Priyatham Kattakinda
Ryan Rossi
Cherry Zhao
Vlad I. Morariu
Varun Manjunatha
S. Feizi
25
13
0
02 May 2024
Obtaining Favorable Layouts for Multiple Object Generation
Obtaining Favorable Layouts for Multiple Object Generation
Barak Battash
Amit Rozner
Lior Wolf
Ofir Lindenbaum
DiffM
48
2
0
01 May 2024
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Xiaoshi Wu
Yiming Hao
Manyuan Zhang
Keqiang Sun
Zhaoyang Huang
Guanglu Song
Yu Liu
Hongsheng Li
EGVM
76
16
0
01 May 2024
MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion
  Generation
MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion Generation
Xujie Zhang
Ente Lin
Xiu Li
Yuxuan Luo
Michael C. Kampffmeyer
Xin Dong
Xiaodan Liang
51
10
0
01 May 2024
Synthetic Image Verification in the Era of Generative AI: What Works and
  What Isn't There Yet
Synthetic Image Verification in the Era of Generative AI: What Works and What Isn't There Yet
D. Tariang
Riccardo Corvi
D. Cozzolino
Giovanni Poggi
Koki Nagano
L. Verdoliva
48
8
0
30 Apr 2024
DOCCI: Descriptions of Connected and Contrasting Images
DOCCI: Descriptions of Connected and Contrasting Images
Yasumasa Onoe
Sunayana Rane
Zachary Berger
Yonatan Bitton
Jaemin Cho
...
Zarana Parekh
Jordi Pont-Tuset
Garrett Tanzer
Su Wang
Jason Baldridge
39
48
0
30 Apr 2024
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation
Yunhao Ge
Xiaohui Zeng
Jacob Samuel Huffman
Tsung-Yi Lin
Ming-Yu Liu
Yin Cui
CoGe
DiffM
30
14
0
30 Apr 2024
GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting
GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting
Kai Zhang
Sai Bi
Hao Tan
Yuanbo Xiangli
Nanxuan Zhao
Kalyan Sunkavalli
Zexiang Xu
3DGS
34
123
0
30 Apr 2024
TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image
  Generation with Diffusion Models
TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models
Teng Zhou
Yongchuan Tang
DiffM
42
2
0
30 Apr 2024
DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing
DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing
Minghao Chen
Iro Laina
Andrea Vedaldi
3DGS
42
23
0
29 Apr 2024
Stylus: Automatic Adapter Selection for Diffusion Models
Stylus: Automatic Adapter Selection for Diffusion Models
Michael Luo
Justin Wong
Brandon Trabucco
Yanping Huang
Joseph E. Gonzalez
Zhifeng Chen
Ruslan Salakhutdinov
Ion Stoica
DiffM
43
6
0
29 Apr 2024
TheaterGen: Character Management with LLM for Consistent Multi-turn
  Image Generation
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
Junhao Cheng
Baiqiao Yin
Kaixin Cai
Minbin Huang
Hanhui Li
...
Yue Li
Yifei Li
Yuhao Cheng
Yiqiang Yan
Xiaodan Liang
DiffM
MLLM
32
12
0
29 Apr 2024
G-Refine: A General Quality Refiner for Text-to-Image Generation
G-Refine: A General Quality Refiner for Text-to-Image Generation
Chunyi Li
Haoning Wu
Hongkun Hao
Zicheng Zhang
Tengchaun Kou
Chaofeng Chen
Lei Bai
Xiaohong Liu
Weisi Lin
Guangtao Zhai
27
4
0
29 Apr 2024
Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation
Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation
Tianyidan Xie
Rui Ma
Qian Wang
Xiaoqian Ye
Feixuan Liu
Ying Tai
Zhenyu Zhang
Lanjun Wang
Zili Yi
DiffM
MLLM
47
2
0
29 Apr 2024
Seizing the Means of Production: Exploring the Landscape of Crafting,
  Adapting and Navigating Generative AI Models in the Visual Arts
Seizing the Means of Production: Exploring the Landscape of Crafting, Adapting and Navigating Generative AI Models in the Visual Arts
Ahmed M. Abuzuraiq
Philippe Pasquier
23
1
0
26 Apr 2024
Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality
  Virtual Try-on in Videos
Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos
Zhengze Xu
Mengting Chen
Zhao Wang
Linyu Xing
Zhonghua Zhai
Nong Sang
Jinsong Lan
Shuai Xiao
Changxin Gao
DiffM
38
11
0
26 Apr 2024
ReflectanceFusion: Diffusion-based text to SVBRDF Generation
ReflectanceFusion: Diffusion-based text to SVBRDF Generation
Bowen Xue
G. C. Guarnera
Shuang Zhao
Zahra Montazeri
25
2
0
25 Apr 2024
Previous
123...222324...313233
Next