ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01952
  4. Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
ArXivPDFHTML

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,616 papers shown
Title
Zero-Shot Distillation for Image Encoders: How to Make Effective Use of
  Synthetic Data
Zero-Shot Distillation for Image Encoders: How to Make Effective Use of Synthetic Data
Niclas Popp
J. H. Metzen
Matthias Hein
VLM
40
1
0
25 Apr 2024
MuseumMaker: Continual Style Customization without Catastrophic
  Forgetting
MuseumMaker: Continual Style Customization without Catastrophic Forgetting
Chenxi Liu
Gan Sun
Wenqi Liang
Jiahua Dong
Can Qin
Yang Cong
DiffM
50
3
0
25 Apr 2024
CoCoG: Controllable Visual Stimuli Generation based on Human Concept
  Representations
CoCoG: Controllable Visual Stimuli Generation based on Human Concept Representations
Chen Wei
Jiachen Zou
Dietmar Heinke
Quanying Liu
45
3
0
25 Apr 2024
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Olivia Wiles
Chuhan Zhang
Isabela Albuquerque
Ivana Kajić
Su Wang
...
Jordi Pont-Tuset
Aida Nematzadeh
Anant Nawalgaria
Jordi Pont-Tuset
Aida Nematzadeh
EGVM
125
13
0
25 Apr 2024
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Zinan Guo
Yanze Wu
Zhuowei Chen
Lang Chen
Qian He
DiffM
41
58
0
24 Apr 2024
Unifying Bayesian Flow Networks and Diffusion Models through Stochastic
  Differential Equations
Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
Kaiwen Xue
Yuhao Zhou
Shen Nie
Xu Min
Xiaolu Zhang
Jun Zhou
Chongxuan Li
DiffM
36
11
0
24 Apr 2024
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with
  Reward Feedback Learning
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Weifeng Chen
Jiacheng Zhang
Jie Wu
Hefeng Wu
Xuefeng Xiao
Liang Lin
41
12
0
23 Apr 2024
From Parts to Whole: A Unified Reference Framework for Controllable
  Human Image Generation
From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Zehuan Huang
Hongxing Fan
Lipeng Wang
Lu Sheng
DiffM
39
10
0
23 Apr 2024
CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation
  Method
CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method
Mingbao Lin
Zhihang Lin
Wengyi Zhan
Liujuan Cao
Rongrong Ji
40
7
0
23 Apr 2024
The Adversarial AI-Art: Understanding, Generation, Detection, and
  Benchmarking
The Adversarial AI-Art: Understanding, Generation, Detection, and Benchmarking
Yuying Li
Zeyan Liu
Junyi Zhao
Liangqin Ren
Fengjun Li
Jiebo Luo
Bo Luo
32
1
0
22 Apr 2024
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Amirmojtaba Sabour
Sanja Fidler
Karsten Kreis
DiffM
32
24
0
22 Apr 2024
FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on
FLDM-VTON: Faithful Latent Diffusion Model for Virtual Try-on
Chenhui Wang
Tao Chen
Zhihao Chen
Zhizhong Huang
Taoran Jiang
Qi Wang
Hongming Shan
DiffM
31
5
0
22 Apr 2024
Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas
Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas
Jia Wei Sii
Chee Seng Chan
DiffM
48
0
0
22 Apr 2024
Towards Better Text-to-Image Generation Alignment via Attention
  Modulation
Towards Better Text-to-Image Generation Alignment via Attention Modulation
Yihang Wu
Xiao Cao
Kaixin Li
Zitan Chen
Haonan Wang
Lei Meng
Zhiyong Huang
DiffM
32
5
0
22 Apr 2024
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation
Yuying Ge
Sijie Zhao
Jinguo Zhu
Yixiao Ge
Kun Yi
Lin Song
Chen Li
Xiaohan Ding
Ying Shan
VLM
60
107
0
22 Apr 2024
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
RHanDS: Refining Malformed Hands for Generated Images with Decoupled Structure and Style Guidance
Chengrui Wang
Pengfei Liu
Min Zhou
Ming Zeng
Xubin Li
Tiezheng Ge
Bo Zheng
DiffM
44
4
0
22 Apr 2024
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image
  Synthesis
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis
Yuxi Ren
Xin Xia
Yanzuo Lu
Jiacheng Zhang
Jie Wu
Pan Xie
Xing Wang
Xuefeng Xiao
45
63
0
21 Apr 2024
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation
Haoyu Zheng
Wenqiao Zhang
Yaoke Wang
Hao Zhou
Jiang Liu
Juncheng Li
Zheqi Lv
Siliang Tang
Yueting Zhuang
Yueting Zhuang
42
1
0
21 Apr 2024
PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt
  Condition
PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition
Xi Fang
Weigang Wang
Xiaoxin Lv
Jun Yan
EGVM
42
3
0
20 Apr 2024
F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained
  Embeddings for Unpaired Frozen Section to FFPE Translation
F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation
M. M. Ho
Shikha Dubey
Yosep Chong
Beatrice S. Knudsen
Tolga Tasdizen
MedIm
AI4CE
32
2
0
19 Apr 2024
GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I
  Diffusion Models
GenVideo: One-shot Target-image and Shape Aware Video Editing using T2I Diffusion Models
Sai Sree Harsha
Ambareesh Revanur
Dhwanit Agarwal
Shradha Agrawal
VGen
DiffM
40
3
0
18 Apr 2024
BLINK: Multimodal Large Language Models Can See but Not Perceive
BLINK: Multimodal Large Language Models Can See but Not Perceive
Xingyu Fu
Yushi Hu
Bangzheng Li
Yu Feng
Haoyu Wang
Xudong Lin
Dan Roth
Noah A. Smith
Wei-Chiu Ma
Ranjay Krishna
VLM
LRM
MLLM
41
110
0
18 Apr 2024
Lazy Diffusion Transformer for Interactive Image Editing
Lazy Diffusion Transformer for Interactive Image Editing
Yotam Nitzan
Zongze Wu
Richard Zhang
Eli Shechtman
Daniel Cohen-Or
Taesung Park
Michael Gharbi
40
8
0
18 Apr 2024
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Nupur Kumari
Grace Su
Richard Zhang
Taesung Park
Eli Shechtman
Jun-Yan Zhu
DiffM
44
3
0
18 Apr 2024
EdgeFusion: On-Device Text-to-Image Generation
EdgeFusion: On-Device Text-to-Image Generation
Thibault Castells
Hyoung-Kyu Song
Tairen Piao
Shinkook Choi
Bo-Kyeong Kim
Hanyoung Yim
Changgwun Lee
Jae Gon Kim
Tae-Ho Kim
VLM
29
6
0
18 Apr 2024
FreeDiff: Progressive Frequency Truncation for Image Editing with
  Diffusion Models
FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models
Wei Wu
Qingnan Fan
Shuai Qin
Hong Gu
Ruoyu Zhao
Antoni B. Chan
DiffM
27
2
0
18 Apr 2024
InFusion: Inpainting 3D Gaussians via Learning Depth Completion from
  Diffusion Prior
InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior
Zhiheng Liu
Ouyang Hao
Qiuyu Wang
Ka Leong Cheng
Jie Xiao
Kai Zhu
Nan Xue
Yu Liu
Yujun Shen
Yang Cao
DiffM
3DGS
45
20
0
17 Apr 2024
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based
  Image Editing
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing
Sherry X Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Kuo-Chin Lien
Misha Sra
Pradeep Sen
39
3
0
17 Apr 2024
LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models
LAPTOP-Diff: Layer Pruning and Normalized Distillation for Compressing Diffusion Models
Dingkun Zhang
Sijia Li
Chen Chen
Qingsong Xie
H. Lu
39
22
0
17 Apr 2024
RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting
RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting
Ashkan Mirzaei
Riccardo de Lutio
Seungryong Kim
David Acuna
J. Kelly
Sanja Fidler
Igor Gilitschenski
Zan Gojcic
DiffM
45
10
0
16 Apr 2024
LaDiC: Are Diffusion Models Really Inferior to Autoregressive
  Counterparts for Image-to-Text Generation?
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
Yuchi Wang
Shuhuai Ren
Rundong Gao
Linli Yao
Qingyan Guo
Kaikai An
Jianhong Bai
Xu Sun
DiffM
VLM
36
6
0
16 Apr 2024
From Data Deluge to Data Curation: A Filtering-WoRA Paradigm for Efficient Text-based Person Search
From Data Deluge to Data Curation: A Filtering-WoRA Paradigm for Efficient Text-based Person Search
Jintao Sun
Zhedong Zheng
Gangyi Ding
Gangyi Ding
37
7
0
16 Apr 2024
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse
  Controls to Any Diffusion Model
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
Han Lin
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
DiffM
VGen
69
20
0
15 Apr 2024
Photo-Realistic Image Restoration in the Wild with Controlled
  Vision-Language Models
Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models
Ziwei Luo
Fredrik K. Gustafsson
Zheng Zhao
Jens Sjölund
Thomas B. Schon
VLM
32
11
0
15 Apr 2024
TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection
  for Efficient Diffusion Models
TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models
Haojun Sun
Chen Tang
Zhi Wang
Yuan Meng
Jingyan Jiang
Xinzhu Ma
Wenwu Zhu
MQ
31
5
0
15 Apr 2024
Semantic Approach to Quantifying the Consistency of Diffusion Model
  Image Generation
Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation
Brinnae Bent
34
0
0
12 Apr 2024
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
Moreno DÍncà
E. Peruzzo
Massimiliano Mancini
Dejia Xu
Vidit Goel
Xingqian Xu
Zhangyang Wang
Humphrey Shi
N. Sebe
53
31
0
11 Apr 2024
Latent Guard: a Safety Framework for Text-to-image Generation
Latent Guard: a Safety Framework for Text-to-image Generation
Runtao Liu
Ashkan Khakzar
Jindong Gu
Qifeng Chen
Philip H. S. Torr
Fabio Pizzati
21
23
0
11 Apr 2024
Rethinking Artistic Copyright Infringements in the Era of Text-to-Image
  Generative Models
Rethinking Artistic Copyright Infringements in the Era of Text-to-Image Generative Models
Mazda Moayeri
Samyadeep Basu
S. Balasubramanian
Priyatham Kattakinda
Atoosa Malemir Chegini
R. Brauneis
S. Feizi
WIGM
38
4
0
11 Apr 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency
  Feedback
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
C. L. P. Chen
35
62
0
11 Apr 2024
Taming Stable Diffusion for Text to 360° Panorama Image Generation
Taming Stable Diffusion for Text to 360° Panorama Image Generation
Cheng Zhang
Qianyi Wu
Camilo Cruz Gambardella
Xiaoshui Huang
Dinh Q. Phung
Wanli Ouyang
Jianfei Cai
MDE
21
8
0
11 Apr 2024
Applying Guidance in a Limited Interval Improves Sample and Distribution
  Quality in Diffusion Models
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models
Tuomas Kynkaanniemi
M. Aittala
Tero Karras
S. Laine
Timo Aila
J. Lehtinen
19
57
0
11 Apr 2024
CAT: Contrastive Adapter Training for Personalized Image Generation
CAT: Contrastive Adapter Training for Personalized Image Generation
Jae Wan Park
Sang Hyun Park
Jun Young Koh
Junha Lee
Min Song
35
5
0
11 Apr 2024
UMBRAE: Unified Multimodal Brain Decoding
UMBRAE: Unified Multimodal Brain Decoding
Weihao Xia
Raoul de Charette
Cengiz Öztireli
Jing-Hao Xue
38
6
0
10 Apr 2024
ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR
  Domain Modeling
ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling
Ege Ozsoy
Chantal Pellegrini
Matthias Keicher
Nassir Navab
VLM
35
2
0
10 Apr 2024
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
Jaidev Shriram
Alex Trevithick
Lingjie Liu
Ravi Ramamoorthi
DiffM
3DGS
75
55
0
10 Apr 2024
ZeST: Zero-Shot Material Transfer from a Single Image
ZeST: Zero-Shot Material Transfer from a Single Image
Ta-Ying Cheng
Prafull Sharma
Andrew Markham
Niki Trigoni
Varun Jampani
38
9
0
09 Apr 2024
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
Kunpeng Song
Yizhe Zhu
Bingchen Liu
Qing Yan
A. Elgammal
Xiao Yang
DiffM
28
16
0
08 Apr 2024
YaART: Yet Another ART Rendering Technology
YaART: Yet Another ART Rendering Technology
Sergey Kastryulin
Artem Konev
Alexander Shishenya
Eugene Lyapustin
Artem Khurshudov
...
Dmitrii Kornilov
Mikhail Romanov
Artem Babenko
Sergei Ovcharenko
Valentin Khrulkov
EGVM
35
1
0
08 Apr 2024
UniFL: Improve Stable Diffusion via Unified Feedback Learning
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Jiacheng Zhang
Jie Wu
Yuxi Ren
Xin Xia
Huafeng Kuang
...
Jiashi Li
Xuefeng Xiao
Min Zheng
Lean Fu
Guanbin Li
37
2
0
08 Apr 2024
Previous
123...232425...313233
Next