ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01952
  4. Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
ArXivPDFHTML

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,616 papers shown
Title
Image Sculpting: Precise Object Editing with 3D Geometry Control
Image Sculpting: Precise Object Editing with 3D Geometry Control
Jiraphon Yenphraphai
Xichen Pan
Sainan Liu
Daniele Panozzo
Saining Xie
30
17
0
02 Jan 2024
Towards a Simultaneous and Granular Identity-Expression Control in
  Personalized Face Generation
Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation
Renshuai Liu
Bowen Ma
Wei Zhang
Zhipeng Hu
Changjie Fan
Tangjie Lv
Yu-qiong Ding
Xuan Cheng
DiffM
14
20
0
02 Jan 2024
New Job, New Gender? Measuring the Social Bias in Image Generation
  Models
New Job, New Gender? Measuring the Social Bias in Image Generation Models
Wenxuan Wang
Haonan Bai
Jen-tse Huang
Yuxuan Wan
Youliang Yuan
Haoyi Qiu
Nanyun Peng
Michael R. Lyu
41
20
0
01 Jan 2024
DiffMorph: Text-less Image Morphing with Diffusion Models
DiffMorph: Text-less Image Morphing with Diffusion Models
Shounak Chatterjee
DiffM
15
0
0
01 Jan 2024
Diffusion Model with Perceptual Loss
Diffusion Model with Perceptual Loss
Shanchuan Lin
Xiao Yang
DiffM
23
15
0
30 Dec 2023
4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency
4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency
Yuyang Yin
Dejia Xu
Zhangyang Wang
Yao-Min Zhao
Yunchao Wei
3DGS
47
72
0
28 Dec 2023
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with
  Time-Decoupled Training and Reusable Coop-Diffusion
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Guansong Lu
Yuanfan Guo
Jianhua Han
Minzhe Niu
Yihan Zeng
Songcen Xu
Zeyi Huang
Zhao Zhong
Wei Zhang
Hang Xu
31
4
0
27 Dec 2023
One-Dimensional Adapter to Rule Them All: Concepts, Diffusion Models and
  Erasing Applications
One-Dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications
Mengyao Lyu
Yuhong Yang
Haiwen Hong
Hui Chen
Xuan Jin
Yuan He
Hui Xue
Jungong Han
Guiguang Ding
DiffM
21
55
0
26 Dec 2023
SSR-Encoder: Encoding Selective Subject Representation for
  Subject-Driven Generation
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
...
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
27
58
0
26 Dec 2023
SAiD: Speech-driven Blendshape Facial Animation with Diffusion
SAiD: Speech-driven Blendshape Facial Animation with Diffusion
Inkyu Park
Jaewoong Cho
29
4
0
25 Dec 2023
Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data
  Generation Framework using Foundational Models
Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models
Gurusha Juneja
Sukrit Kumar
DiffM
6
0
0
23 Dec 2023
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen
Jiaxin Ge
Tianjun Zhang
Jiaming Liu
Shanghang Zhang
VLM
EGVM
27
0
0
23 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
18
237
0
21 Dec 2023
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image
  Inpainting with Diffusion Models
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan
Andranik Sargsyan
Barsegh Atanyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
33
28
0
21 Dec 2023
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion
  Models with RL Finetuning
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
Desai Xie
Jiahao Li
Hao Tan
Xin Sun
Zhixin Shu
Yi Zhou
Sai Bi
Soren Pirk
Arie E. Kaufman
24
8
0
21 Dec 2023
PIA: Your Personalized Image Animator via Plug-and-Play Modules in
  Text-to-Image Models
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
Yiming Zhang
Zhening Xing
Yanhong Zeng
Youqing Fang
Kai Chen
VGen
31
27
0
21 Dec 2023
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
Xianfang Zeng
Xin Chen
Zhongqi Qi
Wen Liu
Zibo Zhao
Zhibin Wang
Bin-Bin Fu
Yong-jin Liu
Gang Yu
DiffM
13
66
0
21 Dec 2023
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed
  Diffusion Models
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM
3DGS
32
112
0
21 Dec 2023
Generative Multimodal Models are In-Context Learners
Generative Multimodal Models are In-Context Learners
Quan-Sen Sun
Yufeng Cui
Xiaosong Zhang
Fan Zhang
Qiying Yu
...
Yueze Wang
Yongming Rao
Jingjing Liu
Tiejun Huang
Xinlong Wang
MLLM
LRM
45
245
0
20 Dec 2023
ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors
ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors
Weijia Mao
Yan-Pei Cao
Jia-Wei Liu
Zhongcong Xu
Mike Zheng Shou
DiffM
43
5
0
20 Dec 2023
RadEdit: stress-testing biomedical vision models via diffusion image
  editing
RadEdit: stress-testing biomedical vision models via diffusion image editing
Fernando Pérez-García
Sam Bond-Taylor
Pedro P. Sanchez
B. V. Breugel
Daniel Coelho De Castro
...
M. Lungren
A. Nori
Javier Alvarez-Valle
Ozan Oktay
Maximilian Ilse
MedIm
43
8
0
20 Dec 2023
Your Student is Better Than Expected: Adaptive Teacher-Student
  Collaboration for Text-Conditional Diffusion Models
Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models
Nikita Starodubcev
Artem Fedorov
Artem Babenko
Dmitry Baranchuk
DiffM
45
3
0
17 Dec 2023
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Mingsheng Li
Xin Chen
C. Zhang
Sijin Chen
Hongyuan Zhu
Fukun Yin
Gang Yu
Tao Chen
17
23
0
17 Dec 2023
M^2ConceptBase: A Fine-Grained Aligned Concept-Centric Multimodal Knowledge Base
M^2ConceptBase: A Fine-Grained Aligned Concept-Centric Multimodal Knowledge Base
Zhiwei Zha
Jiaan Wang
Zhixu Li
Xiangru Zhu
Wei Song
Yanghua Xiao
VLM
29
2
0
16 Dec 2023
Latent Diffusion Models with Image-Derived Annotations for Enhanced
  AI-Assisted Cancer Diagnosis in Histopathology
Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology
Pedro Osório
Guillermo Jiménez-Pérez
Javier Montalt-Tordera
Jens Hooge
Guillem Duran Ballester
...
Sabrina Schroeder
K. Siudak
Julia Vienenkoetter
Bettina Lawrenz
Sadegh Mohammadi
MedIm
25
8
0
15 Dec 2023
Focus on Your Instruction: Fine-grained and Multi-instruction Image
  Editing by Attention Modulation
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Qin Guo
Tianwei Lin
DiffM
18
30
0
15 Dec 2023
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
Ruoxi Shi
Xinyue Wei
Cheng Wang
Hao Su
20
16
0
14 Dec 2023
Reliability in Semantic Segmentation: Can We Use Synthetic Data?
Reliability in Semantic Segmentation: Can We Use Synthetic Data?
Thibaut Loiseau
Tuan-Hung Vu
Mickaël Chen
Patrick Pérez
Matthieu Cord
UQCV
23
12
0
14 Dec 2023
DiffusionLight: Light Probes for Free by Painting a Chrome Ball
DiffusionLight: Light Probes for Free by Painting a Chrome Ball
Pakkapon Phongthawee
Worameth Chinchuthakun
Nontaphat Sinsunthithet
Amit Raj
Varun Jampani
Pramook Khungurn
Supasorn Suwajanakorn
DiffM
19
23
0
14 Dec 2023
Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and
  Multi-Source Supervision
Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision
Shengguang Wu
Zhenglun Chen
Qi Su
DiffM
17
0
0
13 Dec 2023
FreeInit: Bridging Initialization Gap in Video Diffusion Models
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Tianxing Wu
Chenyang Si
Yuming Jiang
Ziqi Huang
Ziwei Liu
DiffM
VGen
30
45
0
12 Dec 2023
FreeControl: Training-Free Spatial Control of Any Text-to-Image
  Diffusion Model with Any Condition
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Sicheng Mo
Fangzhou Mu
Kuan Heng Lin
Yanli Liu
Bochen Guan
Yin Li
Bolei Zhou
DiffM
43
60
0
12 Dec 2023
EditGuard: Versatile Image Watermarking for Tamper Localization and
  Copyright Protection
EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection
Xuanyu Zhang
Runyi Li
Jiwen Yu
You-song Xu
Weiqi Li
Jian Andrew Zhang
WIGM
37
44
0
12 Dec 2023
Boosting Latent Diffusion with Flow Matching
Boosting Latent Diffusion with Flow Matching
Johannes S. Fischer
Ming Gui
Pingchuan Ma
Nick Stracke
S. A. Baumann
Bjorn Ommer
22
20
0
12 Dec 2023
Learned representation-guided diffusion models for large-image
  generation
Learned representation-guided diffusion models for large-image generation
Alexandros Graikos
Srikar Yellapragada
Minh-Quan Le
S. Kapse
Prateek Prasanna
Joel H. Saltz
Dimitris Samaras
DiffM
27
26
0
12 Dec 2023
ControlNet-XS: Designing an Efficient and Effective Architecture for
  Controlling Text-to-Image Diffusion Models
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models
Denis Zavadski
Johann-Friedrich Feiden
Carsten Rother
DiffM
44
10
0
11 Dec 2023
InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction
  Following
InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following
Shufan Li
Harkanwar Singh
Aditya Grover
DiffM
16
7
0
11 Dec 2023
Stellar: Systematic Evaluation of Human-Centric Personalized
  Text-to-Image Methods
Stellar: Systematic Evaluation of Human-Centric Personalized Text-to-Image Methods
Panos Achlioptas
Alexandros Benetatos
Iordanis Fostiropoulos
Dimitris Skourtis
18
8
0
11 Dec 2023
Characteristic Guidance: Non-linear Correction for Diffusion Model at
  Large Guidance Scale
Characteristic Guidance: Non-linear Correction for Diffusion Model at Large Guidance Scale
Candi Zheng
Yuan Lan
DiffM
23
4
0
11 Dec 2023
Efficient Quantization Strategies for Latent Diffusion Models
Efficient Quantization Strategies for Latent Diffusion Models
Yuewei Yang
Xiaoliang Dai
Jialiang Wang
Peizhao Zhang
Hongbo Zhang
DiffM
MQ
22
13
0
09 Dec 2023
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained
  Object Insertion and Layout Control
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control
Jaskirat Singh
Jianming Zhang
Qing Liu
Cameron Smith
Zhe-nan Lin
Liang Zheng
DiffM
34
11
0
08 Dec 2023
UDiffText: A Unified Framework for High-quality Text Synthesis in
  Arbitrary Images via Character-aware Diffusion Models
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao
Zhouhui Lian
71
27
0
08 Dec 2023
GenTron: Diffusion Transformers for Image and Video Generation
GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen
Mengmeng Xu
Jiawei Ren
Yuren Cong
Sen He
Yanping Xie
Animesh Sinha
Ping Luo
Tao Xiang
Juan-Manuel Perez-Rua
VGen
31
38
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
24
37
0
07 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
39
188
0
07 Dec 2023
Approximate Caching for Efficiently Serving Diffusion Models
Approximate Caching for Efficiently Serving Diffusion Models
Shubham Agarwal
Subrata Mitra
Sarthak Chakraborty
Srikrishna Karanam
Koyel Mukherjee
S. Saini
DiffM
25
4
0
07 Dec 2023
Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted
  Nearby Views
Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views
Yabo Chen
Jiemin Fang
Yuyang Huang
Taoran Yi
Xiaopeng Zhang
Lingxi Xie
Xinggang Wang
Wenrui Dai
Hongkai Xiong
Qi Tian
DiffM
27
20
0
07 Dec 2023
Merging by Matching Models in Task Parameter Subspaces
Merging by Matching Models in Task Parameter Subspaces
Derek Tam
Mohit Bansal
Colin Raffel
MoMe
19
10
0
07 Dec 2023
iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image
  Diffusion Model for Interior Design
iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image Diffusion Model for Interior Design
Ruyi Gan
Xiaojun Wu
Junyu Lu
Yuanhe Tian
Di Zhang
...
Renliang Sun
Chang Liu
Jiaxing Zhang
Pingjian Zhang
Yan Song
62
4
0
07 Dec 2023
KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion
  Models for Text-to-Image Synthesis
KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis
Youngwan Lee
Kwanyong Park
Yoorhim Cho
Yong-Ju Lee
Sung Ju Hwang
VLM
27
3
0
07 Dec 2023
Previous
123...282930313233
Next