Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.01952
Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 1,616 papers shown
Title
Image Sculpting: Precise Object Editing with 3D Geometry Control
Jiraphon Yenphraphai
Xichen Pan
Sainan Liu
Daniele Panozzo
Saining Xie
30
17
0
02 Jan 2024
Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation
Renshuai Liu
Bowen Ma
Wei Zhang
Zhipeng Hu
Changjie Fan
Tangjie Lv
Yu-qiong Ding
Xuan Cheng
DiffM
14
20
0
02 Jan 2024
New Job, New Gender? Measuring the Social Bias in Image Generation Models
Wenxuan Wang
Haonan Bai
Jen-tse Huang
Yuxuan Wan
Youliang Yuan
Haoyi Qiu
Nanyun Peng
Michael R. Lyu
41
20
0
01 Jan 2024
DiffMorph: Text-less Image Morphing with Diffusion Models
Shounak Chatterjee
DiffM
15
0
0
01 Jan 2024
Diffusion Model with Perceptual Loss
Shanchuan Lin
Xiao Yang
DiffM
23
15
0
30 Dec 2023
4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency
Yuyang Yin
Dejia Xu
Zhangyang Wang
Yao-Min Zhao
Yunchao Wei
3DGS
47
72
0
28 Dec 2023
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion
Guansong Lu
Yuanfan Guo
Jianhua Han
Minzhe Niu
Yihan Zeng
Songcen Xu
Zeyi Huang
Zhao Zhong
Wei Zhang
Hang Xu
31
4
0
27 Dec 2023
One-Dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications
Mengyao Lyu
Yuhong Yang
Haiwen Hong
Hui Chen
Xuan Jin
Yuan He
Hui Xue
Jungong Han
Guiguang Ding
DiffM
21
55
0
26 Dec 2023
SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation
Yuxuan Zhang
Yiren Song
Jiaming Liu
Rui Wang
Jinpeng Yu
...
Huaxia Li
Xu Tang
Yao Hu
Han Pan
Zhongliang Jing
27
58
0
26 Dec 2023
SAiD: Speech-driven Blendshape Facial Animation with Diffusion
Inkyu Park
Jaewoong Cho
29
4
0
25 Dec 2023
Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models
Gurusha Juneja
Sukrit Kumar
DiffM
6
0
0
23 Dec 2023
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen
Jiaxin Ge
Tianjun Zhang
Jiaming Liu
Shanghang Zhang
VLM
EGVM
27
0
0
23 Dec 2023
VideoPoet: A Large Language Model for Zero-Shot Video Generation
Dan Kondratyuk
Lijun Yu
Xiuye Gu
José Lezama
Jonathan Huang
...
Irfan Essa
Huisheng Wang
David A. Ross
Bryan Seybold
Lu Jiang
VGen
18
237
0
21 Dec 2023
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Hayk Manukyan
Andranik Sargsyan
Barsegh Atanyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
33
28
0
21 Dec 2023
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
Desai Xie
Jiahao Li
Hao Tan
Xin Sun
Zhixin Shu
Yi Zhou
Sai Bi
Soren Pirk
Arie E. Kaufman
24
8
0
21 Dec 2023
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
Yiming Zhang
Zhening Xing
Yanhong Zeng
Youqing Fang
Kai Chen
VGen
31
27
0
21 Dec 2023
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
Xianfang Zeng
Xin Chen
Zhongqi Qi
Wen Liu
Zibo Zhao
Zhibin Wang
Bin-Bin Fu
Yong-jin Liu
Gang Yu
DiffM
13
66
0
21 Dec 2023
Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models
Huan Ling
Seung Wook Kim
Antonio Torralba
Sanja Fidler
Karsten Kreis
DiffM
3DGS
32
112
0
21 Dec 2023
Generative Multimodal Models are In-Context Learners
Quan-Sen Sun
Yufeng Cui
Xiaosong Zhang
Fan Zhang
Qiying Yu
...
Yueze Wang
Yongming Rao
Jingjing Liu
Tiejun Huang
Xinlong Wang
MLLM
LRM
45
245
0
20 Dec 2023
ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors
Weijia Mao
Yan-Pei Cao
Jia-Wei Liu
Zhongcong Xu
Mike Zheng Shou
DiffM
43
5
0
20 Dec 2023
RadEdit: stress-testing biomedical vision models via diffusion image editing
Fernando Pérez-García
Sam Bond-Taylor
Pedro P. Sanchez
B. V. Breugel
Daniel Coelho De Castro
...
M. Lungren
A. Nori
Javier Alvarez-Valle
Ozan Oktay
Maximilian Ilse
MedIm
43
8
0
20 Dec 2023
Your Student is Better Than Expected: Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models
Nikita Starodubcev
Artem Fedorov
Artem Babenko
Dmitry Baranchuk
DiffM
45
3
0
17 Dec 2023
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Mingsheng Li
Xin Chen
C. Zhang
Sijin Chen
Hongyuan Zhu
Fukun Yin
Gang Yu
Tao Chen
17
23
0
17 Dec 2023
M^2ConceptBase: A Fine-Grained Aligned Concept-Centric Multimodal Knowledge Base
Zhiwei Zha
Jiaan Wang
Zhixu Li
Xiangru Zhu
Wei Song
Yanghua Xiao
VLM
29
2
0
16 Dec 2023
Latent Diffusion Models with Image-Derived Annotations for Enhanced AI-Assisted Cancer Diagnosis in Histopathology
Pedro Osório
Guillermo Jiménez-Pérez
Javier Montalt-Tordera
Jens Hooge
Guillem Duran Ballester
...
Sabrina Schroeder
K. Siudak
Julia Vienenkoetter
Bettina Lawrenz
Sadegh Mohammadi
MedIm
25
8
0
15 Dec 2023
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Qin Guo
Tianwei Lin
DiffM
18
30
0
15 Dec 2023
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
Ruoxi Shi
Xinyue Wei
Cheng Wang
Hao Su
20
16
0
14 Dec 2023
Reliability in Semantic Segmentation: Can We Use Synthetic Data?
Thibaut Loiseau
Tuan-Hung Vu
Mickaël Chen
Patrick Pérez
Matthieu Cord
UQCV
23
12
0
14 Dec 2023
DiffusionLight: Light Probes for Free by Painting a Chrome Ball
Pakkapon Phongthawee
Worameth Chinchuthakun
Nontaphat Sinsunthithet
Amit Raj
Varun Jampani
Pramook Khungurn
Supasorn Suwajanakorn
DiffM
19
23
0
14 Dec 2023
Knowledge-Aware Artifact Image Synthesis with LLM-Enhanced Prompting and Multi-Source Supervision
Shengguang Wu
Zhenglun Chen
Qi Su
DiffM
17
0
0
13 Dec 2023
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Tianxing Wu
Chenyang Si
Yuming Jiang
Ziqi Huang
Ziwei Liu
DiffM
VGen
30
45
0
12 Dec 2023
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Sicheng Mo
Fangzhou Mu
Kuan Heng Lin
Yanli Liu
Bochen Guan
Yin Li
Bolei Zhou
DiffM
43
60
0
12 Dec 2023
EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection
Xuanyu Zhang
Runyi Li
Jiwen Yu
You-song Xu
Weiqi Li
Jian Andrew Zhang
WIGM
37
44
0
12 Dec 2023
Boosting Latent Diffusion with Flow Matching
Johannes S. Fischer
Ming Gui
Pingchuan Ma
Nick Stracke
S. A. Baumann
Bjorn Ommer
22
20
0
12 Dec 2023
Learned representation-guided diffusion models for large-image generation
Alexandros Graikos
Srikar Yellapragada
Minh-Quan Le
S. Kapse
Prateek Prasanna
Joel H. Saltz
Dimitris Samaras
DiffM
27
26
0
12 Dec 2023
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models
Denis Zavadski
Johann-Friedrich Feiden
Carsten Rother
DiffM
44
10
0
11 Dec 2023
InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following
Shufan Li
Harkanwar Singh
Aditya Grover
DiffM
16
7
0
11 Dec 2023
Stellar: Systematic Evaluation of Human-Centric Personalized Text-to-Image Methods
Panos Achlioptas
Alexandros Benetatos
Iordanis Fostiropoulos
Dimitris Skourtis
18
8
0
11 Dec 2023
Characteristic Guidance: Non-linear Correction for Diffusion Model at Large Guidance Scale
Candi Zheng
Yuan Lan
DiffM
23
4
0
11 Dec 2023
Efficient Quantization Strategies for Latent Diffusion Models
Yuewei Yang
Xiaoliang Dai
Jialiang Wang
Peizhao Zhang
Hongbo Zhang
DiffM
MQ
22
13
0
09 Dec 2023
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control
Jaskirat Singh
Jianming Zhang
Qing Liu
Cameron Smith
Zhe-nan Lin
Liang Zheng
DiffM
34
11
0
08 Dec 2023
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
Yiming Zhao
Zhouhui Lian
71
27
0
08 Dec 2023
GenTron: Diffusion Transformers for Image and Video Generation
Shoufa Chen
Mengmeng Xu
Jiawei Ren
Yuren Cong
Sen He
Yanping Xie
Animesh Sinha
Ping Luo
Tao Xiang
Juan-Manuel Perez-Rua
VGen
31
38
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
24
37
0
07 Dec 2023
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
39
188
0
07 Dec 2023
Approximate Caching for Efficiently Serving Diffusion Models
Shubham Agarwal
Subrata Mitra
Sarthak Chakraborty
Srikrishna Karanam
Koyel Mukherjee
S. Saini
DiffM
25
4
0
07 Dec 2023
Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views
Yabo Chen
Jiemin Fang
Yuyang Huang
Taoran Yi
Xiaopeng Zhang
Lingxi Xie
Xinggang Wang
Wenrui Dai
Hongkai Xiong
Qi Tian
DiffM
27
20
0
07 Dec 2023
Merging by Matching Models in Task Parameter Subspaces
Derek Tam
Mohit Bansal
Colin Raffel
MoMe
19
10
0
07 Dec 2023
iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image Diffusion Model for Interior Design
Ruyi Gan
Xiaojun Wu
Junyu Lu
Yuanhe Tian
Di Zhang
...
Renliang Sun
Chang Liu
Jiaxing Zhang
Pingjian Zhang
Yan Song
62
4
0
07 Dec 2023
KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis
Youngwan Lee
Kwanyong Park
Yoorhim Cho
Yong-Ju Lee
Sung Ju Hwang
VLM
27
3
0
07 Dec 2023
Previous
1
2
3
...
28
29
30
31
32
33
Next