2307.01952
Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
Papers citing
"SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 1,616 papers shown
Title
Matrix3D: Large Photogrammetry Model All-in-One
Yuanxun Lu
Jingyang Zhang
Tian Fang
Jean-Daniel Nahmias
Yanghai Tsin
Long Quan
Xun Cao
Yao Yao
Shiwei Li
114
4
0
11 Feb 2025
UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths
Weijia Mao
Z. Yang
Mike Zheng Shou
MoE
69
0
0
10 Feb 2025
Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image Generation
Vera Soboleva
M. Nakhodnov
Aibek Alanov
47
0
0
09 Feb 2025
Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling
Xiao Li
Zekai Zhang
Xiang Li
Siyi Chen
Zhihui Zhu
Peng Wang
Qing Qu
DiffM
49
0
0
09 Feb 2025
Stochastic Forward-Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets
Haoye Lu
Qifan Wu
Yaoliang Yu
DiffM
49
0
0
08 Feb 2025
Training-Free Constrained Generation With Stable Diffusion Models
Stefano Zampini
Jacob K Christopher
Luca Oneto
Davide Anguita
Ferdinando Fioretto
46
0
0
08 Feb 2025
Beyond and Free from Diffusion: Invertible Guided Consistency Training
Chia-Hong Hsu
Shiu-hong Kao
Randall Balestriero
3DV
77
0
0
08 Feb 2025
Hummingbird: High Fidelity Image Generation via Multimodal Context Alignment
Minh-Quan Le
Gaurav Mittal
Tianjian Meng
A S M Iftekhar
Vishwas Suryanarayanan
Barun Patra
Dimitris Samaras
Mei Chen
DiffM
60
0
0
07 Feb 2025
FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing
Jinya Sakurai
Issei Sato
74
0
0
06 Feb 2025
Recommendations Beyond Catalogs: Diffusion Models for Personalized Generation
Gabriel Patron
Zhiwei Xu
Ishan Kapnadak
Felipe Maia Polo
DiffM
38
0
0
05 Feb 2025
Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach
Yunuo Chen
Junli Cao
Anil Kag
Vidit Goel
Sergei Korolev
Chenfanfu Jiang
Sergey Tulyakov
Jian Ren
DiffM
VGen
88
1
0
05 Feb 2025
One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation
J. Li
Jiezhang Cao
Yong Guo
W. J. Li
Yulun Zhang
DiffM
73
0
0
04 Feb 2025
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
Rohit Gandikota
Zongze Wu
Richard Zhang
David Bau
Eli Shechtman
Nick Kolkin
DiffM
48
1
0
03 Feb 2025
HuViDPO: Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment
Lifan Jiang
Boxi Wu
Jiahui Zhang
Xiaotong Guan
Shuang Chen
VGen
61
1
0
02 Feb 2025
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation
Chenguo Lin
Panwang Pan
Bangbang Yang
Zeming Li
Yadong Mu
3DGS
76
7
0
28 Jan 2025
An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control
Aosong Feng
Weikang Qiu
Jinbin Bai
Xiao Zhang
Zhen Dong
Kaicheng Zhou
Rex Ying
Leandros Tassiulas
DiffM
58
6
0
28 Jan 2025
Can Pose Transfer Models Generate Realistic Human Motion?
Vaclav Knapp
Matyas Bohacek
124
0
0
28 Jan 2025
Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
Yunbo Lyu
Zhou Yang
Yuqing Niu
Jing Jiang
David Lo
32
1
0
28 Jan 2025
CE-SDWV: Effective and Efficient Concept Erasure for Text-to-Image Diffusion Models via a Semantic-Driven Word Vocabulary
Jiahang Tu
Qian Feng
Chufan Chen
Jiahua Dong
Hanbin Zhao
Chao Zhang
Hui Qian
72
2
0
28 Jan 2025
Turn That Frown Upside Down: FaceID Customization via Cross-Training Data
Shuhe Wang
Xiaoya Li
Xiaofei Sun
G. Wang
Tianwei Zhang
Jiwei Li
Eduard H. Hovy
38
0
0
28 Jan 2025
CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes
Tim Broedermann
Christos Sakaridis
Yuqian Fu
Luc Van Gool
57
5
0
28 Jan 2025
Sparse High Rank Adapters
K. Bhardwaj
N. Pandey
Sweta Priyadarshi
Viswanath Ganapathy
Rafael Esteves
...
P. Whatmough
Risheek Garrepalli
M. V. Baalen
Harris Teague
Markus Nagel
MQ
38
4
0
28 Jan 2025
PAID: A Framework of Product-Centric Advertising Image Design
Hongyu Chen
Min Zhou
Jing Jiang
Jiale Chen
Yang Lu
Bo Xiao
T. Ge
Bo Zheng
DiffM
VLM
38
0
0
24 Jan 2025
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
Andrey Palaev
Adil Mehmood Khan
S. M. Ahsan Kazmi
DiffM
48
0
0
23 Jan 2025
PreciseCam: Precise Camera Control for Text-to-Image Generation
Edurne Bernal-Berdun
Ana Serrano
B. Masiá
Matheus Gadelha
Yannick Hold-Geoffroy
Xin Sun
Diego F. F. Gutierrez
DiffM
VGen
45
0
0
22 Jan 2025
Accelerate High-Quality Diffusion Models with Inner Loop Feedback
M. Gwilliam
Han Cai
Di Wu
Abhinav Shrivastava
Zhiyu Cheng
90
0
0
22 Jan 2025
Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement
Christoph Gebhardt
Robin Willardt
Seyedmorteza Sadat
Chih-Wei Ning
Andreas Brombach
Jie Song
Otmar Hilliges
Christian Holz
65
0
0
21 Jan 2025
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space
Daniel Garibi
Shahar Yadin
Roni Paiss
Omer Tov
Shiran Zada
Ariel Ephrat
T. Michaeli
Inbar Mosseri
Tali Dekel
DiffM
103
2
0
21 Jan 2025
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
Yiyang Wang
Xi Chen
Xiaogang Xu
S. Ji
Y. Liu
Yujun Shen
Hengshuang Zhao
DiffM
49
0
0
21 Jan 2025
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation
Zibo Zhao
Zeqiang Lai
Qingxiang Lin
Yunfei Zhao
Haolin Liu
...
Jingwei Huang
Chunchao Guo
Jie Jiang
111
21
0
21 Jan 2025
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance
Jin Zhu
Huimin Ma
Jiansheng Chen
Jian Yuan
71
4
0
20 Jan 2025
StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
Ruojun Xu
Weijie Xi
Xiaodi Wang
Yongbo Mao
Zach Cheng
DiffM
31
1
0
20 Jan 2025
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
Shristi Das Biswas
Matthew Shreve
Xuelu Li
Prateek Singhal
Kaushik Roy
DiffM
41
1
0
20 Jan 2025
Lossy Compression with Pretrained Diffusion Models
Jeremy Vonderfecht
Feng Liu
DiffM
97
1
0
20 Jan 2025
Know "No" Better: A Data-Driven Approach for Enhancing Negation Awareness in CLIP
J. Park
Jungbeom Lee
Jongyoon Song
Sangwon Yu
Dahuin Jung
Sungroh Yoon
45
0
0
19 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
102
18
0
17 Jan 2025
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation
Hwan Heo
Jangyeong Kim
Seongyeong Lee
Jeong A Wi
Junyoung Choi
Sangjun Ahn
49
0
0
17 Jan 2025
A General Framework for Inference-time Scaling and Steering of Diffusion Models
R. Singhal
Zachary Horvitz
Ryan Teehan
Mengye Ren
Zhou Yu
Kathleen McKeown
Rajesh Ranganath
DiffM
61
15
0
17 Jan 2025
TextureCrop: Enhancing Synthetic Image Detection through Texture-based Cropping
Despina Konstantinidou
C. Koutlis
Symeon Papadopoulos
70
2
0
17 Jan 2025
How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias
Tosin Fadahunsi
Giordano d'Aloisio
A. Marco
Federica Sarro
DiffM
61
0
0
15 Jan 2025
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Dongwon Kim
Ju He
Qihang Yu
Chenglin Yang
Xiaohui Shen
Suha Kwak
Liang-Chieh Chen
VLM
46
6
0
13 Jan 2025
IP-FaceDiff: Identity-Preserving Facial Video Editing with Diffusion
Tharun Anand
Aryan Garg
Kaushik Mitra
VGen
DiffM
45
0
0
13 Jan 2025
Enhancing Image Generation Fidelity via Progressive Prompts
Zhen Xiong
Yuqi Li
Chuanguang Yang
Tiao Tan
Zhihong Zhu
Siyuan Li
Yue Ma
45
1
0
13 Jan 2025
Focus-N-Fix: Region-Aware Fine-Tuning for Text-to-Image Generation
Xiaoying Xing
Avinab Saha
Junfeng He
Susan Hao
Paul Vicol
...
Sahil Singla
Sarah Young
Yinxiao Li
Feng Yang
Deepak Ramachandran
DiffM
48
0
0
11 Jan 2025
VideoAuteur: Towards Long Narrative Video Generation
Junfei Xiao
Feng Cheng
Lu Qi
Liangke Gui
Jiepeng Cen
Zhibei Ma
Alan L. Yuille
Lu Jiang
VGen
56
2
0
10 Jan 2025
EditAR: Unified Conditional Generation with Autoregressive Models
Jiteng Mu
Nuno Vasconcelos
X. Wang
DiffM
38
4
0
08 Jan 2025
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Jiawei Liu
Yuanzhi Zhu
Feiyu Gao
Z. Yang
P. Wang
Junyang Lin
X. Wang
Wenyu Liu
DiffM
43
0
0
08 Jan 2025
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Dongmin Park
Sebin Kim
Taehong Moon
Minkyu Kim
Kangwook Lee
Jaewoong Cho
DiffM
CoGe
62
2
0
08 Jan 2025
Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis
Thang-Anh-Quan Nguyen
Nathan Piasco
Luis Roldão
Moussâb Bennehar
D. Tsishkou
Laurent Caraffa
J. Tarel
R. Brémond
DiffM
47
1
0
06 Jan 2025
Nested Attention: Semantic-aware Attention Values for Concept Personalization
Or Patashnik
Rinon Gal
Daniil Ostashev
Sergey Tulyakov
Kfir Aberman
Daniel Cohen-Or
DiffM
35
5
0
03 Jan 2025