Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.01952
Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 1,616 papers shown
Title
Diffusion Illusions: Hiding Images in Plain Sight
R. Burgert
Xiang Li
Abe Leite
Kanchana Ranasinghe
Michael S. Ryoo
48
17
0
06 Dec 2023
A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting
Junhao Zhuang
Yanhong Zeng
Wenran Liu
Chun Yuan
Kai Chen
DiffM
39
69
0
06 Dec 2023
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Xuanchi Ren
Jiahui Huang
Xiaohui Zeng
Ken Museth
Sanja Fidler
Francis Williams
18
47
0
06 Dec 2023
FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models
Junhyuk So
Jungwon Lee
Eunhyeok Park
DiffM
26
9
0
06 Dec 2023
Kandinsky 3.0 Technical Report
V.Ya. Arkhipkin
Andrei Filatov
Viacheslav Vasilev
Anastasia Maltseva
Said Azizov
Igor Pavlov
Julia Agafonova
Andrey Kuznetsov
Denis Dimitrov
DiffM
25
10
0
06 Dec 2023
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Felix Wimbauer
Bichen Wu
Edgar Schoenfeld
Xiaoliang Dai
Ji Hou
...
Jonas Kohler
Christian Rupprecht
Daniel Cremers
Peter Vajda
Jialiang Wang
DiffM
30
57
0
06 Dec 2023
FERGI: Automatic Annotation of User Preferences for Text-to-Image Generation from Spontaneous Facial Expression Reaction
Shuangquan Feng
Junhua Ma
Virginia R. de Sa
EGVM
16
0
0
05 Dec 2023
DreamInpainter: Text-Guided Subject-Driven Image Inpainting with Diffusion Models
Shaoan Xie
Yang Zhao
Zhisheng Xiao
Kelvin C. K. Chan
Yandong Li
Yanwu Xu
Kun Zhang
Tingbo Hou
DiffM
25
26
0
05 Dec 2023
Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment
Brian Gordon
Yonatan Bitton
Yonatan Shafir
Roopal Garg
Xi Chen
Dani Lischinski
Daniel Cohen-Or
Idan Szpektor
35
11
0
05 Dec 2023
LivePhoto: Real Image Animation with Text-guided Motion Control
Xi Chen
Zhiheng Liu
Mengting Chen
Yutong Feng
Yu Liu
Yujun Shen
Hengshuang Zhao
VGen
DiffM
34
28
0
05 Dec 2023
H-GAP: Humanoid Control with a Generalist Planner
Zhengyao Jiang
Yingchen Xu
Nolan Wagener
Yicheng Luo
Michael Janner
Edward Grefenstette
Tim Rocktaschel
Yuandong Tian
AI4CE
11
5
0
05 Dec 2023
A Contrastive Compositional Benchmark for Text-to-Image Synthesis: A Study with Unified Text-to-Image Fidelity Metrics
Xiangru Zhu
Penglei Sun
Chengyu Wang
Jingping Liu
Zhixu Li
Yanghua Xiao
Jun Huang
CoGe
100
5
0
04 Dec 2023
Readout Guidance: Learning Control from Diffusion Features
Grace Luo
Trevor Darrell
Oliver Wang
Dan B. Goldman
Aleksander Holynski
16
21
0
04 Dec 2023
Style Aligned Image Generation via Shared Attention
Amir Hertz
Andrey Voynov
Shlomi Fruchter
Daniel Cohen-Or
DiffM
16
122
0
04 Dec 2023
Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation
J. Niemeijer
Manuel Schwonberg
Jan-Aike Termöhlen
Nico M. Schmidt
Tim Fingscheidt
DiffM
26
15
0
04 Dec 2023
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model
L. Ran
Xiaodong Cun
Jia-Wei Liu
Rui Zhao
Song Zijie
Xintao Wang
Jussi Keppo
Mike Zheng Shou
18
11
0
04 Dec 2023
ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation
Peng Wang
Yichun Shi
16
165
0
02 Dec 2023
StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D
Pengsheng Guo
Hans Hao
Adam Caccavale
Zhongzheng Ren
Edward Zhang
Qi Shan
Aditya Sankar
A. Schwing
Alex Colburn
Fangchang Ma
DiffM
3DGS
38
7
0
02 Dec 2023
Raising the Bar of AI-generated Image Detection with CLIP
D. Cozzolino
Giovanni Poggi
Riccardo Corvi
Matthias Nießner
L. Verdoliva
VLM
19
72
0
30 Nov 2023
ART
⋅
\boldsymbol{\cdot}
⋅
V: Auto-Regressive Text-to-Video Generation with Diffusion Models
Wenming Weng
Ruoyu Feng
Yanhui Wang
Qi Dai
Chunyu Wang
...
Jianmin Bao
Yuhui Yuan
Chong Luo
Yueyi Zhang
Zhiwei Xiong
VGen
25
32
0
30 Nov 2023
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
Yanhui Wang
Jianmin Bao
Wenming Weng
Ruoyu Feng
Dacheng Yin
...
Yuhui Yuan
Chuanxin Tang
Xiaoyan Sun
Chong Luo
Baining Guo
DiffM
VGen
66
15
0
30 Nov 2023
One-step Diffusion with Distribution Matching Distillation
Tianwei Yin
Michael Gharbi
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
Taesung Park
DiffM
124
219
0
30 Nov 2023
IMMA: Immunizing text-to-image Models against Malicious Adaptation
Yijia Zheng
Raymond A. Yeh
30
8
0
30 Nov 2023
Diffusion Models Without Attention
Jing Nathan Yan
Jiatao Gu
Alexander M. Rush
22
60
0
30 Nov 2023
Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding
Wujian Peng
Sicheng Xie
Zuyao You
Shiyi Lan
Zuxuan Wu
VLM
CoGe
MLLM
21
17
0
30 Nov 2023
HiPA: Enabling One-Step Text-to-Image Diffusion Models via High-Frequency-Promoting Adaptation
Yifan Zhang
Bryan Hooi
VLM
27
9
0
30 Nov 2023
ElasticDiffusion: Training-free Arbitrary Size Image Generation through Global-Local Content Separation
Moayed Haji-Ali
Guha Balakrishnan
Vicente Ordonez
35
23
0
30 Nov 2023
GELDA: A generative language annotation framework to reveal visual biases in datasets
Krish Kabra
Kathleen M. Lewis
Guha Balakrishnan
VLM
11
1
0
29 Nov 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
62
346
0
29 Nov 2023
Curved Diffusion: A Generative Model With Optical Geometry Control
Andrey Voynov
Amir Hertz
Moab Arar
Shlomi Fruchter
Daniel Cohen-Or
DiffM
17
4
0
29 Nov 2023
M
2
^{2}
2
Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation
Xiaowei Chi
Rongyu Zhang
Zhengkai Jiang
Yijiang Liu
Ziyi Lin
...
Chaoyou Fu
Peng Gao
Shanghang Zhang
Qi-fei Liu
Yi-Ting Guo
MLLM
33
1
0
29 Nov 2023
HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models
Shen Zhang
Zhaowei Chen
Zhenyu Zhao
Yuhao Chen
Yao Tang
Jiajun Liang
29
6
0
29 Nov 2023
MMA-Diffusion: MultiModal Attack on Diffusion Models
Yijun Yang
Ruiyuan Gao
Xiaosen Wang
Tsung-Yi Ho
Nan Xu
Qiang Xu
27
61
0
29 Nov 2023
HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting
Wenquan Lu
Yufei Xu
Jing Zhang
Chaoyue Wang
Dacheng Tao
DiffM
56
25
0
29 Nov 2023
Rethinking Image Editing Detection in the Era of Generative AI Revolution
Zhihao Sun
Haipeng Fang
Xinying Zhao
Danding Wang
Juan Cao
22
8
0
29 Nov 2023
DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Jiao Sun
Deqing Fu
Yushi Hu
Su Wang
Royi Rassin
...
Dana Alon
Charles Herrmann
Sjoerd van Steenkiste
Ranjay Krishna
Cyrus Rashtchian
EGVM
23
39
0
29 Nov 2023
Unlocking Spatial Comprehension in Text-to-Image Diffusion Models
Mohammad Mahdi Derakhshani
Menglin Xia
Harkirat Singh Behl
Cees G. M. Snoek
Victor Rühle
19
2
0
28 Nov 2023
Shadows Don't Lie and Lines Can't Bend! Generative Models don't know Projective Geometry...for now
Ayush Sarkar
Hanlin Mai
Amitabh Mahapatra
Svetlana Lazebnik
D. A. Forsyth
Anand Bhattad
GAN
27
34
0
28 Nov 2023
Adversarial Diffusion Distillation
Axel Sauer
Dominik Lorenz
A. Blattmann
Robin Rombach
138
329
0
28 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
21
43
0
28 Nov 2023
COLE: A Hierarchical Generation Framework for Multi-Layered and Editable Graphic Design
Peidong Jia
Chenxuan Li
Yuhui Yuan
Zeyu Liu
Yichao Shen
...
Dong Chen
Ji Li
Xiaodong Xie
Shanghang Zhang
Baining Guo
22
6
0
28 Nov 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM
VGen
18
113
0
28 Nov 2023
Panacea: Panoramic and Controllable Video Generation for Autonomous Driving
Yuqing Wen
Yucheng Zhao
Yingfei Liu
Fan Jia
Yanhui Wang
Chong Luo
Chi Zhang
Tiancai Wang
Xiaoyan Sun
Xiangyu Zhang
70
57
0
28 Nov 2023
As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors
Seungwoo Yoo
Kunho Kim
Vladimir G. Kim
Minhyuk Sung
DiffM
21
13
0
28 Nov 2023
LEDITS++: Limitless Image Editing using Text-to-Image Models
Manuel Brack
Felix Friedrich
Katharina Kornmeier
Linoy Tsaban
P. Schramowski
Kristian Kersting
Apolinário Passos
DiffM
27
69
0
28 Nov 2023
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
Yang Zhao
Yanwu Xu
Zhisheng Xiao
Haolin Jia
Tingbo Hou
VLM
39
11
0
28 Nov 2023
SEED-Bench-2: Benchmarking Multimodal Large Language Models
Bohao Li
Yuying Ge
Yixiao Ge
Guangzhi Wang
Rui Wang
Ruimao Zhang
Ying Shan
MLLM
VLM
23
67
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
17
60
0
28 Nov 2023
PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation
Jiancang Ma
Chen Chen
Qingsong Xie
H. Lu
DiffM
VLM
20
3
0
28 Nov 2023
CoSeR: Bridging Image and Language for Cognitive Super-Resolution
Haoze Sun
Wenbo Li
Jianzhuang Liu
Haoyu Chen
Renjing Pei
X. Zou
Youliang Yan
Yujiu Yang
SupR
33
45
0
27 Nov 2023
Previous
1
2
3
...
29
30
31
32
33
Next