Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.01952
Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 1,616 papers shown
Title
Deceptive-Human: Prompt-to-NeRF 3D Human Generation with 3D-Consistent Synthetic Images
Shiu-hong Kao
Xinhang Liu
Yu-Wing Tai
Chi-Keung Tang
13
0
0
27 Nov 2023
SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion
Hsuan-I Ho
Jie Song
Otmar Hilliges
DiffM
11
31
0
27 Nov 2023
LLMGA: Multimodal Large Language Model based Generation Assistant
Bin Xia
Shiyin Wang
Yingfan Tao
Yitong Wang
Jiaya Jia
MLLM
28
12
0
27 Nov 2023
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
Minghui Hu
Jianbin Zheng
Chuanxia Zheng
Chaoyue Wang
Dacheng Tao
Tat-Jen Cham
DiffM
19
3
0
27 Nov 2023
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Xiaohan Ding
Yiyuan Zhang
Yixiao Ge
Sijie Zhao
Lin Song
Xiangyu Yue
Ying Shan
VLM
AI4TS
SSL
21
98
0
27 Nov 2023
HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View
D. Kothandaraman
Tianyi Zhou
Ming C. Lin
Dinesh Manocha
DiffM
21
2
0
27 Nov 2023
Flow-Guided Diffusion for Video Inpainting
Bohai Gu
Yongsheng Yu
Hengrui Fan
Libo Zhang
VGen
DiffM
28
12
0
26 Nov 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
A. Blattmann
Tim Dockhorn
Sumith Kulal
Daniel Mendelevitch
Maciej Kilian
...
Zion English
Vikram S. Voleti
Adam Letts
Varun Jampani
Robin Rombach
VGen
150
1,009
0
25 Nov 2023
Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision
Nicholas Lui
Bryan Chia
William Berrios
Candace Ross
Douwe Kiela
19
2
0
25 Nov 2023
GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
Yiwen Chen
Zilong Chen
Chi Zhang
Feng Wang
Xiaofeng Yang
Yikai Wang
Zhongang Cai
Lei Yang
Huaping Liu
Guosheng Lin
3DGS
108
184
0
24 Nov 2023
DemoFusion: Democratising High-Resolution Image Generation With No
Ruoyi Du
Dongliang Chang
Timothy M. Hospedales
Yi-Zhe Song
Zhanyu Ma
33
47
0
24 Nov 2023
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
Weijia Wu
Zhuang Li
Yefei He
Mike Zheng Shou
Chunhua Shen
Lele Cheng
Yan Li
Tingting Gao
Di Zhang
VLM
126
24
0
24 Nov 2023
ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs
Viraj Shah
Nataniel Ruiz
Forrester Cole
Erika Lu
Svetlana Lazebnik
Yuanzhen Li
Varun Jampani
DiffM
25
100
0
22 Nov 2023
Diffusion Model Alignment Using Direct Preference Optimization
Bram Wallace
Meihua Dang
Rafael Rafailov
Linqi Zhou
Aaron Lou
Senthil Purushwalkam
Stefano Ermon
Caiming Xiong
Shafiq R. Joty
Nikhil Naik
EGVM
33
220
0
21 Nov 2023
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Jiaxi Lv
Yi Huang
Mingfu Yan
Jiancheng Huang
Jianzhuang Liu
Yifan Liu
Yafei Wen
Xiaoxin Chen
Shifeng Chen
VGen
DiffM
23
23
0
21 Nov 2023
LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis
Peiang Zhao
Han Li
Ruiyang Jin
S. Kevin Zhou
DiffM
36
12
0
21 Nov 2023
AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance
Zuozhuo Dai
Zhenghao Zhang
Yao Yao
Bingxue Qiu
Siyu Zhu
Long Qin
Weizhi Wang
VGen
23
44
0
21 Nov 2023
Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models
Rohit Gandikota
Joanna Materzyñska
Tingrui Zhou
Antonio Torralba
David Bau
DiffM
35
61
0
20 Nov 2023
Pyramid Diffusion for Fine 3D Large Scene Generation
Yuheng Liu
Xinke Li
Xueting Li
Lu Qi
Chongshou Li
Ming-Hsuan Yang
47
15
0
20 Nov 2023
EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
Ruoxi Chen
Haibo Jin
Yixin Liu
Jinyin Chen
Haohan Wang
Lichao Sun
20
10
0
19 Nov 2023
Make Pixels Dance: High-Dynamic Video Generation
Yan Zeng
Guoqiang Wei
Jiani Zheng
Jiaxin Zou
Yang Wei
Yuchen Zhang
Hang Li
DiffM
VGen
19
90
0
18 Nov 2023
Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
Rohit Girdhar
Mannat Singh
Andrew Brown
Quentin Duval
S. Azadi
Sai Saketh Rambhatla
Akbar Shah
Xi Yin
Devi Parikh
Ishan Misra
DiffM
VGen
35
189
0
17 Nov 2023
Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression
Animesh Sinha
Bo Sun
Anmol Kalia
Arantxa Casanova
Elliot Blanchard
...
Ankit Ramchandani
Maziar Sanjabi
Sonal Gupta
Amy Bearman
Dhruv Mahajan
DiffM
28
4
0
17 Nov 2023
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Omri Avrahami
Amir Hertz
Yael Vinker
Moab Arar
Shlomi Fruchter
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
DiffM
42
32
0
16 Nov 2023
Synthetically Enhanced: Unveiling Synthetic Data's Potential in Medical Imaging Research
Bardia Khosravi
Frank Li
Theo Dapamede
Pouria Rouzrokh
Cooper Gamble
...
C. Wyles
Andrew B. Sellergren
S. Purkayastha
Bradley J. Erickson
J. Gichoya
MedIm
22
17
0
15 Nov 2023
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu
Yang Zhao
Zhisheng Xiao
Tingbo Hou
129
106
0
14 Nov 2023
FIRST: A Million-Entry Dataset for Text-Driven Fashion Synthesis and Design
Zhen Huang
Yihao Li
Dong Pei
Jiapeng Zhou
Xuliang Ning
Jianlin Han
Xiaoguang Han
Xuejun Chen
33
3
0
13 Nov 2023
Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model
Jiahao Li
Hao Tan
Kai Zhang
Zexiang Xu
Fujun Luan
Yinghao Xu
Yicong Hong
Kalyan Sunkavalli
Greg Shakhnarovich
Sai Bi
43
254
0
10 Nov 2023
Post-training Quantization for Text-to-Image Diffusion Models with Progressive Calibration and Activation Relaxing
Siao Tang
Xin Wang
Hong Chen
Chaoyu Guan
Zewen Wu
Yansong Tang
Wenwu Zhu
MQ
33
15
0
10 Nov 2023
LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Simian Luo
Yiqin Tan
Suraj Patil
Daniel Gu
Patrick von Platen
Apolinário Passos
Longbo Huang
Jian Li
Hang Zhao
MoMe
108
144
0
09 Nov 2023
u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model
Jinjin Xu
Liwu Xu
Yuzhe Yang
Xiang Li
Fanyi Wang
Yanchun Xie
Yi-Jie Huang
Yaqian Li
MoE
MLLM
VLM
24
12
0
09 Nov 2023
Chain of Images for Intuitively Reasoning
Fanxu Meng
Haotong Yang
Yiding Wang
Muhan Zhang
LRM
28
6
0
09 Nov 2023
A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization
Xingzhe He
Zhiwen Cao
Nicholas I. Kolkin
Lantao Yu
Kun Wan
Helge Rhodin
Ratheesh Kalarot
18
12
0
07 Nov 2023
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Shiwei Zhang
Jiayu Wang
Yingya Zhang
Kang Zhao
Hangjie Yuan
Z. Qin
Xiang Wang
Deli Zhao
Jingren Zhou
DiffM
VGen
26
198
0
07 Nov 2023
SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis
Hanrong Ye
Jason Kuen
Qing Liu
Zhe-nan Lin
Brian L. Price
Dan Xu
VLM
18
10
0
06 Nov 2023
Quantum circuit synthesis with diffusion models
Florian Fürrutter
Gorka Muñoz-Gil
H. Briegel
AI4CE
DiffM
17
20
0
03 Nov 2023
The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing
Shen Nie
Hanzhong Guo
Cheng Lu
Yuhao Zhou
Chenyu Zheng
Chongxuan Li
DiffM
19
37
0
02 Nov 2023
POS: A Prompts Optimization Suite for Augmenting Text-to-Video Generation
Shijie Ma
Huayi Xu
Mengjian Li
Weidong Geng
Yaxiong Wang
Meng Wang
DiffM
VGen
11
0
0
02 Nov 2023
De-Diffusion Makes Text a Strong Cross-Modal Interface
Chen Wei
Chenxi Liu
Siyuan Qiao
Zhishuai Zhang
Alan Yuille
Jiahui Yu
VLM
DiffM
29
10
0
01 Nov 2023
Consistent Video-to-Video Transfer Using Synthetic Dataset
Jiaxin Cheng
Tianjun Xiao
Tong He
VGen
DiffM
31
14
0
01 Nov 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
21
277
0
30 Oct 2023
Noise-Free Score Distillation
Oren Katzir
Or Patashnik
Daniel Cohen-Or
Dani Lischinski
DiffM
13
70
0
26 Oct 2023
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors
You-Ming Chang
Chen Yeh
Wei-Chen Chiu
Ning Yu
VPVLM
VLM
64
21
0
26 Oct 2023
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
46
8
0
25 Oct 2023
Integrating View Conditions for Image Synthesis
Jinbin Bai
Zhen Dong
Aosong Feng
Xiao Zhang
Tian-Chun Ye
Kaicheng Zhou
59
12
0
24 Oct 2023
Matryoshka Diffusion Models
Jiatao Gu
Shuangfei Zhai
Yizhen Zhang
Joshua M. Susskind
Navdeep Jaitly
DiffM
8
43
0
23 Oct 2023
Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
Ruoxi Shi
Hansheng Chen
Zhuoyang Zhang
Minghua Liu
Chao Xu
Xinyue Wei
Linghao Chen
Chong Zeng
Hao Su
VLM
14
339
0
23 Oct 2023
Λ
Λ
Λ
-Split: A Privacy-Preserving Split Computing Framework for Cloud-Powered Generative AI
Shoki Ohta
Takayuki Nishio
62
4
0
23 Oct 2023
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
Shawn Shan
Wenxin Ding
Josephine Passananti
Stanley Wu
Haitao Zheng
Ben Y. Zhao
SILM
DiffM
24
44
0
20 Oct 2023
DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation
Bangbang Yang
Wenqi Dong
Lin Ma
Wenbo Hu
Xiao Liu
Zhaopeng Cui
Yuewen Ma
DiffM
27
16
0
19 Oct 2023
Previous
1
2
3
...
30
31
32
33
Next