arXiv:2307.01952
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Müller
Joe Penna
Robin Rombach
Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 1,616 papers shown
Title
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Junhao Cheng
Xi Lu
Hanhui Li
Khun Loun Zai
Baiqiao Yin
Yuhao Cheng
Yiqiang Yan
Xiaodan Liang
DiffM
VGen
37
10
0
03 Jun 2024
fruit-SALAD: A Style Aligned Artwork Dataset to reveal similarity perception in image embeddings
Tillmann Ohm
Andres Karjus
Mikhail Tamm
Maximilian Schich
36
1
0
03 Jun 2024
Dimba: Transformer-Mamba Diffusion Models
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Youqiang Zhang
Junshi Huang
Mamba
62
16
0
03 Jun 2024
Improving Text Generation on Images with Synthetic Captions
Jun Young Koh
Sang Hyun Park
Joy Song
DiffM
51
2
0
01 Jun 2024
AudioLCM: Text-to-Audio Generation with Latent Consistency Models
Huadai Liu
Rongjie Huang
Yang Liu
Hengyuan Cao
Jialei Wang
Xize Cheng
Siqi Zheng
Zhou Zhao
68
8
0
01 Jun 2024
Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Tiancheng Shen
Jun Hao Liew
Long Mai
Lu Qi
Jiashi Feng
Jiaya Jia
DiffM
30
1
0
31 May 2024
Bootstrap3D: Improving 3D Content Creation with Synthetic Data
Zeyi Sun
Tong Wu
Pan Zhang
Yuhang Zang
Xiao-wen Dong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
34
0
0
31 May 2024
MegActor: Harness the Power of Raw Video for Vivid Portrait Animation
Shurong Yang
Huadong Li
Juhao Wu
Minhao Jing
Linze Li
Renhe Ji
Jiajun Liang
Haoqiang Fan
DiffM
VGen
25
14
0
31 May 2024
Information Theoretic Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Massimo Gallo
Pietro Michiardi
72
0
0
31 May 2024
CoSy: Evaluating Textual Explanations of Neurons
Laura Kopf
P. Bommer
Anna Hedström
Sebastian Lapuschkin
Marina M.-C. Höhne
Kirill Bykov
44
7
0
30 May 2024
GECO: Generative Image-to-3D within a SECOnd
Chen Wang
Jiatao Gu
Xiaoxiao Long
Yuan Liu
Lingjie Liu
41
5
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
78
6
0
30 May 2024
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
Massimo Bini
Karsten Roth
Zeynep Akata
Anna Khoreva
29
4
0
30 May 2024
MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models
Lukas Uzolas
E. Eisemann
Petr Kellnhofer
42
1
0
30 May 2024
Streaming Video Diffusion: Online Video Editing with Diffusion Models
Feng Chen
Zhen Yang
Bohan Zhuang
Qi Wu
DiffM
49
4
0
30 May 2024
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
Longwen Zhang
Ziyu Wang
Qixuan Zhang
Qiwei Qiu
Anqi Pang
Haoran Jiang
Wei Yang
Lan Xu
Jingyi Yu
DiffM
AI4CE
VGen
26
116
0
30 May 2024
Text Guided Image Editing with Automatic Concept Locating and Forgetting
Jia Li
Lijie Hu
Zhixian He
Jingfeng Zhang
Tianhang Zheng
Di Wang
DiffM
43
8
0
30 May 2024
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
Haoxing Chen
Yan Hong
Zizheng Huang
Zhuoer Xu
Zhangxuan Gu
...
Jun Lan
Huijia Zhu
Jianfu Zhang
Weiqiang Wang
Huaxiong Li
Mamba
83
14
0
30 May 2024
Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Nicolas Dufour
Victor Besnier
Vicky Kalogeiton
David Picard
DiffM
51
2
0
30 May 2024
Diffusion Policy Attacker: Crafting Adversarial Attacks for Diffusion-based Policies
Yipu Chen
Haotian Xue
Yongxin Chen
AAML
35
4
0
29 May 2024
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning
Ruchika Chavhan
Da Li
Timothy M. Hospedales
41
15
0
29 May 2024
Patch-enhanced Mask Encoder Prompt Image Generation
Shusong Xu
Peiye Liu
DiffM
30
0
0
29 May 2024
AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario
Yuhan Li
Hao Zhou
Wenxiang Shang
Ran Lin
Xuanhong Chen
Bingbing Ni
DiffM
39
3
0
28 May 2024
EG4D: Explicit Generation of 4D Object without Score Distillation
Qi Sun
Zhiyang Guo
Ziyu Wan
Jing Nathan Yan
Shengming Yin
Wen-gang Zhou
Jing Liao
Houqiang Li
VGen
3DGS
32
13
0
28 May 2024
MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
Tianchen Zhao
Xuefei Ning
Tongcheng Fang
En-hao Liu
Guyue Huang
Zinan Lin
Shengen Yan
Guohao Dai
Yu-Xiang Wang
MQ
DiffM
72
18
0
28 May 2024
FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in Text-to-Image Models
Hanjun Luo
Ziye Deng
Ruizhe Chen
Zuo-Qiang Liu
EGVM
40
9
0
28 May 2024
Fast Samplers for Inverse Problems in Iterative Refinement Models
Kushagra Pandey
Ruihan Yang
Stephan Mandt
DiffM
52
3
0
27 May 2024
RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance
JiaoJiao Fan
Haotian Xue
Qinsheng Zhang
Yongxin Chen
32
1
0
27 May 2024
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control
Litu Rout
Yujia Chen
Nataniel Ruiz
Abhishek Kumar
C. Caramanis
Sanjay Shakkottai
Wen-Sheng Chu
DiffM
32
23
0
27 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
65
75
0
27 May 2024
Does Diffusion Beat GAN in Image Super Resolution?
Denis Kuznedelev
Valerii Startsev
Daniil Shlenskii
Sergey Kastryulin
36
4
0
27 May 2024
From Obstacle to Opportunity: Enhancing Semi-supervised Learning with Synthetic Data
Zerun Wang
Jiafeng Mao
Liuyu Xiang
Toshihiko Yamasaki
32
0
0
27 May 2024
Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation
Liang Shi
Jie M. Zhang
Shiguang Shan
PICV
DiffM
48
1
0
27 May 2024
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
C. N. Vasconcelos
Abdullah Rashwan
Austin Waters
Trevor Walker
Keyang Xu
Jimmy Yan
...
Wenlei Zhou
Kevin Swersky
David J. Fleet
Jason Baldridge
Oliver Wang
44
3
0
27 May 2024
Ensembling Diffusion Models via Adaptive Feature Aggregation
Cong Wang
Kuan Tian
Yonghang Guan
Jun Zhang
Zhiwei Jiang
Fei Shen
Xiao Han
42
5
0
27 May 2024
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
Jiannan Huang
Jun Hao Liew
Hanshu Yan
Yuyang Yin
Yao Zhao
Yunchao Wei
DiffM
90
6
0
27 May 2024
Diffusion Bridge AutoEncoders for Unsupervised Representation Learning
Yeongmin Kim
Kwanghyeon Lee
Minsang Park
Byeonghu Na
Il-Chul Moon
DiffM
44
2
0
27 May 2024
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge
Brendan Park
Madeline Janecek
Naser Ezzati-Jivan
Yifeng Li
Ali Emami
37
0
0
25 May 2024
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
Jingyuan Zhu
Shiyu Li
Yuxuan Liu
Ping-Chia Huang
Jiulong Shan
Huimin Ma
Jian Yuan
37
4
0
24 May 2024
FreezeAsGuard: Mitigating Illegal Adaptation of Diffusion Models via Selective Tensor Freezing
Kai Huang
Wei Gao
37
2
0
24 May 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
44
94
0
23 May 2024
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer
Shuang Wu
Youtian Lin
Feihu Zhang
Yifei Zeng
Jingxi Xu
Philip H. S. Torr
Xun Cao
Yao Yao
31
47
0
23 May 2024
Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Shengfang Zhai
Huanran Chen
Yinpeng Dong
Jiajun Li
Qingni Shen
Yansong Gao
Hang Su
Yang Liu
EGVM
59
9
0
23 May 2024
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Ling Yang
Bo-Wen Zeng
Jiaming Liu
Hong Li
Minghao Xu
Wentao Zhang
Shuicheng Yan
DiffM
39
9
0
23 May 2024
Learning Multi-dimensional Human Preference for Text-to-Image Generation
Sixian Zhang
Bohan Wang
Junqiang Wu
Yan Li
Tingting Gao
Di Zhang
Zhongyuan Wang
EGVM
48
29
0
23 May 2024
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
Zhicheng Sun
Zhenhao Yang
Yang Jin
Haozhe Chi
Kun Xu
...
Hao Jiang
Di Zhang
Yang Song
Kun Gai
Yadong Mu
37
3
0
23 May 2024
Explaining Multi-modal Large Language Models by Analyzing their Vision Perception
Loris Giulivi
Giacomo Boracchi
36
2
0
23 May 2024
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
Seyedmorteza Sadat
Jakob Buhmann
Derek Bradley
Otmar Hilliges
Romann M. Weber
49
9
0
23 May 2024
PipeFusion: Displaced Patch Pipeline Parallelism for Inference of Diffusion Transformer Models
Jiannan Wang
Jiarui Fang
Aoyu Li
PengCheng Yang
AI4CE
62
3
0
23 May 2024
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
Yao Teng
Yue Wu
Han Shi
Xuefei Ning
Guohao Dai
Yu-Xiang Wang
Zhenguo Li
Xihui Liu
Mamba
48
33
0
23 May 2024