Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.01952
Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"
50 / 1,616 papers shown
Title
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
Zijie Pan
Jiachen Lu
Xiatian Zhu
Li Zhang
DiffM
26
11
0
19 Oct 2023
Object-aware Inversion and Reassembly for Image Editing
Zhen Yang
Dinggang Gui
Wen Wang
Hao Chen
Bohan Zhuang
Chunhua Shen
DiffM
22
14
0
18 Oct 2023
Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Qichao Wang
Tian Bian
Yian Yin
Tingyang Xu
Hong Cheng
Helen M. Meng
Zibin Zheng
Liang Chen
Bingzhe Wu
VLM
DiffM
25
3
0
18 Oct 2023
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Yaofang Liu
Xiaodong Cun
Xuebo Liu
Xintao Wang
Yong Zhang
Haoxin Chen
Yang Liu
Tieyong Zeng
Raymond H. F. Chan
Ying Shan
VGen
EGVM
11
127
0
17 Oct 2023
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu
Liangyu Chen
Tong Yang
Chunle Guo
Chongyi Li
Xiangyu Zhang
DiffM
VGen
86
52
0
16 Oct 2023
LOVECon: Text-driven Training-Free Long Video Editing with ControlNet
Zhenyi Liao
Zhijie Deng
DiffM
19
7
0
15 Oct 2023
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Xian Liu
Jian Ren
Aliaksandr Siarohin
Ivan Skorokhodov
Yanyu Li
Dahua Lin
Xihui Liu
Ziwei Liu
Sergey Tulyakov
32
57
0
12 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
13
22
0
12 Oct 2023
XIMAGENET-12: An Explainable AI Benchmark Dataset for Model Robustness Evaluation
Qiang Li
Dan Zhang
Shengzhao Lei
Xun Zhao
Porawit Kamnoedboon
WeiWei Li
Junhao Dong
Shuyan Li
VLM
22
1
0
12 Oct 2023
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Jie An
Zhengyuan Yang
Linjie Li
Jianfeng Wang
K. Lin
Zicheng Liu
Lijuan Wang
Jiebo Luo
14
11
0
11 Oct 2023
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
Yin-Yin He
Shaoshu Yang
Haoxin Chen
Xiaodong Cun
Menghan Xia
Yong Zhang
Xintao Wang
Ran He
Qifeng Chen
Ying Shan
34
71
0
11 Oct 2023
Mitigating stereotypical biases in text to image generative systems
Piero Esposito
Parmida Atighehchian
Anastasis Germanidis
Deepti Ghadiyaram
25
16
0
10 Oct 2023
JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling
Jingyang Zhang
Shiwei Li
Yuanxun Lu
Tian Fang
David McKinnon
Yanghai Tsin
Long Quan
Yao Yao
18
10
0
10 Oct 2023
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
22
1
0
09 Oct 2023
Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models
Jianglong Ye
Peng Wang
Kejie Li
Yichun Shi
Heng Wang
DiffM
27
72
0
04 Oct 2023
Predicated Diffusion: Predicate Logic-Based Attention Guidance for Text-to-Image Diffusion Models
Kota Sueyoshi
Takashi Matsubara
DiffM
8
8
0
03 Oct 2023
TWIZ-v2: The Wizard of Multimodal Conversational-Stimulus
Rafael Ferreira
Diogo Tavares
Diogo Glória-Silva
Rodrigo Valerio
João Bordalo
Ines Simoes
Vasco Ramos
David Semedo
João Magalhães
12
4
0
03 Oct 2023
Prompt-tuning latent diffusion models for inverse problems
Hyungjin Chung
Jong Chul Ye
P. Milanfar
M. Delbracio
DiffM
22
40
0
02 Oct 2023
PixArt-
α
α
α
: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
28
385
0
30 Sep 2023
Leveraging Optimization for Adaptive Attacks on Image Watermarks
Nils Lukas
Abdulrahman Diaa
L. Fenaux
Florian Kerschbaum
WIGM
6
24
0
29 Sep 2023
Towards Few-Call Model Stealing via Active Self-Paced Knowledge Distillation and Diffusion-Based Image Generation
Vlad Hondru
Radu Tudor Ionescu
DiffM
32
1
0
29 Sep 2023
Text-to-3D using Gaussian Splatting
Manish Sharma
Moitreya Chatterjee
Yikai Wang
Huaping Liu
3DGS
20
223
0
28 Sep 2023
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
Xiaoliang Dai
Ji Hou
Chih-Yao Ma
Sam S. Tsai
Jialiang Wang
...
Roshan Sumbaly
Vignesh Ramanathan
Zijian He
Peter Vajda
Devi Parikh
VLM
17
198
0
27 Sep 2023
DECORAIT -- DECentralized Opt-in/out Registry for AI Training
Karthika Balan
Alexander Black
Simon Jenni
Andrew Gilbert
Andy Parsons
John Collomosse
16
7
0
25 Sep 2023
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
Jiahao Xie
Wei Li
Xiangtai Li
Ziwei Liu
Yew-Soon Ong
Chen Change Loy
DiffM
VLM
60
35
0
22 Sep 2023
DreamLLM: Synergistic Multimodal Comprehension and Creation
Runpei Dong
Chunrui Han
Yuang Peng
Zekun Qi
Zheng Ge
...
Hao-Ran Wei
Xiangwen Kong
Xiangyu Zhang
Kaisheng Ma
Li Yi
MLLM
34
168
0
20 Sep 2023
FreeU: Free Lunch in Diffusion U-Net
Chenyang Si
Ziqi Huang
Yuming Jiang
Ziwei Liu
DiffM
33
128
0
20 Sep 2023
On Copyright Risks of Text-to-Image Diffusion Models
Yang Zhang
Teoh Tze Tzun
Lim Wei Hern
Haonan Wang
Kenji Kawaguchi
42
9
0
15 Sep 2023
TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild
Huayang Li
Siheng Li
Deng Cai
Longyue Wang
Lemao Liu
Taro Watanabe
Yujiu Yang
Shuming Shi
MLLM
47
17
0
14 Sep 2023
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
Xingchao Liu
Xiwen Zhang
Jianzhu Ma
Jian Peng
Qiang Liu
91
193
0
12 Sep 2023
Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts
Zhi-Yi Chin
Chieh-Ming Jiang
Ching-Chun Huang
Pin-Yu Chen
Wei-Chen Chiu
DiffM
11
65
0
12 Sep 2023
NExT-GPT: Any-to-Any Multimodal LLM
Shengqiong Wu
Hao Fei
Leigang Qu
Wei Ji
Tat-Seng Chua
MLLM
46
449
0
11 Sep 2023
AdBooster: Personalized Ad Creative Generation using Stable Diffusion Outpainting
Veronika Shilova
Ludovic Dos Santos
Flavian Vasile
Gaetan Racic
Ugo Tanielian
DiffM
11
7
0
08 Sep 2023
Chasing Consistency in Text-to-3D Generation from a Single Image
Yichen Ouyang
Wenhao Chai
Jiayi Ye
Dapeng Tao
Yibing Zhan
Gaoang Wang
DiffM
18
15
0
07 Sep 2023
Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities
Shanyuan Liu
Dawei Leng
Yuhui Yin
DiffM
13
7
0
02 Sep 2023
MVDream: Multi-view Diffusion for 3D Generation
Yichun Shi
Peng Wang
Jianglong Ye
Mai Long
Kejie Li
X. Yang
20
588
0
31 Aug 2023
Manipulating Embeddings of Stable Diffusion Prompts
Niklas Deckers
Julia Peters
Martin Potthast
DiffM
32
9
0
23 Aug 2023
DUAW: Data-free Universal Adversarial Watermark against Stable Diffusion Customization
Xiaoyu Ye
Hao Huang
Jiaqi An
Yongtao Wang
WIGM
24
22
0
19 Aug 2023
Dynamic Attention-Guided Diffusion for Image Super-Resolution
Brian B. Moser
Stanislav Frolov
Federico Raue
Sebastián M. Palacio
Andreas Dengel
DiffM
22
3
0
15 Aug 2023
COMICS: End-to-end Bi-grained Contrastive Learning for Multi-face Forgery Detection
Cong Zhang
H. Qi
Shuhui Wang
Yuezun Li
Siwei Lyu
CVBM
20
6
0
03 Aug 2023
Revisiting DETR Pre-training for Object Detection
Yan Ma
Weicong Liang
Bo-Ying Chen
Yiduo Hao
Bojian Hou
Xiangyu Yue
Chao Zhang
Yuhui Yuan
VLM
ViT
25
4
0
02 Aug 2023
General Purpose Artificial Intelligence Systems (GPAIS): Properties, Definition, Taxonomy, Societal Implications and Responsible Governance
I. Triguero
Daniel Molina
Javier Poyatos
Javier Del Ser
Francisco Herrera
AI4TS
AI4MH
26
5
0
26 Jul 2023
Objaverse-XL: A Universe of 10M+ 3D Objects
Matt Deitke
Ruoshi Liu
Matthew Wallingford
Huong Ngo
Oscar Michel
...
Carl Vondrick
Georgia Gkioxari
Kiana Ehsani
Ludwig Schmidt
Ali Farhadi
20
379
0
11 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
18
781
0
10 Jul 2023
JourneyDB: A Benchmark for Generative Image Understanding
Keqiang Sun
Junting Pan
Yuying Ge
Hao Li
Haodong Duan
...
Yi Wang
Jifeng Dai
Yu Qiao
Limin Wang
Hongsheng Li
31
101
0
03 Jul 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffM
VLM
23
39
0
01 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
37
20
0
01 Jun 2023
Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models
Pablo Pernias
Dominic Rampas
Mats L. Richter
Christopher Pal
Marc Aubreville
DiffM
VLM
11
42
0
01 Jun 2023
Addressing Negative Transfer in Diffusion Models
Hyojun Go
Jinyoung Kim
Yunsung Lee
Seunghyun Lee
Shinhyeok Oh
Hyeongdon Moon
Seungtaek Choi
DiffM
VLM
16
24
0
01 Jun 2023
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images
Viraj Prabhu
Sriram Yenamandra
Prithvijit Chattopadhyay
Judy Hoffman
15
37
0
30 May 2023
Previous
1
2
3
...
31
32
33
Next