ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01952
  4. Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
ArXivPDFHTML

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,616 papers shown
Title
Long-CLIP: Unlocking the Long-Text Capability of CLIP
Long-CLIP: Unlocking the Long-Text Capability of CLIP
Beichen Zhang
Pan Zhang
Xiao-wen Dong
Yuhang Zang
Jiaqi Wang
CLIP
VLM
34
108
0
22 Mar 2024
DreamFlow: High-Quality Text-to-3D Generation by Approximating
  Probability Flow
DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow
Kyungmin Lee
Kihyuk Sohn
Jinwoo Shin
40
19
0
22 Mar 2024
STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
Yifei Zeng
Yanqin Jiang
Siyu Zhu
Yuanxun Lu
Youtian Lin
Hao Zhu
Weiming Hu
Xun Cao
Yao Yao
3DGS
71
45
0
22 Mar 2024
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
Alberto Baldrati
Davide Morelli
Marcella Cornia
Marco Bertini
Rita Cucchiara
DiffM
62
8
0
21 Mar 2024
ReNoise: Real Image Inversion Through Iterative Noising
ReNoise: Real Image Inversion Through Iterative Noising
Daniel Garibi
Or Patashnik
Andrey Voynov
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
34
52
0
21 Mar 2024
Implicit Style-Content Separation using B-LoRA
Implicit Style-Content Separation using B-LoRA
Yarden Frenkel
Yael Vinker
Ariel Shamir
Daniel Cohen-Or
MoMe
OffRL
50
40
0
21 Mar 2024
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified &
  Accurate Image Editing
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing
Yueru Jia
Yuhui Yuan
Aosong Cheng
Chuke Wang
Ji Li
Huizhu Jia
Shanghang Zhang
DiffM
31
7
0
21 Mar 2024
Open-Vocabulary Attention Maps with Token Optimization for Semantic
  Segmentation in Diffusion Models
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models
Pablo Marcos-Manchón
Roberto Alcover-Couso
Juan C. Sanmiguel
Jose M. Martínez
VLM
42
18
0
21 Mar 2024
ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer
ZoDi: Zero-Shot Domain Adaptation with Diffusion-Based Image Transfer
Hiroki Azuma
Yusuke Matsui
Atsuto Maki
VLM
34
1
0
20 Mar 2024
Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion
  Models with Noisy Data
Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy Data
Giannis Daras
Alexandros G. Dimakis
Constantinos Daskalakis
36
20
0
20 Mar 2024
ReGround: Improving Textual and Spatial Grounding at No Cost
ReGround: Improving Textual and Spatial Grounding at No Cost
Yuseung Lee
Minhyuk Sung
DiffM
26
2
0
20 Mar 2024
Deepfake Detection without Deepfakes: Generalization via Synthetic
  Frequency Patterns Injection
Deepfake Detection without Deepfakes: Generalization via Synthetic Frequency Patterns Injection
D. Coccomini
R. Caldelli
Claudio Gennaro
Giuseppe Fiameni
Giuseppe Amato
Fabrizio Falchi
29
1
0
20 Mar 2024
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in
  Text-to-Image Generation
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation
Jingkun An
Yinghao Zhu
Zongjian Li
Haoran Feng
Bohua Chen
Yemin Shi
Chengwei Pan
32
2
0
20 Mar 2024
Building Optimal Neural Architectures using Interpretable Knowledge
Building Optimal Neural Architectures using Interpretable Knowledge
Keith G. Mills
Fred X. Han
Mohammad Salameh
Shengyao Lu
Chunhua Zhou
Jiao He
Fengyu Sun
Di Niu
18
1
0
20 Mar 2024
VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis
VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis
Yumeng Li
William H. Beluch
M. Keuper
Dan Zhang
Anna Khoreva
DiffM
VGen
76
5
0
20 Mar 2024
Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence
  Alignment
Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment
Mengting Chen
Xi Chen
Zhonghua Zhai
Chen Ju
Xuewen Hong
Jinsong Lan
Shuai Xiao
OOD
DiffM
40
21
0
19 Mar 2024
FouriScale: A Frequency Perspective on Training-Free High-Resolution
  Image Synthesis
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
Linjiang Huang
Rongyao Fang
Aiping Zhang
Guanglu Song
Si Liu
Yu Liu
Hongsheng Li
DiffM
25
22
0
19 Mar 2024
You Only Sample Once: Taming One-Step Text-to-Image Synthesis by
  Self-Cooperative Diffusion GANs
You Only Sample Once: Taming One-Step Text-to-Image Synthesis by Self-Cooperative Diffusion GANs
Yihong Luo
Xiaolong Chen
Xinghua Qu
Jing Tang
53
6
0
19 Mar 2024
Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model
Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model
Jiajie Yang
32
0
0
19 Mar 2024
ContextVis: Envision Contextual Learning and Interaction with Generative
  Models
ContextVis: Envision Contextual Learning and Interaction with Generative Models
Bo Shui
Chufan Shi
Yujiu Yang
Xiaomei Nie
29
1
0
19 Mar 2024
LASPA: Latent Spatial Alignment for Fast Training-free Single Image
  Editing
LASPA: Latent Spatial Alignment for Fast Training-free Single Image Editing
Yazeed Alharbi
Peter Wonka
DiffM
30
0
0
19 Mar 2024
Can AI Outperform Human Experts in Creating Social Media Creatives?
Can AI Outperform Human Experts in Creating Social Media Creatives?
Eunkyung Park
Raymond K. Wong
Junbum Kwon
36
0
0
19 Mar 2024
Synthetic Image Generation in Cyber Influence Operations: An Emergent
  Threat?
Synthetic Image Generation in Cyber Influence Operations: An Emergent Threat?
Melanie Mathys
Marco Willi
Michael Graber
Raphael Meier
32
2
0
18 Mar 2024
Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and
  Detail
Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail
Mingjin Chen
Junhao Chen
Xiaojun Ye
Huan-ang Gao
Xiaoxue Chen
Zhaoxin Fan
Hao Zhao
3DH
29
11
0
18 Mar 2024
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D
  Generation
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation
Yushi Lan
Fangzhou Hong
Shuai Yang
Shangchen Zhou
Xuyi Meng
Bo Dai
Xingang Pan
Chen Change Loy
40
39
0
18 Mar 2024
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion
  Distillation
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation
Axel Sauer
Frederic Boesel
Tim Dockhorn
A. Blattmann
Patrick Esser
Robin Rombach
DiffM
31
107
0
18 Mar 2024
DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot
  Video Editing
DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing
Hyeonho Jeong
Jinho Chang
Geon Yeong Park
Jong Chul Ye
DiffM
VGen
27
13
0
18 Mar 2024
Infinite-ID: Identity-preserved Personalization via ID-semantics
  Decoupling Paradigm
Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm
Yi Wu
Ziqiang Li
Heliang Zheng
Chaoyue Wang
Bin Li
DiffM
55
17
0
18 Mar 2024
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept
  Customization in Training-Free Diffusion Models
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models
Yang Yang
Wen Wang
Liang Peng
Chaotian Song
Yao Chen
...
Xiaolong Yang
Qinglin Lu
Deng Cai
Boxi Wu
Wei Liu
MoMe
64
24
0
18 Mar 2024
EffiVED:Efficient Video Editing via Text-instruction Diffusion Models
EffiVED:Efficient Video Editing via Text-instruction Diffusion Models
Zhenghao Zhang
Zuozhuo Dai
Long Qin
Weizhi Wang
DiffM
VGen
34
2
0
18 Mar 2024
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data
Paul S. Scotti
Mihir Tripathy
Cesar Kadir Torrico Villanueva
Reese Kneeland
Tong Chen
...
Charan Santhirasegaran
Jonathan Xu
Thomas Naselaris
Kenneth A. Norman
Tanishq Mathew Abraham
40
35
0
17 Mar 2024
3D Human Reconstruction in the Wild with Synthetic Data Using Generative
  Models
3D Human Reconstruction in the Wild with Synthetic Data Using Generative Models
Yongtao Ge
Wenjia Wang
Yongfan Chen
Hao Chen
Chunhua Shen
3DH
32
8
0
17 Mar 2024
Source Prompt Disentangled Inversion for Boosting Image Editability with
  Diffusion Models
Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Rui Li
Ruihuang Li
Song Guo
Lei Zhang
DiffM
35
7
0
17 Mar 2024
Zippo: Zipping Color and Transparency Distributions into a Single
  Diffusion Model
Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model
Kangyang Xie
Binbin Yang
Hao Chen
Meng Wang
Cheng Zou
Hui Xue
Ming Yang
Chunhua Shen
DiffM
30
1
0
17 Mar 2024
Just Say the Name: Online Continual Learning with Category Names Only
  via Data Generation
Just Say the Name: Online Continual Learning with Category Names Only via Data Generation
Minhyuk Seo
Diganta Misra
Seongwon Cho
Minjae Lee
Jonghyun Choi
CLL
33
7
0
16 Mar 2024
Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving
  Conditional Human Image Generation
Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation
Anton Pelykh
Ozge Mercanoglu
Richard Bowden
DiffM
31
5
0
15 Mar 2024
LightIt: Illumination Modeling and Control for Diffusion Models
LightIt: Illumination Modeling and Control for Diffusion Models
Peter Kocsis
Julien Philip
Kalyan Sunkavalli
Matthias Nießner
Yannick Hold-Geoffroy
29
21
0
15 Mar 2024
Generalized Predictive Model for Autonomous Driving
Generalized Predictive Model for Autonomous Driving
Jiazhi Yang
Shenyuan Gao
Yihang Qiu
Li Chen
Tianyu Li
...
Ping Luo
Jun Zhang
Andreas Geiger
Yu Qiao
Hongyang Li
VGen
71
57
0
14 Mar 2024
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering
Zeyu Liu
Weicong Liang
Zhanhao Liang
Chong Luo
Ji Li
Gao Huang
Yuhui Yuan
DiffM
64
25
0
14 Mar 2024
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based
  Real Image Editing
Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing
Wonjun Kang
Kevin Galim
Hyung Il Koo
DiffM
31
5
0
14 Mar 2024
Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion
  Models for Sparse-view CT Reconstruction
Mitigating Data Consistency Induced Discrepancy in Cascaded Diffusion Models for Sparse-view CT Reconstruction
Hanyu Chen
Zhixiu Hao
Lin Guo
Liying Xiao
39
1
0
14 Mar 2024
Rethinking Referring Object Removal
Rethinking Referring Object Removal
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
32
0
0
14 Mar 2024
The First to Know: How Token Distributions Reveal Hidden Knowledge in
  Large Vision-Language Models?
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
Qinyu Zhao
Ming Xu
Kartik Gupta
Akshay Asthana
Liang Zheng
Stephen Gould
29
7
0
14 Mar 2024
Explore In-Context Segmentation via Latent Diffusion Models
Explore In-Context Segmentation via Latent Diffusion Models
Chaoyang Wang
Xiangtai Li
Henghui Ding
Lu Qi
Jiangning Zhang
Yunhai Tong
Chen Change Loy
Shuicheng Yan
DiffM
63
6
0
14 Mar 2024
Tackling the Singularities at the Endpoints of Time Intervals in
  Diffusion Models
Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
Pengze Zhang
Hubery Yin
Chen Li
Xiaohua Xie
35
5
0
13 Mar 2024
Visual Decoding and Reconstruction via EEG Embeddings with Guided
  Diffusion
Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion
Dongyang Li
Chen Wei
Shiying Li
Jiachen Zou
Quanying Liu
DiffM
29
18
0
12 Mar 2024
Block-wise LoRA: Revisiting Fine-grained LoRA for Effective
  Personalization and Stylization in Text-to-Image Generation
Block-wise LoRA: Revisiting Fine-grained LoRA for Effective Personalization and Stylization in Text-to-Image Generation
Likun Li
Haoqi Zeng
Changpeng Yang
Haozhe Jia
Di Xu
DiffM
32
4
0
12 Mar 2024
AesopAgent: Agent-driven Evolutionary System on Story-to-Video
  Production
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production
Jiuniu Wang
Zehua Du
Yuyuan Zhao
Bo Yuan
Kexiang Wang
...
Yihen Lu
Gengliang Li
Junlong Gao
Xin Tu
Zhenyu Guo
LLMAG
VGen
36
7
0
12 Mar 2024
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model
Yuxuan Zhang
Lifu Wei
Qing Zhang
Yiren Song
DiffM
31
12
0
12 Mar 2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with
  Auto-Generated Data
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Jialu Li
Jaemin Cho
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
MoMe
DiffM
39
8
0
11 Mar 2024
Previous
123...252627...313233
Next