arXiv: 2307.01952
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Müller
Joe Penna
Robin Rombach

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,616 papers shown
Title
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Junhao Cheng
Xi Lu
Hanhui Li
Khun Loun Zai
Baiqiao Yin
Yuhao Cheng
Yiqiang Yan
Xiaodan Liang
DiffM
VGen
37
10
0
03 Jun 2024
fruit-SALAD: A Style Aligned Artwork Dataset to reveal similarity perception in image embeddings
Tillmann Ohm
Andres Karjus
Mikhail Tamm
Maximilian Schich
36
1
0
03 Jun 2024
Dimba: Transformer-Mamba Diffusion Models
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Youqiang Zhang
Junshi Huang
Mamba
62
16
0
03 Jun 2024
Improving Text Generation on Images with Synthetic Captions
Jun Young Koh
Sang Hyun Park
Joy Song
DiffM
51
2
0
01 Jun 2024
AudioLCM: Text-to-Audio Generation with Latent Consistency Models
Huadai Liu
Rongjie Huang
Yang Liu
Hengyuan Cao
Jialei Wang
Xize Cheng
Siqi Zheng
Zhou Zhao
68
8
0
01 Jun 2024
Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Tiancheng Shen
Jun Hao Liew
Long Mai
Lu Qi
Jiashi Feng
Jiaya Jia
DiffM
30
1
0
31 May 2024
Bootstrap3D: Improving 3D Content Creation with Synthetic Data
Zeyi Sun
Tong Wu
Pan Zhang
Yuhang Zang
Xiao-wen Dong
Yuanjun Xiong
Dahua Lin
Jiaqi Wang
34
0
0
31 May 2024
MegActor: Harness the Power of Raw Video for Vivid Portrait Animation
Shurong Yang
Huadong Li
Juhao Wu
Minhao Jing
Linze Li
Renhe Ji
Jiajun Liang
Haoqiang Fan
DiffM
VGen
25
14
0
31 May 2024
Information Theoretic Text-to-Image Alignment
Chao Wang
Giulio Franzese
A. Finamore
Massimo Gallo
Pietro Michiardi
72
0
0
31 May 2024
CoSy: Evaluating Textual Explanations of Neurons
Laura Kopf
P. Bommer
Anna Hedström
Sebastian Lapuschkin
Marina M.-C. Höhne
Kirill Bykov
44
7
0
30 May 2024
GECO: Generative Image-to-3D within a SECOnd
Chen Wang
Jiatao Gu
Xiaoxiao Long
Yuan Liu
Lingjie Liu
41
5
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
78
6
0
30 May 2024
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections
Massimo Bini
Karsten Roth
Zeynep Akata
Anna Khoreva
29
4
0
30 May 2024
MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models
Lukas Uzolas
E. Eisemann
Petr Kellnhofer
42
1
0
30 May 2024
Streaming Video Diffusion: Online Video Editing with Diffusion Models
Feng Chen
Zhen Yang
Bohan Zhuang
Qi Wu
DiffM
49
4
0
30 May 2024
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets
Longwen Zhang
Ziyu Wang
Qixuan Zhang
Qiwei Qiu
Anqi Pang
Haoran Jiang
Wei Yang
Lan Xu
Jingyi Yu
DiffM
AI4CE
VGen
26
116
0
30 May 2024
Text Guided Image Editing with Automatic Concept Locating and Forgetting
Jia Li
Lijie Hu
Zhixian He
Jingfeng Zhang
Tianhang Zheng
Di Wang
DiffM
43
8
0
30 May 2024
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
Haoxing Chen
Yan Hong
Zizheng Huang
Zhuoer Xu
Zhangxuan Gu
...
Jun Lan
Huijia Zhu
Jianfu Zhang
Weiqiang Wang
Huaxiong Li
Mamba
83
14
0
30 May 2024
Don't drop your samples! Coherence-aware training benefits Conditional diffusion
Nicolas Dufour
Victor Besnier
Vicky Kalogeiton
David Picard
DiffM
51
2
0
30 May 2024
Diffusion Policy Attacker: Crafting Adversarial Attacks for Diffusion-based Policies
Yipu Chen
Haotian Xue
Yongxin Chen
AAML
35
4
0
29 May 2024
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning
Ruchika Chavhan
Da Li
Timothy M. Hospedales
41
15
0
29 May 2024
Patch-enhanced Mask Encoder Prompt Image Generation
Shusong Xu
Peiye Liu
DiffM
30
0
0
29 May 2024
AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario
Yuhan Li
Hao Zhou
Wenxiang Shang
Ran Lin
Xuanhong Chen
Bingbing Ni
DiffM
39
3
0
28 May 2024
EG4D: Explicit Generation of 4D Object without Score Distillation
Qi Sun
Zhiyang Guo
Ziyu Wan
Jing Nathan Yan
Shengming Yin
Wen-gang Zhou
Jing Liao
Houqiang Li
VGen
3DGS
32
13
0
28 May 2024
MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
Tianchen Zhao
Xuefei Ning
Tongcheng Fang
En-hao Liu
Guyue Huang
Zinan Lin
Shengen Yan
Guohao Dai
Yu-Xiang Wang
MQ
DiffM
72
18
0
28 May 2024
FAIntbench: A Holistic and Precise Benchmark for Bias Evaluation in Text-to-Image Models
Hanjun Luo
Ziye Deng
Ruizhe Chen
Zuo-Qiang Liu
EGVM
40
9
0
28 May 2024
Fast Samplers for Inverse Problems in Iterative Refinement Models
Kushagra Pandey
Ruihan Yang
Stephan Mandt
DiffM
52
3
0
27 May 2024
RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance
JiaoJiao Fan
Haotian Xue
Qinsheng Zhang
Yongxin Chen
32
1
0
27 May 2024
RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control
Litu Rout
Yujia Chen
Nataniel Ruiz
Abhishek Kumar
C. Caramanis
Sanjay Shakkottai
Wen-Sheng Chu
DiffM
32
23
0
27 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
65
75
0
27 May 2024
Does Diffusion Beat GAN in Image Super Resolution?
Denis Kuznedelev
Valerii Startsev
Daniil Shlenskii
Sergey Kastryulin
36
4
0
27 May 2024
From Obstacle to Opportunity: Enhancing Semi-supervised Learning with Synthetic Data
Zerun Wang
Jiafeng Mao
Liuyu Xiang
Toshihiko Yamasaki
32
0
0
27 May 2024
Anonymization Prompt Learning for Facial Privacy-Preserving Text-to-Image Generation
Liang Shi
Jie M. Zhang
Shiguang Shan
PICV
DiffM
48
1
0
27 May 2024
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
C. N. Vasconcelos
Abdullah Rashwan
Austin Waters
Trevor Walker
Keyang Xu
Jimmy Yan
...
Wenlei Zhou
Kevin Swersky
David J. Fleet
Jason Baldridge
Oliver Wang
44
3
0
27 May 2024
Ensembling Diffusion Models via Adaptive Feature Aggregation
Cong Wang
Kuan Tian
Yonghang Guan
Jun Zhang
Zhiwei Jiang
Fei Shen
Xiao Han
42
5
0
27 May 2024
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
Jiannan Huang
Jun Hao Liew
Hanshu Yan
Yuyang Yin
Yao Zhao
Yunchao Wei
DiffM
90
6
0
27 May 2024
Diffusion Bridge AutoEncoders for Unsupervised Representation Learning
Yeongmin Kim
Kwanghyeon Lee
Minsang Park
Byeonghu Na
Il-Chul Moon
DiffM
44
2
0
27 May 2024
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge
Brendan Park
Madeline Janecek
Naser Ezzati-Jivan
Yifeng Li
Ali Emami
37
0
0
25 May 2024
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
Jingyuan Zhu
Shiyu Li
Yuxuan Liu
Ping-Chia Huang
Jiulong Shan
Huimin Ma
Jian Yuan
37
4
0
24 May 2024
FreezeAsGuard: Mitigating Illegal Adaptation of Diffusion Models via Selective Tensor Freezing
Kai Huang
Wei Gao
37
2
0
24 May 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
44
94
0
23 May 2024
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer
Shuang Wu
Youtian Lin
Feihu Zhang
Yifei Zeng
Jingxi Xu
Philip H. S. Torr
Xun Cao
Yao Yao
31
47
0
23 May 2024
Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Shengfang Zhai
Huanran Chen
Yinpeng Dong
Jiajun Li
Qingni Shen
Yansong Gao
Hang Su
Yang Liu
EGVM
59
9
0
23 May 2024
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Ling Yang
Bo-Wen Zeng
Jiaming Liu
Hong Li
Minghao Xu
Wentao Zhang
Shuicheng Yan
DiffM
39
9
0
23 May 2024
Learning Multi-dimensional Human Preference for Text-to-Image Generation
Sixian Zhang
Bohan Wang
Junqiang Wu
Yan Li
Tingting Gao
Di Zhang
Zhongyuan Wang
EGVM
48
29
0
23 May 2024
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
Zhicheng Sun
Zhenhao Yang
Yang Jin
Haozhe Chi
Kun Xu
...
Hao Jiang
Di Zhang
Yang Song
Kun Gai
Yadong Mu
37
3
0
23 May 2024
Explaining Multi-modal Large Language Models by Analyzing their Vision Perception
Loris Giulivi
Giacomo Boracchi
36
2
0
23 May 2024
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
Seyedmorteza Sadat
Jakob Buhmann
Derek Bradley
Otmar Hilliges
Romann M. Weber
49
9
0
23 May 2024
PipeFusion: Displaced Patch Pipeline Parallelism for Inference of Diffusion Transformer Models
Jiannan Wang
Jiarui Fang
Aoyu Li
PengCheng Yang
AI4CE
62
3
0
23 May 2024
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
Yao Teng
Yue Wu
Han Shi
Xuefei Ning
Guohao Dai
Yu-Xiang Wang
Zhenguo Li
Xihui Liu
Mamba
48
33
0
23 May 2024