ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2307.01952
  4. Cited By
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

4 July 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
ArXivPDFHTML

Papers citing "SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis"

50 / 1,616 papers shown
Title
FSViewFusion: Few-Shots View Generation of Novel Objects
FSViewFusion: Few-Shots View Generation of Novel Objects
Rukhshanda Hussain
Hui Xian Grace Lim
Borchun Chen
Mubarak Shah
Ser Nam Lim
DiffM
33
0
0
11 Mar 2024
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level
  Annotation
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Xiaobin Hu
Xu Peng
Donghao Luo
Xiaozhong Ji
Jinlong Peng
Zhengkai Jiang
Jiangning Zhang
Taisong Jin
Chengjie Wang
Rongrong Ji
DiffM
24
4
0
10 Mar 2024
VideoElevator: Elevating Video Generation Quality with Versatile
  Text-to-Image Diffusion Models
VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
Yabo Zhang
Yuxiang Wei
Xianhui Lin
Zheng Hui
Peiran Ren
Xuansong Xie
Xiangyang Ji
Wangmeng Zuo
VGen
38
6
0
08 Mar 2024
Towards Effective Usage of Human-Centric Priors in Diffusion Models for
  Text-based Human Image Generation
Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
Junyan Wang
Zhenhong Sun
Zhiyu Tan
Xuanbai Chen
Weihua Chen
Hao Li
Cheng Zhang
Yang Song
35
9
0
08 Mar 2024
Improving Diffusion Models for Virtual Try-on
Improving Diffusion Models for Virtual Try-on
Yisol Choi
Sangkyung Kwak
Kyungmin Lee
Hyungwon Choi
Jinwoo Shin
DiffM
30
21
0
08 Mar 2024
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Xiwei Hu
Rui Wang
Yixiao Fang
Bin-Bin Fu
Pei Cheng
Gang Yu
VLM
57
39
0
08 Mar 2024
Evaluating Text-to-Image Generative Models: An Empirical Study on Human
  Image Synthesis
Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis
Mu-Hwa Chen
Yi Liu
Jian Yi
Changran Xu
Qiuxia Lai
Hongliang Wang
Tsung-Yi Ho
Qiang Xu
EGVM
27
7
0
08 Mar 2024
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion
Wendi Zheng
Jiayan Teng
Zhuoyi Yang
Weihan Wang
Jidong Chen
Xiaotao Gu
Yuxiao Dong
Ming Ding
Jie Tang
DiffM
19
34
0
08 Mar 2024
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K
  Text-to-Image Generation
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Junsong Chen
Chongjian Ge
Enze Xie
Yue Wu
Lewei Yao
Xiaozhe Ren
Zhongdao Wang
Ping Luo
Huchuan Lu
Zhenguo Li
128
86
0
07 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
67
35
0
07 Mar 2024
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on
  Noise Cropping and Merging
NoiseCollage: A Layout-Aware Text-to-Image Diffusion Model Based on Noise Cropping and Merging
Takahiro Shirakawa
Seiichi Uchida
DiffM
35
15
0
06 Mar 2024
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Patrick Esser
Sumith Kulal
A. Blattmann
Rahim Entezari
Jonas Muller
...
Zion English
Kyle Lacey
Alex Goodwin
Yannik Marek
Robin Rombach
DiffM
86
1,058
0
05 Mar 2024
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal
  Datasets
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
Hossein Aboutalebi
Hwanjun Song
Yusheng Xie
Arshit Gupta
Justin Sun
Hang Su
Igor Shalyminov
Nikolaos Pappas
Siffi Singh
Saab Mansour
DiffM
EGVM
46
4
0
05 Mar 2024
Behavior Generation with Latent Actions
Behavior Generation with Latent Actions
Seungjae Lee
Yibin Wang
Haritheja Etukuru
H. J. Kim
Mahi Shafiullah
Lerrel Pinto
VGen
OffRL
27
64
0
05 Mar 2024
Tuning-Free Noise Rectification for High Fidelity Image-to-Video
  Generation
Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation
Weijie Li
Litong Gong
Yiran Zhu
Fanda Fan
Biao Wang
Tiezheng Ge
Bo Zheng
VGen
DiffM
33
2
0
05 Mar 2024
Position: Towards Implicit Prompt For Text-To-Image Models
Position: Towards Implicit Prompt For Text-To-Image Models
Yue Yang
Yuqi Lin
Hong Liu
Wenqi Shao
Runjian Chen
Hailong Shang
Yu Wang
Yu Qiao
Kaipeng Zhang
Ping Luo
EGVM
VLM
33
2
0
04 Mar 2024
AtomoVideo: High Fidelity Image-to-Video Generation
AtomoVideo: High Fidelity Image-to-Video Generation
Litong Gong
Yiran Zhu
Weijie Li
Xiaoyang Kang
Biao Wang
Tiezheng Ge
Bo Zheng
DiffM
VGen
122
12
0
04 Mar 2024
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable
  Virtual Try-on
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Yuhao Xu
Tao Gu
Weifeng Chen
Chengcai Chen
DiffM
27
49
0
04 Mar 2024
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
Jiaxiang Cheng
Pan Xie
Xin Xia
Jiashi Li
Jie Wu
Yuxi Ren
Huixia Li
Xuefeng Xiao
Min Zheng
Lean Fu
33
12
0
04 Mar 2024
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts
Yijun Yang
Ruiyuan Gao
Xiao Yang
Jianyuan Zhong
Qiang Xu
30
15
0
03 Mar 2024
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation
Hongjian Liu
Qingsong Xie
Zhijie Deng
Chen Chen
Shixiang Tang
Fueyang Fu
Zheng-Jun Zha
H. Lu
Zheng-jun Zha
41
6
0
03 Mar 2024
DistriFusion: Distributed Parallel Inference for High-Resolution
  Diffusion Models
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Muyang Li
Tianle Cai
Jiaxin Cao
Qinsheng Zhang
Han Cai
Junjie Bai
Yangqing Jia
Ming-Yu Liu
Kai Li
Song Han
DiffM
29
41
0
29 Feb 2024
Trajectory Consistency Distillation: Improved Latent Consistency
  Distillation by Semi-Linear Consistency Function with Trajectory Mapping
Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping
Jianbin Zheng
Minghui Hu
Zhongyi Fan
Chaoyue Wang
Changxing Ding
Dacheng Tao
Tat-Jen Cham
35
26
0
29 Feb 2024
From Summary to Action: Enhancing Large Language Models for Complex
  Tasks with Open World APIs
From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs
Yulong Liu
Yunlong Yuan
Chunwei Wang
Jianhua Han
Yongqiang Ma
Li Zhang
Nanning Zheng
Hang Xu
LLMAG
24
5
0
28 Feb 2024
Chaining text-to-image and large language model: A novel approach for
  generating personalized e-commerce banners
Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners
Shanu Vashishtha
Abhinav Prakash
Lalitesh Morishetti
Kaushiki Nag
Yokila Arora
Sushant Kumar
Kannan Achan
DiffM
16
4
0
28 Feb 2024
On the Challenges and Opportunities in Generative AI
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
Vincent Fortuin
56
17
0
28 Feb 2024
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized
  Diffusion Models
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
Shyam Marjit
Harshit Singh
Nityanand Mathur
Sayak Paul
Chia-Mu Yu
Pin-Yu Chen
DiffM
36
2
0
27 Feb 2024
Sora: A Review on Background, Technology, Limitations, and Opportunities
  of Large Vision Models
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Yixin Liu
Kai Zhang
Yuan Li
Zhiling Yan
Chujie Gao
...
Yue Huang
Hanchi Sun
Jianfeng Gao
Lifang He
Lichao Sun
VLM
VGen
EGVM
68
257
0
27 Feb 2024
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence
  Generation
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation
Zongying Lin
Hao Li
Liuzhenghao Lv
Lin Bin
Junwu Zhang
Calvin Yu-Chian Chwn
Li Yuan
Tian Yonghong
24
3
0
27 Feb 2024
Transparent Image Layer Diffusion using Latent Transparency
Transparent Image Layer Diffusion using Latent Transparency
Lvmin Zhang
Maneesh Agrawala
29
41
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
Liangliang Cao
EGVM
66
85
0
27 Feb 2024
Contextualized Diffusion Models for Text-Guided Image and Video
  Generation
Contextualized Diffusion Models for Text-Guided Image and Video Generation
Ling Yang
Zhilong Zhang
Zhaochen Yu
Jingwei Liu
Minkai Xu
Stefano Ermon
Bin Cui
36
4
0
26 Feb 2024
Referee Can Play: An Alternative Approach to Conditional Generation via
  Model Inversion
Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion
Xuantong Liu
Tianyang Hu
Wenjia Wang
Kenji Kawaguchi
Yuan Yao
DiffM
70
3
0
26 Feb 2024
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept
  Composition
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
Chun-Hsiao Yeh
Ta-Ying Cheng
He-Yen Hsieh
Chuan-En Lin
Yi Ma
Andrew Markham
Niki Trigoni
H. T. Kung
Yubei Chen
DiffM
25
3
0
23 Feb 2024
Generative Models are Self-Watermarked: Declaring Model Authentication
  through Re-Generation
Generative Models are Self-Watermarked: Declaring Model Authentication through Re-Generation
Aditya Desu
Xuanli He
Qiongkai Xu
Wei Lu
WIGM
24
1
0
23 Feb 2024
Visual Hallucinations of Multi-modal Large Language Models
Visual Hallucinations of Multi-modal Large Language Models
Wen Huang
Hongbin Liu
Minxin Guo
Neil Zhenqiang Gong
MLLM
VLM
32
24
0
22 Feb 2024
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with
  Trajectory Stitching
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching
Zizheng Pan
Bohan Zhuang
De-An Huang
Weili Nie
Zhiding Yu
Chaowei Xiao
Jianfei Cai
A. Anandkumar
28
17
0
21 Feb 2024
SDXL-Lightning: Progressive Adversarial Diffusion Distillation
SDXL-Lightning: Progressive Adversarial Diffusion Distillation
Shanchuan Lin
Anran Wang
Xiao Yang
29
116
0
21 Feb 2024
A Unified Framework and Dataset for Assessing Societal Bias in
  Vision-Language Models
A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models
Ashutosh Sathe
Prachi Jain
Sunayana Sitaram
50
1
0
21 Feb 2024
Visual Style Prompting with Swapping Self-Attention
Visual Style Prompting with Swapping Self-Attention
Jaeseok Jeong
Junho Kim
Yunjey Choi
Gayoung Lee
Youngjung Uh
DiffM
40
39
0
20 Feb 2024
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image
  Diffusion Models
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
Xinchen Zhang
Ling Yang
Yaqi Cai
Zhaochen Yu
Kai-Ni Wang
...
Ye Tian
Minkai Xu
Yong Tang
Yujiu Yang
Bin Cui
DiffM
27
5
0
20 Feb 2024
SDXL Finetuned with LoRA for Coloring Therapy: Generating Graphic
  Templates Inspired by United Arab Emirates Culture
SDXL Finetuned with LoRA for Coloring Therapy: Generating Graphic Templates Inspired by United Arab Emirates Culture
Abdulla Alfalasi
Esrat Khan
Mohamed Alhashmi
Raed Aldweik
Davor Svetinovic
19
0
0
20 Feb 2024
MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object
  Diffusion
MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion
Sen Li
Ruochen Wang
Cho-Jui Hsieh
Minhao Cheng
Tianyi Zhou
MLLM
LM&Ro
40
3
0
20 Feb 2024
From Cloud to Edge: Rethinking Generative AI for Low-Resource Design
  Challenges
From Cloud to Edge: Rethinking Generative AI for Low-Resource Design Challenges
Sai Krishna Revanth Vuruma
Ashley Margetts
Jianhai Su
Faez Ahmed
Biplav Srivastava
25
5
0
20 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRM
VLM
49
41
0
19 Feb 2024
Universal Prompt Optimizer for Safe Text-to-Image Generation
Universal Prompt Optimizer for Safe Text-to-Image Generation
Zongyu Wu
Hongcheng Gao
Yueze Wang
Xiang Zhang
Suhang Wang
EGVM
10
9
0
16 Feb 2024
MRPD: Undersampled MRI reconstruction by prompting a large latent
  diffusion model
MRPD: Undersampled MRI reconstruction by prompting a large latent diffusion model
Student Member Ieee Ziqi Gao
F. I. S. Kevin Zhou
MedIm
32
3
0
16 Feb 2024
How People Prompt to Create Interactive VR Scenes
How People Prompt to Create Interactive VR Scenes
Setareh Aghel Manesh
Tianyi Zhang
Yuki Onishi
Kotaro Hara
Scott Bateman
Jiannan Li
Anthony Tang
19
12
0
16 Feb 2024
Make a Cheap Scaling: A Self-Cascade Diffusion Model for
  Higher-Resolution Adaptation
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Lanqing Guo
Yin-Yin He
Haoxin Chen
Menghan Xia
Xiaodong Cun
...
Yong Zhang
Xintao Wang
Qifeng Chen
Ying Shan
Bihan Wen
30
23
0
16 Feb 2024
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Huizhuo Yuan
Zixiang Chen
Kaixuan Ji
Quanquan Gu
57
24
0
15 Feb 2024
Previous
123...262728...313233
Next