Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 5,041 papers shown
ShaLa: Multimodal Shared Latent Space Modelling
Jiali Cui
Yan-Ying Chen
Yanxia Zhang
M. Klenk
148
0
0
24 Aug 2025
Neural Stochastic Differential Equations on Compact State-Spaces
Yue-Jane Liu
Malinda Lu
Matthew K. Nock
Yaniv Yacoby
139
0
0
23 Aug 2025
Delta-SVD: Efficient Compression for Personalized Text-to-Image Models
Tangyuan Zhang
Shangyu Chen
Qixiang Chen
Jianfei Cai
110
0
0
23 Aug 2025
HiCache: A Plug-in Scaled-Hermite Upgrade for Taylor-Style Cache-then-Forecast Diffusion Acceleration
Liang Feng
Shikang Zheng
Jiacheng Liu
Yuqi Lin
Qinming Zhou
...
Xinyu Wang
Junjie Chen
Chang Zou
Yue Ma
Linfeng Zhang
DiffM
153
3
0
23 Aug 2025
Forecast then Calibrate: Feature Caching as ODE for Efficient Diffusion Transformers
Shikang Zheng
Liang Feng
Xinyu Wang
Qinming Zhou
Peiliang Cai
...
Jiacheng Liu
Yuqi Lin
Junjie Chen
Yue Ma
Linfeng Zhang
129
5
0
22 Aug 2025
Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning
Xuan Zhang
Zhijian Zhou
Weidi Xu
Yanting Miao
Chao Qu
Yuan Qi
NAI
182
0
0
22 Aug 2025
PromptFlare: Prompt-Generalized Defense via Cross-Attention Decoy in Diffusion-Based Inpainting
Hohyun Na
Seunghoo Hong
Simon S. Woo
AAML
DiffM
128
0
0
22 Aug 2025
On the Collapse Errors Induced by the Deterministic Sampler for Diffusion Models
Yi Zhang
Zhenyu Liao
Jingfeng Wu
Difan Zou
DiffM
193
1
0
22 Aug 2025
Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars
NVIDIA
Chaeyeon Chung
Ilya Fedorov
Michael Huang
Aleksey Karmanov
Dmitry Korobchenko
Roger Ribera
Yeongho Seol
CVBM
259
2
0
22 Aug 2025
Scaling Group Inference for Diverse and High-Quality Generation
Gaurav Parmar
Or Patashnik
Daniil Ostashev
Kuan-Chieh Wang
Kfir Aberman
Srinivasa Narasimhan
Jun-Yan Zhu
181
2
0
21 Aug 2025
MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion
Fei Peng
Junqiang Wu
Yan Li
Tingting Gao
Di Zhang
Huiyuan Fu
DiffM
165
2
0
20 Aug 2025
Generative AI models capture realistic sea-ice evolution from days to decades
Tobias S. Finn
Marc Bocquet
Pierre Rampal
Charlotte Durand
Flavia Porro
A. Farchi
A. Carrassi
AI4CE
146
2
0
20 Aug 2025
CTA-Flux: Integrating Chinese Cultural Semantics into High-Quality English Text-to-Image Communities
Yue Gong
Shanyuan Liu
Liuzhuozheng Li
Jian Zhu
Bo Cheng
Liebucha Wu
Xiaoyu Wu
Yuhang Ma
Dawei Leng
Yuhui Yin
DiffM
236
0
0
20 Aug 2025
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Shanlin Sun
Yifan Wang
Hanwen Zhang
Yifeng Xiong
Qin Ren
Ruogu Fang
Xiaohui Xie
Chenyu You
174
4
0
20 Aug 2025
Virtual Multiplex Staining for Histological Images using a Marker-wise Conditioned Diffusion Model
Hyun-Jic Oh
Junsik Kim
Zhiyi Shi
Yichen Wu
Yu-An Chen
P. Sorger
Hanspeter Pfister
Won-Ki Jeong
MedIm
249
1
0
20 Aug 2025
SAGA: Learning Signal-Aligned Distributions for Improved Text-to-Image Generation
P. Grimal
Michael Soumm
Hervé Le Borgne
Olivier Ferret
Akihiro Sugimoto
DiffM
171
0
0
19 Aug 2025
Single-Reference Text-to-Image Manipulation with Dual Contrastive Denoising Score
Syed Muhmmad Israr
Feng Zhao
DiffM
155
0
0
18 Aug 2025
7Bench: a Comprehensive Benchmark for Layout-guided Text-to-image Models
Elena Izzo
Luca Parolari
Davide Vezzaro
Lamberto Ballan
112
0
0
18 Aug 2025
DualFit: A Two-Stage Virtual Try-On via Warping and Synthesis
Minh-Trieu Tran
Johnmark Clements
Annie Prasanna
T. A. Nguyen
Ngan Le
DiffM
105
0
0
16 Aug 2025
SafeCtrl: Region-Based Safety Control for Text-to-Image Diffusion via Detect-Then-Suppress
Lingyun Zhang
Yu Xie
Yanwei Fu
Ping Chen
DiffM
126
0
0
16 Aug 2025
SPG: Style-Prompting Guidance for Style-Specific Content Creation
Qian Liang
Zichong Chen
Yang Zhou
Hui Huang
DiffM
131
0
0
15 Aug 2025
LEARN: A Story-Driven Layout-to-Image Generation Framework for STEM Instruction
Maoquan Zhang
Bisser Raytchev
Xiujuan Sun
DiffM
99
0
0
15 Aug 2025
StyleMM: Stylized 3D Morphable Face Model via Text-Driven Aligned Image Translation
Seungmi Lee
Kwan Yun
Junyong Noh
3DH
136
0
0
15 Aug 2025
TimeMachine: Fine-Grained Facial Age Editing with Identity Preservation
Yilin Mi
Qixin Yan
Zheng-Peng Duan
Chunle Guo
Hubery Yin
Hao Liu
Chen Li
Chongyi Li
DiffM
163
0
0
15 Aug 2025
Remove360: Benchmarking Residuals After Object Removal in 3D Gaussian Splatting
Simona Kocour
Assia Benbihi
Torsten Sattler
3DPC
131
0
0
15 Aug 2025
Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models
H. J. Lee
Suhyung Choi
Byoung-Tak Zhang
Inwoo Hwang
194
0
0
14 Aug 2025
Object Fidelity Diffusion for Remote Sensing Image Generation
Ziqi Ye
Shuran Ma
Jie Yang
Xiaoyi Yang
Ziyang Gong
Xue Yang
Haipeng Wang
Haipeng Wang
DiffM
222
1
0
14 Aug 2025
High Fidelity Text to Image Generation with Contrastive Alignment and Structural Guidance
Danyi Gao
96
11
0
14 Aug 2025
Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models
Eunseo Koh
Seunghoo Hong
Tae-Young Kim
Simon S. Woo
Jae-Pil Heo
DiffM
282
0
0
14 Aug 2025
NanoControl: A Lightweight Framework for Precise and Efficient Control in Diffusion Transformer
Shanyuan Liu
Jian Zhu
Junda Lu
Yue Gong
Liuzhuozheng Li
...
Yuhang Ma
Liebucha Wu
Xiaoyu Wu
Dawei Leng
Yuhui Yin
79
1
0
14 Aug 2025
CountCluster: Training-Free Object Quantity Guidance with Cross-Attention Map Clustering for Text-to-Image Generation
Joohyeon Lee
Jin-Seop Lee
Jee-Hyong Lee
118
0
0
14 Aug 2025
A Survey on Diffusion Language Models
Tianyi Li
Mingda Chen
Bowei Guo
Zhiqiang Shen
323
37
0
14 Aug 2025
OneVAE: Joint Discrete and Continuous Optimization Helps Discrete Video VAE Train Better
Yupeng Zhou
Zhen Li
Ziheng Ouyang
Yuming Chen
Ruoyi Du
...
Bin Fu
Yihao Liu
Peng Gao
Ming-Ming Cheng
Qibin Hou
238
1
0
13 Aug 2025
Exploring the Equivalence of Closed-Set Generative and Real Data Augmentation in Image Classification
Haowen Wang
Guowei Zhang
Xiang Zhang
Zeyuan Chen
Haiyang Xu
Dou Hoon Kwark
Zhuowen Tu
159
0
0
13 Aug 2025
Security Analysis of ChatGPT: Threats and Privacy Risks
Yushan Xiang
Zhongwen Li
Xiaoqi Li
SILM
183
8
0
13 Aug 2025
Prototype-Guided Diffusion: Visual Conditioning without External Memory
Hanane Azzag
Hanane Azzag
M. Lebbah
DiffM
VLM
296
0
0
13 Aug 2025
Story2Board: A Training-Free Approach for Expressive Storyboard Generation
David Dinkevich
Matan Levy
Omri Avrahami
Dvir Samuel
Dani Lischinski
DiffM
133
4
0
13 Aug 2025
Animate-X++: Universal Character Image Animation with Dynamic Backgrounds
Shuai Tan
Biao Gong
Zhuoxin Liu
Yan Wang
Xi Chen
Yifan Feng
Hengshuang Zhao
VGen
277
4
0
13 Aug 2025
Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy
Hao Yu
Rupayan Mallick
Margrit Betke
Sarah Adel Bargal
DiffM
90
0
0
13 Aug 2025
Per-Query Visual Concept Learning
Ori Malca
Dvir Samuel
Gal Chechik
DiffM
VLM
120
0
0
12 Aug 2025
X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents
Guoxian Song
Hongyi Xu
Xiaochen Zhao
You Xie
Tianpei Gu
Zenan Li
Chenxu Zhang
Linjie Luo
VGen
120
5
0
12 Aug 2025
Enhancing Small-Scale Dataset Expansion with Triplet-Connection-based Sample Re-Weighting
Ting Xiang
Changjian Chen
Zhuo Tang
Qifeng Zhang
Fei Lyu
Li Yang
Jiapeng Zhang
KenLi Li
MedIm
165
0
0
11 Aug 2025
Learning User Preferences for Image Generation Model
Wenyi Mo
Ying Ba
Tianyu Zhang
Yalong Bai
Biye Li
DiffM
89
2
0
11 Aug 2025
S^2VG: 3D Stereoscopic and Spatial Video Generation via Denoising Frame Matrix
Peng Dai
Feitong Tan
Qiangeng Xu
Yihua Huang
David Futschik
Ruofei Du
S. Fanello
Yinda Zhang
Xiaojuan Qi
VGen
137
0
0
11 Aug 2025
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
Joonghyuk Shin
Alchan Hwang
Yujin Kim
Daneul Kim
Jaesik Park
DiffM
124
4
0
11 Aug 2025
Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion
Minseo Kim
Minchan Kwon
Dongyeun Lee
Yunho Jeon
Junmo Kim
DiffM
92
0
0
11 Aug 2025
Undress to Redress: A Training-Free Framework for Virtual Try-On
Ruoyao Xiao
Junhao Wu
Yeying Jin
Daiheng Gao
Yun Ji
...
Hao Xu
Kai Chen
Bruce Gu
Nana Wang
Zhaoxin Fan
DiffM
137
0
0
11 Aug 2025
Tailored Emotional LLM-Supporter: Enhancing Cultural Sensitivity
Chen Cecilia Liu
Hiba Arnaout
Nils Kovačić
Dana Atzil-Slonim
Iryna Gurevych
116
0
0
11 Aug 2025
Efficient Approximate Posterior Sampling with Annealed Langevin Monte Carlo
Advait Parulekar
Litu Rout
Karthikeyan Shanmugam
Sanjay Shakkottai
184
1
0
11 Aug 2025
LaRender: Training-Free Occlusion Control in Image Generation via Latent Rendering
Xiaohang Zhan
Dingming Liu
DiffM
136
2
0
11 Aug 2025
Previous
1
2
3
...
7
8
9
...
99
100
101
Next
Page 8 of 101
Page
of 101
Go