Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 5,039 papers shown
NDM: A Noise-driven Detection and Mitigation Framework against Implicit Sexual Intentions in Text-to-Image Generation
Yitong Sun
Yao Huang
Ruochen Zhang
Huanran Chen
Shouwei Ruan
Ranjie Duan
Xingxing Wei
DiffM
156
0
0
17 Oct 2025
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Qingyan Bai
Qiuyu Wang
Hao Ouyang
Yue Yu
Hanlin Wang
...
Yanhong Zeng
Zichen Liu
Yinghao Xu
Yujun Shen
Qifeng Chen
VGen
374
11
0
17 Oct 2025
Face-MakeUpV2: Facial Consistency Learning for Controllable Text-to-Image Generation
Dawei Dai
Yinxiu Zhou
Chenghang Li
Guolai Jiang
Chengfang Zhang
139
0
0
17 Oct 2025
Controlling the image generation process with parametric activation functions
Ilia Pavlov
GAN
227
0
0
17 Oct 2025
QSilk: Micrograin Stabilization and Adaptive Quantile Clipping for Detail-Friendly Latent Diffusion
Denis Rychkovskiy
148
0
0
17 Oct 2025
Salient Concept-Aware Generative Data Augmentation
Tianchen Zhao
Xuanbai Chen
Zhihua Li
J. Fang
Dongsheng An
Xiang Xu
Zhuowen Tu
Yifan Xing
DiffM
203
0
0
16 Oct 2025
DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models
Mor Ventura
Michael Toker
Or Patashnik
Yonatan Belinkov
Roi Reichart
168
0
0
16 Oct 2025
Noise Projection: Closing the Prompt-Agnostic Gap Behind Text-to-Image Misalignment in Diffusion Models
Yunze Tong
Didi Zhu
Zijing Hu
Jinluan Yang
Ziyu Zhao
DiffM
VLM
108
0
0
16 Oct 2025
Consistent text-to-image generation via scene de-contextualization
Song Tang
Peihao Gong
Kunyu Li
Kai Guo
Boyu Wang
Mao Ye
Jianwei Zhang
X. Zhu
DiffM
124
0
0
16 Oct 2025
Adaptive Visual Conditioning for Semantic Consistency in Diffusion-Based Story Continuation
Seyed Mohammad Mousavi
Morteza Analoui
DiffM
124
0
0
15 Oct 2025
Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter
Jianhui Zhang
Sheng Cheng
Qirui Sun
Jia Liu
Wang Luyang
Chaoyu Feng
Chen Fang
Lei Lei
Jue Wang
Shuaicheng Liu
DiffM
MDE
235
0
0
15 Oct 2025
End-to-End Multi-Modal Diffusion Mamba
Chunhao Lu
Qiang Lu
Meichen Dong
Jake Luo
134
3
0
15 Oct 2025
NoisePrints: Distortion-Free Watermarks for Authorship in Private Diffusion Models
Nir Goren
Oren Katzir
Abhinav Nakarmi
Eyal Ronen
Mahmood Sharif
Or Patashnik
WIGM
247
0
0
15 Oct 2025
MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars
Felix Taubner
Ruihang Zhang
Mathieu Tuli
Sherwin Bahmani
David B. Lindell
VGen
128
6
0
14 Oct 2025
Time-Correlated Video Bridge Matching
Viacheslav Vasilev
Arseny Ivanov
Nikita Gushchin
Maria Kovaleva
Alexander Korotin
DiffM
98
1
0
14 Oct 2025
Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance
Jincheng Zhong
Boyuan Jiang
Xin Tao
Pengfei Wan
Kun Gai
Mingsheng Long
DiffM
104
0
0
14 Oct 2025
SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models
Weiyang Jin
Yuwei Niu
Jiaqi Liao
Chengqi Duan
Aoxue Li
Shenghua Gao
Xihui Liu
LRM
208
4
0
14 Oct 2025
Massive Activations are the Key to Local Detail Synthesis in Diffusion Transformers
Chaofan Gan
Zicheng Zhao
Yuanpeng Tu
Xi Chen
Ziran Qin
Yun Xu
Mehrtash Harandi
W. Lin
158
1
0
13 Oct 2025
VLM-Guided Adaptive Negative Prompting for Creative Generation
Shelly Golan
Yotam Nitzan
Zongze Wu
Or Patashnik
DiffM
144
0
0
12 Oct 2025
Local-Global Context-Aware and Structure-Preserving Image Super-Resolution
Sanchar Palit
S. Chaudhuri
Biplab Banerjee
SupR
274
0
0
11 Oct 2025
Few-shot multi-token DreamBooth with LoRa for style-consistent character generation
Ruben Pascual
Mikel Sesma-Sara
A. Jurio
D. Paternain
M. Galar
DiffM
VGen
101
0
0
10 Oct 2025
Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking
Mohammad Hossein Sameti
Sepehr Harfi Moridani
Ali Zarean
Hossein Sameti
184
6
0
10 Oct 2025
Cross-Sensor Touch Generation
Samanta Rodriguez
Yiming Dou
Miquel Oller
Andrew Owens
Nima Fazeli
DiffM
103
0
0
10 Oct 2025
GTAlign: Game-Theoretic Alignment of LLM Assistants for Social Welfare
Siqi Zhu
David Zhang
Pedro Cisneros-Velarde
J. You
LRM
204
0
0
10 Oct 2025
Reinforcing Diffusion Models by Direct Group Preference Optimization
Yihong Luo
Tianyang Hu
Jing Tang
145
1
0
09 Oct 2025
OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
Yuzhe Gu
Xiyu Liang
Jiaojiao Zhao
Enmao Diao
136
2
0
09 Oct 2025
InstructUDrag: Joint Text Instructions and Object Dragging for Interactive Image Editing
Haoran Yu
Yi Shi
DiffM
157
0
0
09 Oct 2025
FreqCa: Accelerating Diffusion Models via Frequency-Aware Caching
Jiacheng Liu
Peiliang Cai
Qinming Zhou
Yuqi Lin
Deyang Kong
...
Haowen Xu
Chang Zou
J. Tang
S. Zheng
Linfeng Zhang
103
1
0
09 Oct 2025
UniVideo: Unified Understanding, Generation, and Editing for Videos
Cong Wei
Quande Liu
Zixuan Ye
Qiulin Wang
Xintao Wang
Pengfei Wan
Kun Gai
Wenhu Chen
VGen
261
14
0
09 Oct 2025
Graph Conditioned Diffusion for Controllable Histopathology Image Generation
Sarah Cechnicka
Matthew Baugh
Weitong Zhang
Mischa Dombrowski
Zhe Li
Johannes C. Paetzold
C. Roufosse
Bernhard Kainz
DiffM
MedIm
98
0
0
08 Oct 2025
Toward Reliable Clinical Coding with Language Models: Verification and Lightweight Adaptation
Zhangdie Yuan
Han-Chin Shing
Mitch Strong
Chaitanya P. Shivade
122
0
0
08 Oct 2025
Inconsistent Affective Reaction: Sentiment of Perception and Opinion in Urban Environments
CAADRIA proceedings (CAADRIA), 2025
Jingfei Huang
Han Tu
204
0
0
08 Oct 2025
Sparse deepfake detection promotes better disentanglement
Antoine Teissier
Marie Tahon
Nicolas Dugué
Aghilas Sini
209
1
0
07 Oct 2025
Mitigating Surgical Data Imbalance with Dual-Prediction Video Diffusion Model
Danush Kumar Venkatesh
Adam Schmidt
Muhammad Abdullah Jamal
Omid Mohareri
VGen
MedIm
143
0
0
07 Oct 2025
Redefining Generalization in Visual Domains: A Two-Axis Framework for Fake Image Detection with FusionDetect
Amirtaha Amanzadi
Zahra Dehghanian
Hamid Beigy
Hamid R. Rabiee
239
0
0
07 Oct 2025
Teleportraits: Training-Free People Insertion into Any Scene
Jialu Gao
K J Joseph
Fernando de la Torre
DiffM
110
0
0
07 Oct 2025
Teamwork: Collaborative Diffusion with Low-rank Coordination and Adaptation
Sam Sartor
Pieter Peers
DiffM
160
1
0
07 Oct 2025
SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder
Ronen Kamenetsky
Sara Dorfman
Daniel Garibi
Roni Paiss
Or Patashnik
Daniel Cohen-Or
DiffM
315
0
0
06 Oct 2025
Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation
Zijing Hu
Yunze Tong
Fengda Zhang
Junkun Yuan
Jun Xiao
Kun Kuang
DiffM
188
1
0
06 Oct 2025
Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning
Xiaomeng Fan
Yuchuan Mao
Zhi Gao
Yuwei Wu
Jin Chen
Yunde Jia
160
1
0
06 Oct 2025
Self Speculative Decoding for Diffusion Large Language Models
Yifeng Gao
Ziang Ji
Y. Wang
Biqing Qi
Hanlin Xu
Linfeng Zhang
DiffM
LRM
312
4
0
05 Oct 2025
MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator
Xuehai He
Shijie Zhou
Thivyanth Venkateswaran
Kaizhi Zheng
Ziyu Wan
A. Kadambi
Xin Eric Wang
VGen
SyDa
AI4CE
163
0
0
05 Oct 2025
Let Features Decide Their Own Solvers: Hybrid Feature Caching for Diffusion Transformers
Shikang Zheng
Guantao Chen
Qinming Zhou
Yuqi Lin
Lixuan He
Chang Zou
Peiliang Cai
Jiacheng Liu
Linfeng Zhang
147
2
0
05 Oct 2025
Variational Diffusion Unlearning: A Variational Inference Framework for Unlearning in Diffusion Models under Data Constraints
Subhodip Panda
MS Varun
Shreyans Jain
Sarthak Kumar Maharana
Prathosh A.P.
DiffM
266
0
0
05 Oct 2025
Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models
Leander Girrbach
Stephan Alaniz
Genevieve Smith
Trevor Darrell
Zeynep Akata
202
3
0
04 Oct 2025
Mirage: Unveiling Hidden Artifacts in Synthetic Images with Large Vision-Language Models
Pranav Sharma
Shivank Garg
Durga Toshniwal
VLM
120
0
0
04 Oct 2025
Paris: A Decentralized Trained Open-Weight Diffusion Model
Zhiying Jiang
Raihan Seraj
Marcos Villagra
Bidhan Roy
MoE
86
0
0
03 Oct 2025
HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion
Shiyi Zhang
Dong Liang
Bingsheng Huang
Yihang Zhou
DiffM
230
0
0
03 Oct 2025
PocketSR: The Super-Resolution Expert in Your Pocket Mobiles
Haoze Sun
Linfeng Jiang
Fan Li
Renjing Pei
Zhixin Wang
...
Wei Xu
Jin Han
Fenglong Song
Yujiu Yang
Wenbo Li
DiffM
SupR
OffRL
302
2
0
03 Oct 2025
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
Zihan Zhou
Shilin Lu
Shuli Leng
Shaocong Zhang
Zhuming Lian
Xinlei Yu
A. Kong
DiffM
304
7
0
02 Oct 2025
Previous
1
2
3
4
5
...
99
100
101
Next