Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022

23 May 2022

Seyed Kamyar Seyed Ghasemipour

Burcu Karagol Ayan

S. S. Mahdavi

Raphael Gontijo-Lopes

David J Fleet

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,040 papers shown

Diffusion Models at the Drug Discovery Frontier: A Review on Generating Small Molecules versus Therapeutic Peptides

410

31 Oct 2025

Understanding the Implicit User Intention via Reasoning with Large Language Model for Image Editing

145

31 Oct 2025

Foundation Models for Trajectory Planning in Autonomous Driving: A Review of Progress and Open Challenges

31 Oct 2025

Generative Semantic Coding for Ultra-Low Bitrate Visual Communication and Analysis

134

31 Oct 2025

BlurGuard: A Simple Approach for Robustifying Image Protection Against AI-Powered Editing

215

31 Oct 2025

H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models

214

31 Oct 2025

From Evidence to Verdict: An Agent-Based Forensic Framework for AI-Generated Image Detection

189

31 Oct 2025

Optimal Convergence Analysis of DDPM for General Distributions

Yuchen Jiao

Yuchen Zhou

Gen Li

295

31 Oct 2025

Who Made This? Fake Detection and Source Attribution with Diffusion Features

S. Bonechi

P. Andreini

Barbara Toniella Corradini

DiffM

160

31 Oct 2025

Group-Equivariant Diffusion Models for Lattice Field Theory

282

30 Oct 2025

Improving Temporal Consistency and Fidelity at Inference-time in Perceptual Video Restoration by Zero-shot Image-based Diffusion Models

Nasrin Rahimi

A. Murat Tekalp

DiffM VGen

149

29 Oct 2025

LGCC: Enhancing Flow Matching Based Text-Guided Image Editing with Local Gaussian Coupling and Context Consistency

29 Oct 2025

PSTF-AttControl: Per-Subject-Tuning-Free Personalized Image Generation with Controllable Face Attributes

Image and Vision Computing (IVC), 2025

105

29 Oct 2025

MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

376

29 Oct 2025

ScaleDiff: Higher-Resolution Image Synthesis via Efficient and Model-Agnostic Diffusion

207

29 Oct 2025

Training-Free Safe Text Embedding Guidance for Text-to-Image Diffusion Models

228

28 Oct 2025

Decoupled MeanFlow: Turning Flow Models into Flow Maps for Accelerated Sampling

242

28 Oct 2025

AutoPrompt: Automated Red-Teaming of Text-to-Image Models via LLM-Driven Adversarial Prompts

480

28 Oct 2025

Diffusion Adaptive Text Embedding for Text-to-Image Diffusion Models

230

28 Oct 2025

Information-Theoretic Discrete Diffusion

181

28 Oct 2025

An efficient probabilistic hardware architecture for diffusion-like models

Guillaume Verdon

Trevor McCourt

DiffM

206

28 Oct 2025

More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models

517

27 Oct 2025

Adaptive Stochastic Coefficients for Accelerating Diffusion Sampling

170

27 Oct 2025

T2I-RiskyPrompt: A Benchmark for Safety Evaluation, Attack, and Defense on Text-to-Image Model

286

25 Oct 2025

Blockwise Flow Matching: Improving Flow Matching Models For Efficient High-Quality Generation

122

24 Oct 2025

BADiff: Bandwidth Adaptive Diffusion Model

117

24 Oct 2025

Improved Training Technique for Shortcut Models

220

24 Oct 2025

FairImagen: Post-Processing for Bias Mitigation in Text-to-Image Models

112

24 Oct 2025

FlowOpt: Fast Optimization Through Whole Flow Processes for Training-Free Editing

Or Ronai

Vladimir Kulikov

T. Michaeli

165

24 Oct 2025

TerraGen: A Unified Multi-Task Layout Generation Framework for Remote Sensing Data Augmentation

131

24 Oct 2025

Restore Text First, Enhance Image Later: Two-Stage Scene Text Image Super-Resolution with Glyph Structure Guidance

194

24 Oct 2025

Towards a Golden Classifier-Free Guidance Path via Foresight Fixed Point Iterations

127

24 Oct 2025

UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset

164

23 Oct 2025

BadGraph: A Backdoor Attack Against Latent Diffusion Model for Text-Guided Graph Generation

141

23 Oct 2025

EditInfinity: Image Editing with Binary-Quantized Generative Models

218

23 Oct 2025

CUPID: Generative 3D Reconstruction via Joint Object and Pose Modeling

168

23 Oct 2025

RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling

...

153

23 Oct 2025

EchoDistill: Bidirectional Concept Distillation for One-Step Diffusion Personalization

165

23 Oct 2025

StableSketcher: Enhancing Diffusion Model for Pixel-based Sketch Generation via Visual Question Answering Feedback

124

23 Oct 2025

GenColorBench: A Color Evaluation Benchmark for Text-to-Image Generation Models

Muhammad Atif Butt

Alexandra Gomez-Villa

Tao Wu

Javier Vázquez-Corral

Joost van de Weijer

Kai Wang

EGVM VLM

183

23 Oct 2025

Exposing Blindspots: Cultural Bias Evaluation in Generative Image Models

...

203

22 Oct 2025

A Frequentist Statistical Introduction to Variational Inference, Autoencoders, and Diffusion Models

Yen-Chi Chen

DiffM BDL

195

21 Oct 2025

Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback

172

21 Oct 2025

Beyond Frequency: Scoring-Driven Debiasing for Object Detection via Blueprint-Prompted Image Synthesis

170

21 Oct 2025

Fine-tuning Flow Matching Generative Models with Intermediate Feedback

161

20 Oct 2025

GAS: Improving Discretization of Diffusion ODEs via Generalized Adversarial Solver

165

20 Oct 2025

Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models

100

20 Oct 2025

Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling

Erik Riise

Mehmet Onurcan Kaya

Dim P. Papadopoulos

308

19 Oct 2025

Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models

136

18 Oct 2025

NDM: A Noise-driven Detection and Mitigation Framework against Implicit Sexual Intentions in Text-to-Image Generation

157

17 Oct 2025