Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022

23 May 2022

Seyed Kamyar Seyed Ghasemipour

Burcu Karagol Ayan

S. S. Mahdavi

Raphael Gontijo-Lopes

David J Fleet

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,041 papers shown

MagicMix: Semantic Mixing with Diffusion Models

375

28 Oct 2022

UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance

...

505

28 Oct 2022

Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation with Wordless TrainingComputer Vision and Pattern Recognition (CVPR), 2022

386

28 Oct 2022

Deep Generative Models on 3D Representations: A Survey

322

27 Oct 2022

How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

212

126

27 Oct 2022

DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

534

397

26 Oct 2022

Categorical SDEs with Simplex Diffusion

Pierre Harvey Richemond

Sander Dieleman

Arnaud Doucet

DiffM

209

26 Oct 2022

Full-band General Audio Synthesis with Score-based DiffusionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

225

26 Oct 2022

Towards the Detection of Diffusion Model Deepfakes

382

138

26 Oct 2022

Lafite2: Few-shot Text-to-Image Generation

204

25 Oct 2022

Vitruvio: 3D Building Meshes via Single Perspective Sketches

265

24 Oct 2022

Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models

Sharan Narang

Pieter Abbeel

KELM CLL

255

24 Oct 2022

High-Resolution Image Editing via Multi-Stage Blended Diffusion

J. Ackermann

Minjun Li

DiffM

145

24 Oct 2022

Instance-Aware Image Completion

195

22 Oct 2022

Tools for Extracting Spatio-Temporal Patterns in Meteorological Image Sequences: From Feature Engineering to Attention-Based Neural Networks

294

22 Oct 2022

Z-LaVI: Zero-Shot Language Solver Fueled by Visual ImaginationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Wenlin Yao

224

21 Oct 2022

Conditional Diffusion with Less Explicit Guidance via Model Predictive Control

175

21 Oct 2022

Boomerang: Local sampling on image manifolds using diffusion models

Lorenzo Luzi

P. Mayer

Josue Casco-Rodriguez

Ali Siahkoohi

Richard G. Baraniuk

DiffM

356

21 Oct 2022

3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows

279

156

20 Oct 2022

Composing Ensembles of Pre-trained Models via Iterative ConsensusInternational Conference on Learning Representations (ICLR), 2022

Shuang Li

Antonio Torralba

162

20 Oct 2022

DiffEdit: Diffusion-based semantic image editing with mask guidanceInternational Conference on Learning Representations (ICLR), 2022

395

661

20 Oct 2022

OCR-VQGAN: Taming Text-within-Image GenerationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

David Vazquez

276

19 Oct 2022

Language Models Understand Us, PoorlyConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Jared Moore

LRM

169

19 Oct 2022

DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image ModelsBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022

Royi Rassin

Shauli Ravfogel

Yoav Goldberg

201

19 Oct 2022

Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models

Ricardo Kleinlein

Cristina Luna Jiménez

Fernando Fernández-Martínez

DiffM

146

19 Oct 2022

Differentially Private Diffusion Models

486

129

18 Oct 2022

Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation

369

18 Oct 2022

UniTune: Text-Driven Image Editing by Fine Tuning a Diffusion Model on a Single ImageACM Transactions on Graphics (TOG), 2022

Yossi Matias

256

17 Oct 2022

Imagic: Text-Based Real Image Editing with Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022

586

1,340

17 Oct 2022

DiffuSeq: Sequence to Sequence Text Generation with Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2022

Shansan Gong

Mukai Li

Jiangtao Feng

Zhiyong Wu

Lingpeng Kong

429

459

17 Oct 2022

LAION-5B: An open large-scale dataset for training next generation image-text modelsNeural Information Processing Systems (NeurIPS), 2022

...

1.0K

4,555

16 Oct 2022

TransFusion: Transcribing Speech with Multinomial Diffusion

14 Oct 2022

Is synthetic data from generative models ready for image recognition?International Conference on Learning Representations (ICLR), 2022

Shuyang Sun

Xiaojuan Qi

497

379

14 Oct 2022

MTEB: Massive Text Embedding BenchmarkConference of the European Chapter of the Association for Computational Linguistics (EACL), 2022

Niklas Muennighoff

Nouamane Tazi

L. Magne

Nils Reimers

1.0K

686

13 Oct 2022

The Hidden Uniform Cluster Prior in Self-Supervised LearningInternational Conference on Learning Representations (ICLR), 2022

Pascal Vincent

231

13 Oct 2022

DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation ModelsConference on Computer and Communications Security (CCS), 2022

Zheng Li

218

202

13 Oct 2022

ImaginaryNet: Learning Object Detectors without Real Images and AnnotationsInternational Conference on Learning Representations (ICLR), 2022

242

13 Oct 2022

Compute-Efficient Deep Learning: Algorithmic Trends and OpportunitiesJournal of machine learning research (JMLR), 2022

Brian Bartoldson

B. Kailkhura

Davis W. Blalock

317

13 Oct 2022

Self-Guided Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022

391

12 Oct 2022

LION: Latent Point Diffusion Models for 3D Shape GenerationNeural Information Processing Systems (NeurIPS), 2022

Sanja Fidler

358

626

12 Oct 2022

Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image ManipulationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Nojun Kwak

163

12 Oct 2022

Underspecification in Scene Description-to-Depiction Tasks

Ben Hutchinson

Jason Baldridge

Vinodkumar Prabhakaran

DiffM

221

11 Oct 2022

A generic diffusion-based approach for 3D human pose prediction in the wildIEEE International Conference on Robotics and Automation (ICRA), 2022

Saeed Saadatnejad

Ali-Ahmad Rasekh

Mohammadreza Mofayezi

Alexandre Alahi

287

11 Oct 2022

Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance

Chen Henry Wu

Fernando de la Torre

DiffM

376

11 Oct 2022

GENIE: Higher-Order Denoising Diffusion SolversNeural Information Processing Systems (NeurIPS), 2022

345

141

11 Oct 2022

GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion ModelsSpoken Language Technology Workshop (SLT), 2022

Matthew Baas

Herman Kamper

DiffM

176

11 Oct 2022

Markup-to-Image Diffusion Models with Scheduled SamplingInternational Conference on Learning Representations (ICLR), 2022

190

11 Oct 2022

f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation

Jiatao Gu

Shuangfei Zhai

Yizhe Zhang

Miguel Angel Bautista

J. Susskind

DiffM

228

10 Oct 2022

What the DAAM: Interpreting Stable Diffusion Using Cross AttentionAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

587

229

10 Oct 2022

CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning

Shi-You Xu

VLM DiffM

203

10 Oct 2022