Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022

23 May 2022

Seyed Kamyar Seyed Ghasemipour

Burcu Karagol Ayan

S. S. Mahdavi

Raphael Gontijo-Lopes

David J Fleet

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,039 papers shown

ISS: Image as Stepping Stone for Text-Guided 3D Shape GenerationInternational Conference on Learning Representations (ICLR), 2022

Xiaojuan Qi

406

09 Sep 2022

TEACH: Temporal Action Composition for 3D HumansInternational Conference on 3D Vision (3DV), 2022

416

185

09 Sep 2022

Text-Free Learning of a Natural Language Interface for Pretrained Face Generators

Gregory Shakhnarovich

CLIP

119

08 Sep 2022

Data Feedback Loops: Model-driven Amplification of Dataset BiasesInternational Conference on Machine Learning (ICML), 2022

Rohan Taori

Tatsunori B. Hashimoto

336

08 Sep 2022

FETA: Towards Specializing Foundation Models for Expert Task Applications

...

253

08 Sep 2022

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified FlowInternational Conference on Learning Representations (ICLR), 2022

1.1K

1,983

07 Sep 2022

Statistical Foundation Behind Machine Learning and Its Impact on Computer Vision

Lei Zhang

H. Shum

VLM SSL

137

06 Sep 2022

A Survey on Generative Diffusion ModelIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022

Hanqun Cao

Cheng Tan

Zhangyang Gao

Yilun Xu

Guangyong Chen

Pheng-Ann Heng

Stan Z. Li

MedIm

766

411

06 Sep 2022

Diffusion Models: A Comprehensive Survey of Methods and ApplicationsACM Computing Surveys (ACM CSUR), 2022

Wentao Zhang

Ming-Hsuan Yang

1.5K

1,882

02 Sep 2022

Zero-Shot Multi-Modal Artist-Controlled Retrieval and Exploration of 3D Object Sets

146

01 Sep 2022

FLAME: Free-form Language-based Motion Synthesis & EditingAAAI Conference on Artificial Intelligence (AAAI), 2022

407

251

01 Sep 2022

A Diffusion Model Predicts 3D Shapes from 2D Microscopy ImagesIEEE International Symposium on Biomedical Imaging (ISBI), 2022

Dominik Jens Elias Waibel

203

30 Aug 2022

Frido: Feature Pyramid Diffusion for Complex Scene Image SynthesisAAAI Conference on Artificial Intelligence (AAAI), 2022

Lu Yuan

228

114

29 Aug 2022

LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems

136

29 Aug 2022

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven GenerationComputer Vision and Pattern Recognition (CVPR), 2022

Nataniel Ruiz

Yuanzhen Li

Varun Jampani

Yael Pritch

Michael Rubinstein

Kfir Aberman

1.0K

3,747

25 Aug 2022

Understanding Diffusion Models: A Unified Perspective

Calvin Luo

DiffM

295

466

25 Aug 2022

AI and 6G into the Metaverse: Fundamentals, Challenges and Future Research TrendsIEEE Open Journal of the Communications Society (OJ-COMS), 2022

234

115

23 Aug 2022

Accelerating Vision Transformer Training via a Patch Sampling Schedule

Bradley McDanel

C. Huynh

ViT

107

19 Aug 2022

Text to Image Generation: Leaving no Language Behind

Pedro Reviriego

Elena Merino-Gómez

VLM

121

19 Aug 2022

Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning

245

18 Aug 2022

Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance

186

18 Aug 2022

Multimodal foundation models are better simulators of the human brain

Mingyu Ding

...

172

17 Aug 2022

ILLUME: Rationalizing Vision-Language Models through Human InteractionsInternational Conference on Machine Learning (ICML), 2022

382

17 Aug 2022

Applying Regularized Schrödinger-Bridge-Based Stochastic Process in Generative Modeling

Ki-Ung Song

DiffM

145

15 Aug 2022

Layout-Bridging Text-to-Image Synthesis

158

12 Aug 2022

Language-Guided Face Animation by Recurrent StyleGAN-based GeneratorIEEE transactions on multimedia (IEEE TMM), 2022

274

11 Aug 2022

Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIPNeural Information Processing Systems (NeurIPS), 2022

Thao Nguyen

563

122

10 Aug 2022

Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield NetworksIEEE Transactions on Image Processing (IEEE TIP), 2022

236

08 Aug 2022

SKDCGN: Source-free Knowledge Distillation of Counterfactual Generative Networks using cGANs

211

08 Aug 2022

Analog Bits: Generating Discrete Data using Diffusion Models with Self-ConditioningInternational Conference on Learning Representations (ICLR), 2022

396

398

08 Aug 2022

Sampling Based On Natural Image Statistics Improves Local Surrogate ExplainersBritish Machine Vision Conference (BMVC), 2022

Ricardo Kleinlein

Alexander Hepburn

Raúl Santos-Rodríguez

Fernando Fernández-Martínez

AAML FAtt

111

08 Aug 2022

Creative Wand: A System to Study Effects of Communications in Co-Creative SettingsArtificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2022

Zhiyu Lin

Rohan Agarwal

Mark O. Riedl

143

04 Aug 2022

Adversarial Attacks on Image Generation With Made-Up Words

Raphael Milliere

222

04 Aug 2022

DALLE-URBAN: Capturing the urban design expertise of large text to image transformers

Sachith Seneviratne

Damith A. Senanayake

Sanka Rasnayaka

Rajith Vidanaarachchi

Jason Thompson

ViT

247

03 Aug 2022

Prompt-to-Prompt Image Editing with Cross Attention ControlInternational Conference on Learning Representations (ICLR), 2022

Amir Hertz

Ron Mokady

J. Tenenbaum

Kfir Aberman

Yael Pritch

Daniel Cohen-Or

DiffM

713

2,323

02 Aug 2022

An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual InversionInternational Conference on Learning Representations (ICLR), 2022

Daniel Cohen-Or

476

2,443

02 Aug 2022

Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion ModelsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Ozan Özdenizci

Robert Legenstein

DiffM

336

389

29 Jul 2022

Testing Relational Understanding in Text-Guided Image Generation

C. Conwell

T. Ullman

EGVM

362

29 Jul 2022

GAUDI: A Neural Architect for Immersive 3D Scene GenerationNeural Information Processing Systems (NeurIPS), 2022

Miguel Angel Bautista

...

243

155

27 Jul 2022

Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models

256

26 Jul 2022

What is Healthy? Generative Counterfactual Diffusion for Lesion Localization

Sotirios A. Tsaftaris

MedIm DiffM

355

25 Jul 2022

Intention-Conditioned Long-Term Human Egocentric Action ForecastingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

Esteve Valls Mascaro

Hyemin Ahn

Dongheui Lee

EgoV

311

25 Jul 2022

Do Perceptually Aligned Gradients Imply Adversarial Robustness?International Conference on Machine Learning (ICML), 2022

303

22 Jul 2022

A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration

Ming-Yu Liu

Yuxiang Wei

Xiaohe Wu

Wangmeng Zuo

Lei Zhang

232

21 Jul 2022

NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual SynthesisNeural Information Processing Systems (NeurIPS), 2022

Jian Liang

Zicheng Liu

214

20 Jul 2022

Sparse Relational Reasoning with Object-Centric Representations

171

15 Jul 2022

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and ActionConference on Robot Learning (CoRL), 2022

546

607

10 Jul 2022

Improving Diffusion Model Efficiency Through Patching

Troy Luhman

Eric Luhman

DiffM

184

09 Jul 2022

Accelerating Material Design with the Generative Toolkit for Scientific Discoverynpj Computational Materials (npj Comput. Mater.), 2022

Matteo Manica

Jannis Born

Joris Cadow

Dimitrios Christofidellis

...

265

08 Jul 2022

387

08 Jul 2022