Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022

23 May 2022

Seyed Kamyar Seyed Ghasemipour

Burcu Karagol Ayan

S. S. Mahdavi

Raphael Gontijo-Lopes

David J Fleet

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,041 papers shown

DOLCE: A Model-Based Probabilistic Diffusion Framework for Limited-Angle CT Reconstruction

IEEE International Conference on Computer Vision (ICCV), 2022

Jiaming Liu

Rushil Anirudh

Jayaraman J. Thiagarajan

Ulugbek S. Kamilov

178

100

22 Nov 2022

SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Computer Vision and Pattern Recognition (CVPR), 2022

Xiaodong Cun

Yong Zhang

Ying Shan

246

408

22 Nov 2022

Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark

...

117

22 Nov 2022

SinFusion: Training Diffusion Models on a Single Image or Video

International Conference on Machine Learning (ICML), 2022

377

21 Nov 2022

SceneComposer: Any-Level Semantic Image Synthesis

Computer Vision and Pattern Recognition (CVPR), 2022

150

21 Nov 2022

Exploring Discrete Diffusion Models for Image Captioning

Zicheng Liu

270

21 Nov 2022

VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2022

Ajay Jain

Amber Xie

Pieter Abbeel

DiffM

214

119

21 Nov 2022

Video Background Music Generation: Dataset, Method and Evaluation

IEEE International Conference on Computer Vision (ICCV), 2022

281

21 Nov 2022

Investigating Prompt Engineering in Diffusion Models

Sam Witteveen

Martin Andrews

128

21 Nov 2022

Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training

Ming-Hsuan Yang

241

21 Nov 2022

MagicVideo: Efficient Video Generation With Latent Diffusion Models

402

470

20 Nov 2022

Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

245

20 Nov 2022

IC3D: Image-Conditioned 3D Diffusion for Shape Generation

351

20 Nov 2022

DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization

IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022

Yong Zhang

288

19 Nov 2022

EDGE: Editable Dance Generation From Music

Computer Vision and Pattern Recognition (CVPR), 2022

Jo-Han Tseng

Rodrigo Castellon

Chenxi Liu

354

336

19 Nov 2022

Magic3D: High-Resolution Text-to-3D Content Creation

Computer Vision and Pattern Recognition (CVPR), 2022

Sanja Fidler

393

1,446

18 Nov 2022

Invariant Learning via Diffusion Dreamed Distribution Shifts

141

18 Nov 2022

RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation

Computer Vision and Pattern Recognition (CVPR), 2022

Niloy J. Mitra

282

203

17 Nov 2022

InstructPix2Pix: Learning to Follow Image Editing Instructions

Computer Vision and Pattern Recognition (CVPR), 2022

Tim Brooks

Aleksander Holynski

Alexei A. Efros

DiffM

879

2,543

17 Nov 2022

Conffusion: Confidence Intervals for Diffusion Models

Eliahu Horwitz

Yedid Hoshen

DiffM

209

17 Nov 2022

Null-text Inversion for Editing Real Images using Guided Diffusion Models

Ron Mokady

Amir Hertz

Kfir Aberman

Yael Pritch

Daniel Cohen-Or

DiffM

274

957

17 Nov 2022

DiffusionDet: Diffusion Model for Object Detection

IEEE International Conference on Computer Vision (ICCV), 2022

Shoufa Chen

Pei Sun

Yibing Song

Ping Luo

405

650

17 Nov 2022

Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models

ACM Transactions on Graphics (TOG), 2022

289

234

17 Nov 2022

Is the Elephant Flying? Resolving Ambiguities in Text-to-Image Generative Models

204

17 Nov 2022

GLAMI-1M: A Multilingual Image-Text Fashion Dataset

British Machine Vision Conference (BMVC), 2022

169

17 Nov 2022

A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks

International Conference on Learning Representations (ICLR), 2022

332

16 Nov 2022

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model

IEEE International Conference on Computer Vision (ICCV), 2022

562

248

15 Nov 2022

Will Large-scale Generative Models Corrupt Future Datasets?

IEEE International Conference on Computer Vision (ICCV), 2022

Ryuichiro Hataya

Han Bao

Hiromi Arai

245

15 Nov 2022

Cross-Reality Re-Rendering: Manipulating between Digital and Physical Realities

Siddhartha Datta

244

15 Nov 2022

Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion Models

132

15 Nov 2022

Extreme Generative Image Compression by Learning Text Embedding from Diffusion Models

204

14 Nov 2022

Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation

IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

187

14 Nov 2022

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

Computer Vision and Pattern Recognition (CVPR), 2022

621

907

14 Nov 2022

Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures

Computer Vision and Pattern Recognition (CVPR), 2022

Daniel Cohen-Or

529

562

14 Nov 2022

Language models are good pathologists: using attention-based sequence reduction and text-pretrained transformers for efficient WSI classification

Juan Pisula

Katarzyna Bozek

VLM MedIm

322

14 Nov 2022

A Novel Sampling Scheme for Text- and Image-Conditional Image Synthesis in Quantized Latent Spaces

142

14 Nov 2022

Large-Scale Bidirectional Training for Zero-Shot Image Captioning

220

13 Nov 2022

Design of Unmanned Air Vehicles Using Transformer Surrogate Models

176

11 Nov 2022

Efficient HLA imputation from sequential SNPs data by Transformer

Journal of Human Genetics (J Hum Genet), 2022

130

11 Nov 2022

SSGVS: Semantic Scene Graph-to-Video Synthesis

Yuren Cong

Jinhui Yi

Bodo Rosenhahn

M. Yang

247

11 Nov 2022

Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models

Computer Vision and Pattern Recognition (CVPR), 2022

511

454

09 Nov 2022

DiffPhase: Generative Diffusion-based STFT Phase Retrieval

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022

180

08 Nov 2022

Self-conditioned Embedding Diffusion for Text Generation

...

239

107

08 Nov 2022

Astronomia ex machina: a history, primer, and outlook on neural networks in astronomy

Royal Society Open Science (RSOS), 2022

Michael J. Smith

James E. Geach

213

07 Nov 2022

Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image Generation

Firas Khader

Gustav Mueller-Franzes

Soroosh Tayebi Arasteh

...

Jakob Nikolas Kather

Daniel Truhn

DiffM MedIm

486

07 Nov 2022

Rickrolling the Artist: Injecting Backdoors into Text Encoders for Text-to-Image Synthesis

IEEE International Conference on Computer Vision (ICCV), 2022

467

04 Nov 2022

Evaluating a Synthetic Image Dataset Generated with Stable Diffusion

Andreas Stöckl

219

03 Nov 2022

eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers

...

670

990

02 Nov 2022

DPM-Solver++: Fast Solver for Guided Sampling of Diffusion Probabilistic Models

Machine Intelligence Research (MIR), 2022

838

837

02 Nov 2022

MedSegDiff: Medical Image Segmentation with Diffusion Probabilistic Model

International Conference on Medical Imaging with Deep Learning (MIDL), 2022

Haoyi Xiong

Yanwu Xu

504

386

01 Nov 2022