ResearchTrend.AI



Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 4,309 papers shown
Title
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
6
17
0
03 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
17
1,680
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Y. Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
34
1,777
0
02 Aug 2022
Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion Models
Ozan Özdenizci
R. Legenstein
DiffM
28
238
0
29 Jul 2022
Testing Relational Understanding in Text-Guided Image Generation
C. Conwell
T. Ullman
EGVM
137
64
0
29 Jul 2022
GAUDI: A Neural Architect for Immersive 3D Scene Generation
Miguel Angel Bautista
Pengsheng Guo
Samira Abnar
Walter A. Talbott
Alexander Toshev
...
Shuangfei Zhai
Hanlin Goh
Daniel Ulbricht
Afshin Dehghan
J. Susskind
SyDa
3DGS
29
135
0
27 Jul 2022
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
14
68
0
26 Jul 2022
What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
Pedro Sanchez
Antanas Kascenas
Xiao Liu
Alison Q. O'Neil
Sotirios A. Tsaftaris
MedIm
DiffM
16
62
0
25 Jul 2022
Intention-Conditioned Long-Term Human Egocentric Action Forecasting
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
EgoV
19
28
0
25 Jul 2022
Do Perceptually Aligned Gradients Imply Adversarial Robustness?
Roy Ganz
Bahjat Kawar
Michael Elad
AAML
9
8
0
22 Jul 2022
A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration
Ming-Yu Liu
Yuxiang Wei
Xiaohe Wu
Wangmeng Zuo
Lei Zhang
20
1
0
21 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
10
72
0
20 Jul 2022
Sparse Relational Reasoning with Object-Centric Representations
Alex F Spies
Alessandra Russo
Murray Shanahan
OCL
NAI
14
2
0
15 Jul 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
139
435
0
10 Jul 2022
Improving Diffusion Model Efficiency Through Patching
Troy Luhman
Eric Luhman
DiffM
9
18
0
09 Jul 2022
Accelerating Material Design with the Generative Toolkit for Scientific Discovery
Matteo Manica
Jannis Born
Joris Cadow
Dimitrios Christofidellis
A. Dave
...
Lauren N. McHugh
Alexy Khrabrov
Payel Das
Seiji Takeda
John Smith
11
26
0
08 Jul 2022
Big Learning
Yulai Cong
Miaoyun Zhao
AI4CE
19
0
0
08 Jul 2022
Distilling Model Failures as Directions in Latent Space
Saachi Jain
Hannah Lawrence
Ankur Moitra
A. Madry
16
88
0
29 Jun 2022
Neural Neural Textures Make Sim2Real Consistent
R. Burgert
Jinghuan Shang
Xiang Li
Michael S. Ryoo
21
6
0
27 Jun 2022
ProGen2: Exploring the Boundaries of Protein Language Models
Erik Nijkamp
Jeffrey A. Ruffolo
Eli N. Weinstein
Nikhil Naik
Ali Madani
AI4TS
8
280
0
27 Jun 2022
Text-Driven Stylization of Video Objects
Sebastian Loeschcke
Serge J. Belongie
Sagie Benaim
VGen
DiffM
22
16
0
24 Jun 2022
The ArtBench Dataset: Benchmarking Generative Models with Artworks
Peiyuan Liao
Xiuyu Li
Xihui Liu
Kurt Keutzer
11
47
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
79
1,061
0
22 Jun 2022
Generative Modelling With Inverse Heat Dissipation
Severi Rissanen
Markus Heinonen
Arno Solin
DiffM
11
108
0
21 Jun 2022
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
Minguk Kang
Joonghyuk Shin
Jaesik Park
EGVM
6
66
0
19 Jun 2022
Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems
Giannis Daras
Y. Dagan
A. Dimakis
C. Daskalakis
BDL
21
15
0
18 Jun 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan
Guanzhi Wang
Yunfan Jiang
Ajay Mandlekar
Yuncong Yang
Haoyi Zhu
Andrew Tang
De-An Huang
Yuke Zhu
Anima Anandkumar
LM&Ro
39
343
0
17 Jun 2022
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Shizhe Diao
Wangchunshu Zhou
Xinsong Zhang
Jiawei Wang
MLLM
AI4CE
14
15
0
15 Jun 2022
Multi-instrument Music Synthesis with Spectrogram Diffusion
Curtis Hawthorne
Ian Simon
Adam Roberts
Neil Zeghidour
Josh Gardner
Ethan Manilow
Jesse Engel
DiffM
21
48
0
11 Jun 2022
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
47
374
0
06 Jun 2022
Priors in Deep Image Restoration and Enhancement: A Survey
Yunfan Lu
Yi-Tun Lin
Hao Wu
Yuan Luo
Xueye Zheng
Hui Xiong
Lin Wang
31
3
0
04 Jun 2022
Discovering the Hidden Vocabulary of DALLE-2
Giannis Daras
A. Dimakis
119
64
0
01 Jun 2022
Decomposing NeRF for Editing via Feature Field Distillation
Sosuke Kobayashi
Eiichi Matsumoto
Vincent Sitzmann
167
327
0
31 May 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
243
556
0
29 May 2022
Diffusion-LM Improves Controllable Text Generation
Xiang Lisa Li
John Thickstun
Ishaan Gulrajani
Percy Liang
Tatsunori B. Hashimoto
AI4CE
171
768
0
27 May 2022
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond
Yisheng Xiao
Lijun Wu
Junliang Guo
Juntao Li
M. Zhang
Tao Qin
Tie-Yan Liu
3DV
MedIm
AI4CE
25
81
0
20 Apr 2022
A Taxonomy of Prompt Modifiers for Text-To-Image Generation
J. Oppenlaender
15
102
0
20 Apr 2022
Opal: Multimodal Image Generation for News Illustration
Vivian Liu
Han Qiao
Lydia B. Chilton
11
98
0
19 Apr 2022
Diagnosing and Fixing Manifold Overfitting in Deep Generative Models
G. Loaiza-Ganem
Brendan Leigh Ross
Jesse C. Cresswell
Anthony L. Caterini
GAN
DRL
14
27
0
14 Apr 2022
KNN-Diffusion: Image Generation via Large-Scale Retrieval
Shelly Sheynin
Oron Ashual
Adam Polyak
Uriel Singer
Oran Gafni
Eliya Nachmani
Yaniv Taigman
VLM
SyDa
DiffM
13
111
0
06 Apr 2022
Complex Scene Image Editing by Scene Graph Comprehension
Zhongping Zhang
Huiwen He
Bryan A. Plummer
Z. Liao
Huayan Wang
DiffM
20
6
0
24 Mar 2022
The Role of ImageNet Classes in Fréchet Inception Distance
Tuomas Kynkäänniemi
Tero Karras
M. Aittala
Timo Aila
J. Lehtinen
EGVM
VLM
10
197
0
11 Mar 2022
KPE: Keypoint Pose Encoding for Transformer-based Image Generation
Soon Yau Cheong
A. Mustafa
Andrew Gilbert
ViT
25
10
0
09 Mar 2022
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU
Junseok Oh
Donghwee Yoon
Injung Kim
19
1
0
28 Feb 2022
VLP: A Survey on Vision-Language Pre-training
Feilong Chen
Duzhen Zhang
Minglun Han
Xiuyi Chen
Jing Shi
Shuang Xu
Bo Xu
VLM
82
211
0
18 Feb 2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
ViT
132
170
0
08 Feb 2022
FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control
Dimitri von Rütte
Luca Biggio
Yannic Kilcher
Thomas Hofmann
25
0
0
26 Jan 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
24
48
0
27 Dec 2021
Palette: Image-to-Image Diffusion Models
Chitwan Saharia
William Chan
Huiwen Chang
Chris A. Lee
Jonathan Ho
Tim Salimans
David J. Fleet
Mohammad Norouzi
DiffM
VLM
325
1,584
0
10 Nov 2021
A Systematic Investigation of Commonsense Knowledge in Large Language Models
Xiang Lorraine Li
A. Kuncoro
Jordan Hoffmann
Cyprien de Masson d'Autume
Phil Blunsom
Aida Nematzadeh
LRM
17
56
0
31 Oct 2021