Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 5,039 papers shown
ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
International Conference on Learning Representations (ICLR), 2022
Zhengzhe Liu
Peng Dai
Ruihui Li
Xiaojuan Qi
Chi-Wing Fu
DiffM
406
29
0
09 Sep 2022
TEACH: Temporal Action Composition for 3D Humans
International Conference on 3D Vision (3DV), 2022
Nikos Athanasiou
Mathis Petrovich
Michael J. Black
Gül Varol
416
185
0
09 Sep 2022
Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
Xiaodan Du
Raymond A. Yeh
Nicholas I. Kolkin
Eli Shechtman
Gregory Shakhnarovich
CLIP
119
2
0
08 Sep 2022
Data Feedback Loops: Model-driven Amplification of Dataset Biases
International Conference on Machine Learning (ICML), 2022
Rohan Taori
Tatsunori B. Hashimoto
336
59
0
08 Sep 2022
FETA: Towards Specializing Foundation Models for Expert Task Applications
Amit Alfassy
Assaf Arbelle
Oshri Halimi
Sivan Harary
Roei Herzig
...
Christoph Auer
Kate Saenko
Peter W. J. Staar
Rogerio Feris
Leonid Karlinsky
253
20
0
08 Sep 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
International Conference on Learning Representations (ICLR), 2022
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
1.1K
1,983
0
07 Sep 2022
Statistical Foundation Behind Machine Learning and Its Impact on Computer Vision
Lei Zhang
H. Shum
VLM
SSL
137
2
0
06 Sep 2022
A Survey on Generative Diffusion Model
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
766
411
0
06 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
ACM Computing Surveys (ACM CSUR), 2022
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Tengjiao Wang
Ming-Hsuan Yang
DiffM
MedIm
1.5K
1,882
0
02 Sep 2022
Zero-Shot Multi-Modal Artist-Controlled Retrieval and Exploration of 3D Object Sets
Kristofer Schlachter
Benjamin Ahlbrand
Zhu Wang
V. Ortenzi
Ken Perlin
DiffM
3DV
146
8
0
01 Sep 2022
FLAME: Free-form Language-based Motion Synthesis & Editing
AAAI Conference on Artificial Intelligence (AAAI), 2022
Jihoon Kim
Jiseob Kim
Sungjoon Choi
VGen
407
251
0
01 Sep 2022
A Diffusion Model Predicts 3D Shapes from 2D Microscopy Images
IEEE International Symposium on Biomedical Imaging (ISBI), 2022
Dominik Jens Elias Waibel
Ernst Rooell
Bastian Rieck
Raja Giryes
Carsten Marr
DiffM
MedIm
203
53
0
30 Aug 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
AAAI Conference on Artificial Intelligence (AAAI), 2022
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
228
114
0
29 Aug 2022
LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems
Bjorn Deiseroth
P. Schramowski
Hikaru Shindo
Devendra Singh Dhami
Kristian Kersting
EGVM
DiffM
136
2
0
29 Aug 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
1.0K
3,747
0
25 Aug 2022
Understanding Diffusion Models: A Unified Perspective
Calvin Luo
DiffM
295
466
0
25 Aug 2022
AI and 6G into the Metaverse: Fundamentals, Challenges and Future Research Trends
IEEE Open Journal of the Communications Society (OJ-COMS), 2022
Muhammad Zawish
Fayaz Ali Dharejo
Sunder Ali Khowaja
Saleem Raza
Steven Davy
Kapal Dev
P. Bellavista
234
115
0
23 Aug 2022
Accelerating Vision Transformer Training via a Patch Sampling Schedule
Bradley McDanel
C. Huynh
ViT
107
1
0
19 Aug 2022
Text to Image Generation: Leaving no Language Behind
Pedro Reviriego
Elena Merino-Gómez
VLM
121
15
0
19 Aug 2022
Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning
Olivia Wiles
Isabela Albuquerque
Sven Gowal
VLM
245
52
0
18 Aug 2022
Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance
Bahjat Kawar
Roy Ganz
Michael Elad
DiffM
186
46
0
18 Aug 2022
Multimodal foundation models are better simulators of the human brain
Haoyu Lu
Qiongyi Zhou
Nanyi Fei
Zhiwu Lu
Mingyu Ding
...
Changde Du
Xin Zhao
Haoran Sun
Huiguang He
J. Wen
AI4CE
172
19
0
17 Aug 2022
ILLUME: Rationalizing Vision-Language Models through Human Interactions
International Conference on Machine Learning (ICML), 2022
Manuel Brack
P. Schramowski
Bjorn Deiseroth
Kristian Kersting
VLM
MLLM
382
5
0
17 Aug 2022
Applying Regularized Schrödinger-Bridge-Based Stochastic Process in Generative Modeling
Ki-Ung Song
DiffM
145
9
0
15 Aug 2022
Layout-Bridging Text-to-Image Synthesis
Jiadong Liang
Wenjie Pei
Feng Lu
EGVM
158
20
0
12 Aug 2022
Language-Guided Face Animation by Recurrent StyleGAN-based Generator
IEEE transactions on multimedia (IEEE TMM), 2022
Tiankai Hang
Huan Yang
Bei Liu
Jianlong Fu
Xin Geng
B. Guo
VGen
274
15
0
11 Aug 2022
Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP
Neural Information Processing Systems (NeurIPS), 2022
Thao Nguyen
Gabriel Ilharco
Mitchell Wortsman
Sewoong Oh
Ludwig Schmidt
CLIP
VLM
563
122
0
10 Aug 2022
Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks
IEEE Transactions on Image Processing (IEEE TIP), 2022
Yonghao Xu
Weikang Yu
Pedram Ghamisi
Michael K Kopp
Sepp Hochreiter
236
51
0
08 Aug 2022
SKDCGN: Source-free Knowledge Distillation of Counterfactual Generative Networks using cGANs
Sameer Ambekar
Matteo Tafuro
Ankit Ankit
Diego van der Mast
Mark Alence
C. Athanasiadis
GAN
211
4
0
08 Aug 2022
Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
International Conference on Learning Representations (ICLR), 2022
Ting-Li Chen
Ruixiang Zhang
Geoffrey E. Hinton
DiffM
396
398
0
08 Aug 2022
Sampling Based On Natural Image Statistics Improves Local Surrogate Explainers
British Machine Vision Conference (BMVC), 2022
Ricardo Kleinlein
Alexander Hepburn
Raúl Santos-Rodríguez
Fernando Fernández-Martínez
AAML
FAtt
111
3
0
08 Aug 2022
Creative Wand: A System to Study Effects of Communications in Co-Creative Settings
Artificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2022
Zhiyu Lin
Rohan Agarwal
Mark O. Riedl
143
12
0
04 Aug 2022
Adversarial Attacks on Image Generation With Made-Up Words
Raphael Milliere
222
40
0
04 Aug 2022
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
247
28
0
03 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
International Conference on Learning Representations (ICLR), 2022
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
713
2,323
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
International Conference on Learning Representations (ICLR), 2022
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
476
2,443
0
02 Aug 2022
Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion Models
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ozan Özdenizci
Robert Legenstein
DiffM
336
389
0
29 Jul 2022
Testing Relational Understanding in Text-Guided Image Generation
C. Conwell
T. Ullman
EGVM
362
69
0
29 Jul 2022
GAUDI: A Neural Architect for Immersive 3D Scene Generation
Neural Information Processing Systems (NeurIPS), 2022
Miguel Angel Bautista
Pengsheng Guo
Samira Abnar
Walter A. Talbott
Alexander Toshev
...
Shuangfei Zhai
Hanlin Goh
Daniel Ulbricht
Afshin Dehghan
J. Susskind
SyDa
3DGS
243
155
0
27 Jul 2022
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
256
90
0
26 Jul 2022
What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
Pedro Sanchez
Antanas Kascenas
Xiao Liu
Alison Q. OÑeil
Sotirios A. Tsaftaris
MedIm
DiffM
355
79
0
25 Jul 2022
Intention-Conditioned Long-Term Human Egocentric Action Forecasting
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
EgoV
311
43
0
25 Jul 2022
Do Perceptually Aligned Gradients Imply Adversarial Robustness?
International Conference on Machine Learning (ICML), 2022
Roy Ganz
Bahjat Kawar
Michael Elad
AAML
303
15
0
22 Jul 2022
A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration
Ming-Yu Liu
Yuxiang Wei
Xiaohe Wu
Wangmeng Zuo
Lei Zhang
232
1
0
21 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Neural Information Processing Systems (NeurIPS), 2022
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
214
93
0
20 Jul 2022
Sparse Relational Reasoning with Object-Centric Representations
Alex F Spies
Alessandra Russo
Murray Shanahan
OCL
NAI
171
3
0
15 Jul 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Conference on Robot Learning (CoRL), 2022
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
546
607
0
10 Jul 2022
Improving Diffusion Model Efficiency Through Patching
Troy Luhman
Eric Luhman
DiffM
184
21
0
09 Jul 2022
Accelerating Material Design with the Generative Toolkit for Scientific Discovery
npj Computational Materials (npj Comput. Mater.), 2022
Matteo Manica
Jannis Born
Joris Cadow
Dimitrios Christofidellis
A. Dave
...
Lauren N. McHugh
Alexy Khrabrov
Payel Das
Seiji Takeda
John Smith
265
41
0
08 Jul 2022
Big Learning
Yulai Cong
Miaoyun Zhao
AI4CE
387
0
0
08 Jul 2022
Previous
1
2
3
...
100
101
99
Next