ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,039 papers shown
ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
ISS: Image as Stepping Stone for Text-Guided 3D Shape GenerationInternational Conference on Learning Representations (ICLR), 2022
Zhengzhe Liu
Peng Dai
Ruihui Li
Xiaojuan Qi
Chi-Wing Fu
DiffM
406
29
0
09 Sep 2022
TEACH: Temporal Action Composition for 3D Humans
TEACH: Temporal Action Composition for 3D HumansInternational Conference on 3D Vision (3DV), 2022
Nikos Athanasiou
Mathis Petrovich
Michael J. Black
Gül Varol
416
185
0
09 Sep 2022
Text-Free Learning of a Natural Language Interface for Pretrained Face
  Generators
Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
Xiaodan Du
Raymond A. Yeh
Nicholas I. Kolkin
Eli Shechtman
Gregory Shakhnarovich
CLIP
119
2
0
08 Sep 2022
Data Feedback Loops: Model-driven Amplification of Dataset Biases
Data Feedback Loops: Model-driven Amplification of Dataset BiasesInternational Conference on Machine Learning (ICML), 2022
Rohan Taori
Tatsunori B. Hashimoto
336
59
0
08 Sep 2022
FETA: Towards Specializing Foundation Models for Expert Task
  Applications
FETA: Towards Specializing Foundation Models for Expert Task Applications
Amit Alfassy
Assaf Arbelle
Oshri Halimi
Sivan Harary
Roei Herzig
...
Christoph Auer
Kate Saenko
Peter W. J. Staar
Rogerio Feris
Leonid Karlinsky
253
20
0
08 Sep 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with
  Rectified Flow
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified FlowInternational Conference on Learning Representations (ICLR), 2022
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
1.1K
1,983
0
07 Sep 2022
Statistical Foundation Behind Machine Learning and Its Impact on
  Computer Vision
Statistical Foundation Behind Machine Learning and Its Impact on Computer Vision
Lei Zhang
H. Shum
VLMSSL
137
2
0
06 Sep 2022
A Survey on Generative Diffusion Model
A Survey on Generative Diffusion ModelIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
766
411
0
06 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Diffusion Models: A Comprehensive Survey of Methods and ApplicationsACM Computing Surveys (ACM CSUR), 2022
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Tengjiao Wang
Ming-Hsuan Yang
DiffMMedIm
1.5K
1,882
0
02 Sep 2022
Zero-Shot Multi-Modal Artist-Controlled Retrieval and Exploration of 3D
  Object Sets
Zero-Shot Multi-Modal Artist-Controlled Retrieval and Exploration of 3D Object Sets
Kristofer Schlachter
Benjamin Ahlbrand
Zhu Wang
V. Ortenzi
Ken Perlin
DiffM3DV
146
8
0
01 Sep 2022
FLAME: Free-form Language-based Motion Synthesis & Editing
FLAME: Free-form Language-based Motion Synthesis & EditingAAAI Conference on Artificial Intelligence (AAAI), 2022
Jihoon Kim
Jiseob Kim
Sungjoon Choi
VGen
407
251
0
01 Sep 2022
A Diffusion Model Predicts 3D Shapes from 2D Microscopy Images
A Diffusion Model Predicts 3D Shapes from 2D Microscopy ImagesIEEE International Symposium on Biomedical Imaging (ISBI), 2022
Dominik Jens Elias Waibel
Ernst Rooell
Bastian Rieck
Raja Giryes
Carsten Marr
DiffMMedIm
203
53
0
30 Aug 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Frido: Feature Pyramid Diffusion for Complex Scene Image SynthesisAAAI Conference on Artificial Intelligence (AAAI), 2022
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
228
114
0
29 Aug 2022
LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems
LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems
Bjorn Deiseroth
P. Schramowski
Hikaru Shindo
Devendra Singh Dhami
Kristian Kersting
EGVMDiffM
136
2
0
29 Aug 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven GenerationComputer Vision and Pattern Recognition (CVPR), 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
1.0K
3,747
0
25 Aug 2022
Understanding Diffusion Models: A Unified Perspective
Understanding Diffusion Models: A Unified Perspective
Calvin Luo
DiffM
295
466
0
25 Aug 2022
AI and 6G into the Metaverse: Fundamentals, Challenges and Future
  Research Trends
AI and 6G into the Metaverse: Fundamentals, Challenges and Future Research TrendsIEEE Open Journal of the Communications Society (OJ-COMS), 2022
Muhammad Zawish
Fayaz Ali Dharejo
Sunder Ali Khowaja
Saleem Raza
Steven Davy
Kapal Dev
P. Bellavista
234
115
0
23 Aug 2022
Accelerating Vision Transformer Training via a Patch Sampling Schedule
Accelerating Vision Transformer Training via a Patch Sampling Schedule
Bradley McDanel
C. Huynh
ViT
107
1
0
19 Aug 2022
Text to Image Generation: Leaving no Language Behind
Text to Image Generation: Leaving no Language Behind
Pedro Reviriego
Elena Merino-Gómez
VLM
121
15
0
19 Aug 2022
Discovering Bugs in Vision Models using Off-the-shelf Image Generation
  and Captioning
Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning
Olivia Wiles
Isabela Albuquerque
Sven Gowal
VLM
245
52
0
18 Aug 2022
Enhancing Diffusion-Based Image Synthesis with Robust Classifier
  Guidance
Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance
Bahjat Kawar
Roy Ganz
Michael Elad
DiffM
186
46
0
18 Aug 2022
Multimodal foundation models are better simulators of the human brain
Multimodal foundation models are better simulators of the human brain
Haoyu Lu
Qiongyi Zhou
Nanyi Fei
Zhiwu Lu
Mingyu Ding
...
Changde Du
Xin Zhao
Haoran Sun
Huiguang He
J. Wen
AI4CE
172
19
0
17 Aug 2022
ILLUME: Rationalizing Vision-Language Models through Human Interactions
ILLUME: Rationalizing Vision-Language Models through Human InteractionsInternational Conference on Machine Learning (ICML), 2022
Manuel Brack
P. Schramowski
Bjorn Deiseroth
Kristian Kersting
VLMMLLM
382
5
0
17 Aug 2022
Applying Regularized Schrödinger-Bridge-Based Stochastic Process in
  Generative Modeling
Applying Regularized Schrödinger-Bridge-Based Stochastic Process in Generative Modeling
Ki-Ung Song
DiffM
145
9
0
15 Aug 2022
Layout-Bridging Text-to-Image Synthesis
Layout-Bridging Text-to-Image Synthesis
Jiadong Liang
Wenjie Pei
Feng Lu
EGVM
158
20
0
12 Aug 2022
Language-Guided Face Animation by Recurrent StyleGAN-based Generator
Language-Guided Face Animation by Recurrent StyleGAN-based GeneratorIEEE transactions on multimedia (IEEE TMM), 2022
Tiankai Hang
Huan Yang
Bei Liu
Jianlong Fu
Xin Geng
B. Guo
VGen
274
15
0
11 Aug 2022
Quality Not Quantity: On the Interaction between Dataset Design and
  Robustness of CLIP
Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIPNeural Information Processing Systems (NeurIPS), 2022
Thao Nguyen
Gabriel Ilharco
Mitchell Wortsman
Sewoong Oh
Ludwig Schmidt
CLIPVLM
563
122
0
10 Aug 2022
Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern
  Hopfield Networks
Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield NetworksIEEE Transactions on Image Processing (IEEE TIP), 2022
Yonghao Xu
Weikang Yu
Pedram Ghamisi
Michael K Kopp
Sepp Hochreiter
236
51
0
08 Aug 2022
SKDCGN: Source-free Knowledge Distillation of Counterfactual Generative
  Networks using cGANs
SKDCGN: Source-free Knowledge Distillation of Counterfactual Generative Networks using cGANs
Sameer Ambekar
Matteo Tafuro
Ankit Ankit
Diego van der Mast
Mark Alence
C. Athanasiadis
GAN
211
4
0
08 Aug 2022
Analog Bits: Generating Discrete Data using Diffusion Models with
  Self-Conditioning
Analog Bits: Generating Discrete Data using Diffusion Models with Self-ConditioningInternational Conference on Learning Representations (ICLR), 2022
Ting-Li Chen
Ruixiang Zhang
Geoffrey E. Hinton
DiffM
396
398
0
08 Aug 2022
Sampling Based On Natural Image Statistics Improves Local Surrogate
  Explainers
Sampling Based On Natural Image Statistics Improves Local Surrogate ExplainersBritish Machine Vision Conference (BMVC), 2022
Ricardo Kleinlein
Alexander Hepburn
Raúl Santos-Rodríguez
Fernando Fernández-Martínez
AAMLFAtt
111
3
0
08 Aug 2022
Creative Wand: A System to Study Effects of Communications in
  Co-Creative Settings
Creative Wand: A System to Study Effects of Communications in Co-Creative SettingsArtificial Intelligence and Interactive Digital Entertainment Conference (AIIDE), 2022
Zhiyu Lin
Rohan Agarwal
Mark O. Riedl
143
12
0
04 Aug 2022
Adversarial Attacks on Image Generation With Made-Up Words
Adversarial Attacks on Image Generation With Made-Up Words
Raphael Milliere
222
40
0
04 Aug 2022
DALLE-URBAN: Capturing the urban design expertise of large text to image
  transformers
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
247
28
0
03 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Prompt-to-Prompt Image Editing with Cross Attention ControlInternational Conference on Learning Representations (ICLR), 2022
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
713
2,323
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using
  Textual Inversion
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual InversionInternational Conference on Learning Representations (ICLR), 2022
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
476
2,443
0
02 Aug 2022
Restoring Vision in Adverse Weather Conditions with Patch-Based
  Denoising Diffusion Models
Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion ModelsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ozan Özdenizci
Robert Legenstein
DiffM
336
389
0
29 Jul 2022
Testing Relational Understanding in Text-Guided Image Generation
Testing Relational Understanding in Text-Guided Image Generation
C. Conwell
T. Ullman
EGVM
362
69
0
29 Jul 2022
GAUDI: A Neural Architect for Immersive 3D Scene Generation
GAUDI: A Neural Architect for Immersive 3D Scene GenerationNeural Information Processing Systems (NeurIPS), 2022
Miguel Angel Bautista
Pengsheng Guo
Samira Abnar
Walter A. Talbott
Alexander Toshev
...
Shuangfei Zhai
Hanlin Goh
Daniel Ulbricht
Afshin Dehghan
J. Susskind
SyDa3DGS
243
155
0
27 Jul 2022
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented
  Diffusion Models
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
256
90
0
26 Jul 2022
What is Healthy? Generative Counterfactual Diffusion for Lesion
  Localization
What is Healthy? Generative Counterfactual Diffusion for Lesion Localization
Pedro Sanchez
Antanas Kascenas
Xiao Liu
Alison Q. OÑeil
Sotirios A. Tsaftaris
MedImDiffM
355
79
0
25 Jul 2022
Intention-Conditioned Long-Term Human Egocentric Action Forecasting
Intention-Conditioned Long-Term Human Egocentric Action ForecastingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
EgoV
311
43
0
25 Jul 2022
Do Perceptually Aligned Gradients Imply Adversarial Robustness?
Do Perceptually Aligned Gradients Imply Adversarial Robustness?International Conference on Machine Learning (ICML), 2022
Roy Ganz
Bahjat Kawar
Michael Elad
AAML
303
15
0
22 Jul 2022
A Survey on Leveraging Pre-trained Generative Adversarial Networks for
  Image Editing and Restoration
A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration
Ming-Yu Liu
Yuxiang Wei
Xiaohe Wu
Wangmeng Zuo
Lei Zhang
232
1
0
21 Jul 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for
  Infinite Visual Synthesis
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual SynthesisNeural Information Processing Systems (NeurIPS), 2022
Chenfei Wu
Jian Liang
Xiaowei Hu
Zhe Gan
Jianfeng Wang
Lijuan Wang
Zicheng Liu
Yuejian Fang
Nan Duan
VGen
214
93
0
20 Jul 2022
Sparse Relational Reasoning with Object-Centric Representations
Sparse Relational Reasoning with Object-Centric Representations
Alex F Spies
Alessandra Russo
Murray Shanahan
OCLNAI
171
3
0
15 Jul 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language,
  Vision, and Action
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and ActionConference on Robot Learning (CoRL), 2022
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
546
607
0
10 Jul 2022
Improving Diffusion Model Efficiency Through Patching
Improving Diffusion Model Efficiency Through Patching
Troy Luhman
Eric Luhman
DiffM
184
21
0
09 Jul 2022
Accelerating Material Design with the Generative Toolkit for Scientific
  Discovery
Accelerating Material Design with the Generative Toolkit for Scientific Discoverynpj Computational Materials (npj Comput. Mater.), 2022
Matteo Manica
Jannis Born
Joris Cadow
Dimitrios Christofidellis
A. Dave
...
Lauren N. McHugh
Alexy Khrabrov
Payel Das
Seiji Takeda
John Smith
265
41
0
08 Jul 2022
Big Learning
Big Learning
Yulai Cong
Miaoyun Zhao
AI4CE
387
0
0
08 Jul 2022
Previous
123...10010199
Next