ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11487
  4. Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,040 papers shown
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the
  Underlying Score Fokker-Planck Equation
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck EquationInternational Conference on Machine Learning (ICML), 2022
Chieh-Hsin Lai
Yuhta Takida
Naoki Murata
Toshimitsu Uesaka
Yuki Mitsufuji
Stefano Ermon
DiffM
283
38
0
09 Oct 2022
Adapting Pretrained Vision-Language Foundational Models to Medical
  Imaging Domains
Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains
Pierre J. Chambon
Christian Blüthgen
C. Langlotz
Akshay S. Chaudhari
DiffMMedImLM&MA
176
135
0
09 Oct 2022
Can Artificial Intelligence Reconstruct Ancient Mosaics?
Can Artificial Intelligence Reconstruct Ancient Mosaics?Studies in Conservation (SIC), 2022
Fernando Moral-Andrés
Elena Merino-Gómez
Pedro Reviriego
Fabrizio Lombardi
90
9
0
07 Oct 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text
  Generation
Visualize Before You Write: Imagination-Guided Open-Ended Text GenerationFindings (Findings), 2022
Wanrong Zhu
An Yan
Yujie Lu
Wenda Xu
Xinze Wang
Miguel P. Eckstein
William Yang Wang
320
38
0
07 Oct 2022
Trustworthiness of Laser-Induced Breakdown Spectroscopy Predictions via
  Simulation-based Synthetic Data Augmentation and Multitask Learning
Trustworthiness of Laser-Induced Breakdown Spectroscopy Predictions via Simulation-based Synthetic Data Augmentation and Multitask LearningEPJ Web of Conferences (EPJ Web Conf.), 2022
Riccardo Finotello
D. L’hermite
Celine Quéré
Benjamin Rouge
M. Tamaazousti
J. Sirven
164
2
0
07 Oct 2022
Efficient Diffusion Models for Vision: A Survey
Efficient Diffusion Models for Vision: A Survey
Anwaar Ulhaq
Naveed Akhtar
MedIm
417
86
0
07 Oct 2022
On Distillation of Guided Diffusion Models
On Distillation of Guided Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Chenlin Meng
Robin Rombach
Ruiqi Gao
Diederik P. Kingma
Stefano Ermon
Jonathan Ho
Tim Salimans
VLMDiffM
249
697
0
06 Oct 2022
Content-Based Search for Deep Generative Models
Content-Based Search for Deep Generative ModelsACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2022
Daohan Lu
Sheng-Yu Wang
Nupur Kumari
Rohan Agarwal
Mia Tang
David Bau
Jun-Yan Zhu
DiffMSyDa
327
8
0
06 Oct 2022
Novel View Synthesis with Diffusion Models
Novel View Synthesis with Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2022
Daniel Watson
William Chan
Ricardo Martín Brualla
Jonathan Ho
Andrea Tagliasacchi
Mohammad Norouzi
DiffM
422
320
0
06 Oct 2022
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics
DALL-E-Bot: Introducing Web-Scale Diffusion Models to RoboticsIEEE Robotics and Automation Letters (RA-L), 2022
Ivan Kapelyukh
Vitalis Vosylius
Edward Johns
LM&RoDiffM
525
176
0
05 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual
  Description
Phenaki: Variable Length Video Generation From Open Domain Textual DescriptionInternational Conference on Learning Representations (ICLR), 2022
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffMVGen
362
486
0
05 Oct 2022
Bayesian Prompt Learning for Image-Language Model Generalization
Bayesian Prompt Learning for Image-Language Model GeneralizationIEEE International Conference on Computer Vision (ICCV), 2022
Mohammad Mahdi Derakhshani
Enrique Sanchez
Adrian Bulat
Victor G. Turrisi da Costa
Cees G. M. Snoek
Georgios Tzimiropoulos
Brais Martínez
VPVLMVLM
419
60
0
05 Oct 2022
clip2latent: Text driven sampling of a pre-trained StyleGAN using
  denoising diffusion and CLIP
clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIPBritish Machine Vision Conference (BMVC), 2022
Justin N. M. Pinkney
Chuan Li
CLIPVLM
233
26
0
05 Oct 2022
Imagen Video: High Definition Video Generation with Diffusion Models
Imagen Video: High Definition Video Generation with Diffusion Models
Jonathan Ho
William Chan
Chitwan Saharia
Jay Whang
Ruiqi Gao
...
Diederik P. Kingma
Ben Poole
Mohammad Norouzi
David J. Fleet
Tim Salimans
VGen
441
1,862
0
05 Oct 2022
Progressive Text-to-Image Generation
Progressive Text-to-Image Generation
Zhengcong Fei
Mingyuan Fan
Li Zhu
Junshi Huang
301
4
0
05 Oct 2022
When and why vision-language models behave like bags-of-words, and what
  to do about it?
When and why vision-language models behave like bags-of-words, and what to do about it?International Conference on Learning Representations (ICLR), 2022
Mert Yuksekgonul
Federico Bianchi
Pratyusha Kalluri
Dan Jurafsky
James Zou
VLMCoGe
438
524
0
04 Oct 2022
Contrastive Multimodal Learning for Emergence of Graphical Sensory-Motor
  Communication
Contrastive Multimodal Learning for Emergence of Graphical Sensory-Motor Communication
Tristan Karch
Yoann Lemesle
Romain Laroche
Clément Moulin-Frier
Pierre-Yves Oudeyer
161
1
0
03 Oct 2022
Visual Prompt Tuning for Generative Transfer Learning
Visual Prompt Tuning for Generative Transfer LearningComputer Vision and Pattern Recognition (CVPR), 2022
Kihyuk Sohn
Yuan Hao
José Lezama
Luisa F. Polanía
Huiwen Chang
Han Zhang
Irfan Essa
Lu Jiang
VPVLMVLM
324
105
0
03 Oct 2022
Membership Inference Attacks Against Text-to-image Generation Models
Membership Inference Attacks Against Text-to-image Generation Models
Yixin Wu
Ning Yu
Zheng Li
Michael Backes
Yang Zhang
DiffM
197
79
0
03 Oct 2022
Red-Teaming the Stable Diffusion Safety Filter
Red-Teaming the Stable Diffusion Safety Filter
Javier Rando
Daniel Paleka
David Lindner
Lennard Heim
Florian Tramèr
DiffM
668
255
0
03 Oct 2022
Improving Sample Quality of Diffusion Models Using Self-Attention
  Guidance
Improving Sample Quality of Diffusion Models Using Self-Attention GuidanceIEEE International Conference on Computer Vision (ICCV), 2022
Susung Hong
Gyuseong Lee
Wooseok Jang
Seung Wook Kim
DiffM
529
146
0
03 Oct 2022
OCD: Learning to Overfit with Conditional Diffusion Models
OCD: Learning to Overfit with Conditional Diffusion ModelsInternational Conference on Machine Learning (ICML), 2022
Shahar Lutati
Lior Wolf
DiffM
347
11
0
02 Oct 2022
NeRF: Neural Radiance Field in 3D Vision, A Comprehensive Review
NeRF: Neural Radiance Field in 3D Vision, A Comprehensive Review
K. Gao
Yina Gao
Hongjie He
Dening Lu
Linlin Xu
Jonathan Li
672
53
0
01 Oct 2022
Protein structure generation via folding diffusion
Protein structure generation via folding diffusionNature Communications (Nat Commun), 2022
Kevin E. Wu
Kevin Kaichuang Yang
Rianne van den Berg
James Zou
Alex X. Lu
Ava P. Amini
DiffM
388
259
0
30 Sep 2022
TabDDPM: Modelling Tabular Data with Diffusion Models
TabDDPM: Modelling Tabular Data with Diffusion ModelsInternational Conference on Machine Learning (ICML), 2022
Akim Kotelnikov
Dmitry Baranchuk
Ivan Rubachev
Artem Babenko
DiffM
248
404
0
30 Sep 2022
AudioGen: Textually Guided Audio Generation
AudioGen: Textually Guided Audio GenerationInternational Conference on Learning Representations (ICLR), 2022
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
410
392
0
30 Sep 2022
Diffusion-based Image Translation using Disentangled Style and Content
  Representation
Diffusion-based Image Translation using Disentangled Style and Content RepresentationInternational Conference on Learning Representations (ICLR), 2022
Gihyun Kwon
Jong Chul Ye
DiffM
628
201
0
30 Sep 2022
Understanding Pure CLIP Guidance for Voxel Grid NeRF Models
Understanding Pure CLIP Guidance for Voxel Grid NeRF Models
Han-Hung Lee
Angel X. Chang
148
68
0
30 Sep 2022
State-specific protein-ligand complex structure prediction with a
  multi-scale deep generative model
State-specific protein-ligand complex structure prediction with a multi-scale deep generative model
Zhuoran Qiao
Weili Nie
Arash Vahdat
Thomas F. Miller
Anima Anandkumar
DiffM
245
140
0
30 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion
DreamFusion: Text-to-3D using 2D DiffusionInternational Conference on Learning Representations (ICLR), 2022
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
876
3,151
0
29 Sep 2022
Spotlight: Mobile UI Understanding using Vision-Language Models with a
  Focus
Spotlight: Mobile UI Understanding using Vision-Language Models with a FocusInternational Conference on Learning Representations (ICLR), 2022
Gang Li
Yang Li
345
82
0
29 Sep 2022
Human Motion Diffusion Model
Human Motion Diffusion ModelInternational Conference on Learning Representations (ICLR), 2022
Guy Tevet
Sigal Raab
Brian Gordon
Yonatan Shafir
Daniel Cohen-Or
Amit H. Bermano
DiffMVGen
677
1,043
0
29 Sep 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Make-A-Video: Text-to-Video Generation without Text-Video DataInternational Conference on Learning Representations (ICLR), 2022
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffMVGen
298
1,795
0
29 Sep 2022
Offline Reinforcement Learning via High-Fidelity Generative Behavior
  Modeling
Offline Reinforcement Learning via High-Fidelity Generative Behavior ModelingInternational Conference on Learning Representations (ICLR), 2022
Huayu Chen
Cheng Lu
Chengyang Ying
Hang Su
Jun Zhu
DiffMOffRL
395
159
0
29 Sep 2022
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Re-Imagen: Retrieval-Augmented Text-to-Image GeneratorInternational Conference on Learning Representations (ICLR), 2022
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
568
230
0
29 Sep 2022
Compositional Score Modeling for Simulation-based Inference
Compositional Score Modeling for Simulation-based InferenceInternational Conference on Machine Learning (ICML), 2022
Tomas Geffner
George Papamakarios
A. Mnih
391
40
0
28 Sep 2022
What Does DALL-E 2 Know About Radiology?
What Does DALL-E 2 Know About Radiology?Journal of Medical Internet Research (JMIR), 2022
Lisa Christine Adams
Felix Busch
Daniel Truhn
Marcus R. Makowski
Hugo J. W. L. Aerts
Keno K. Bressem
MedIm
159
71
0
27 Sep 2022
Learning to Learn with Generative Models of Neural Network Checkpoints
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
272
83
0
26 Sep 2022
A Collaborative, Interactive and Context-Aware Drawing Agent for
  Co-Creative Design
A Collaborative, Interactive and Context-Aware Drawing Agent for Co-Creative DesignIEEE Transactions on Visualization and Computer Graphics (TVCG), 2022
F. Ibarrola
Tomas Lawton
Kazjon Grace
159
25
0
26 Sep 2022
All are Worth Words: A ViT Backbone for Diffusion Models
All are Worth Words: A ViT Backbone for Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
553
499
0
25 Sep 2022
Face Super-Resolution Using Stochastic Differential Equations
Face Super-Resolution Using Stochastic Differential EquationsSIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), 2022
Marcelo dos Santos
Rayson Laroca
Rafael O. Ribeiro
João Neves
Hugo Proencca
David Menotti
DiffM
206
12
0
24 Sep 2022
A Case Report On The "A.I. Locked-In Problem": social concerns with
  modern NLP
A Case Report On The "A.I. Locked-In Problem": social concerns with modern NLP
Yoshija Walter
LLMAG
128
3
0
22 Sep 2022
Implementing and Experimenting with Diffusion Models for Text-to-Image
  Generation
Implementing and Experimenting with Diffusion Models for Text-to-Image Generation
Robin Zbinden
135
5
0
22 Sep 2022
Deep Lake: a Lakehouse for Deep Learning
Deep Lake: a Lakehouse for Deep LearningConference on Innovative Data Systems Research (CIDR), 2022
S. Hambardzumyan
Abhina Tuli
Levon Ghukasyan
Fariz Rahman
Hrant Topchyan
...
Mark McQuade
M. Harutyunyan
Tatevik Hakobyan
I. Stranic
Davit Buniatyan
214
30
0
22 Sep 2022
Extremely Simple Activation Shaping for Out-of-Distribution Detection
Extremely Simple Activation Shaping for Out-of-Distribution DetectionInternational Conference on Learning Representations (ICLR), 2022
Andrija Djurisic
Nebojsa Bozanic
Arjun Ashok
Rosanne Liu
OODD
412
201
0
20 Sep 2022
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis
Exploiting Cultural Biases via Homoglyphs in Text-to-Image SynthesisJournal of Artificial Intelligence Research (JAIR), 2022
Lukas Struppek
Dominik Hintersdorf
Felix Friedrich
Manuel Brack
P. Schramowski
Kristian Kersting
389
41
0
19 Sep 2022
Can There be Art Without an Artist?
Can There be Art Without an Artist?
A. Ghosh
Genoveva Fossas
192
32
0
16 Sep 2022
Does CLIP Know My Face?
Does CLIP Know My Face?Journal of Artificial Intelligence Research (JAIR), 2022
Dominik Hintersdorf
Lukas Struppek
Manuel Brack
Felix Friedrich
P. Schramowski
Kristian Kersting
VLM
261
17
0
15 Sep 2022
Brain Imaging Generation with Latent Diffusion Models
Brain Imaging Generation with Latent Diffusion Models
W. H. Pinaya
Petru-Daniel Tudosiu
J. Dafflon
P. F. D. Costa
Virginia Fernandez
P. Nachev
Sebastien Ourselin
M. Jorge Cardoso
DiffMMedIm
288
389
0
15 Sep 2022
Soft Diffusion: Score Matching for General Corruptions
Soft Diffusion: Score Matching for General Corruptions
Giannis Daras
M. Delbracio
Hossein Talebi
A. Dimakis
P. Milanfar
DiffM
302
121
0
12 Sep 2022
Previous
123...1001019899
Next