Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
    VLM
arXiv: 2205.11487

Papers citing "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"

50 / 5,041 papers shown
MagicMix: Semantic Mixing with Diffusion Models
Jun Hao Liew
Hanshu Yan
Daquan Zhou
Jiashi Feng
DiffM
375
76
0
28 Oct 2022
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance
Wei Li
Xue Xu
Xinyan Xiao
Jiacheng Liu
Hu Yang
...
Zhanpeng Wang
Zhifan Feng
Qiaoqiao She
Yajuan Lyu
Hua Wu
505
31
0
28 Oct 2022
Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation with Wordless Training
Computer Vision and Pattern Recognition (CVPR), 2022
Junfan Lin
Jianlong Chang
Lingbo Liu
Guanbin Li
Qi Tian
Changan Chen
VGen
386
57
0
28 Oct 2022
Deep Generative Models on 3D Representations: A Survey
Zifan Shi
Sida Peng
Yinghao Xu
Andreas Geiger
Yiyi Liao
Yujun Shen
MedIm 3DV
322
0
0
27 Oct 2022
How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hritik Bansal
Da Yin
Masoud Monajatipoor
Kai-Wei Chang
212
126
0
27 Oct 2022
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Zijie J. Wang
Evan Montoya
David Munechika
Haoyang Yang
Benjamin Hoover
Duen Horng Chau
534
397
0
26 Oct 2022
Categorical SDEs with Simplex Diffusion
Pierre Harvey Richemond
Sander Dieleman
Arnaud Doucet
DiffM
209
32
0
26 Oct 2022
Full-band General Audio Synthesis with Score-based Diffusion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Santiago Pascual
Gautam Bhattacharya
Chunghsin Yeh
Jordi Pons
Joan Serrà
DiffM
225
39
0
26 Oct 2022
Towards the Detection of Diffusion Model Deepfakes
Jonas Ricker
Simon Damm
Thorsten Holz
Asja Fischer
DiffM
382
138
0
26 Oct 2022
Lafite2: Few-shot Text-to-Image Generation
Jiuxiang Gu
Chunyuan Li
Changyou Chen
Jianfeng Gao
Jinhui Xu
DiffM
204
14
0
25 Oct 2022
Vitruvio: 3D Building Meshes via Single Perspective Sketches
Alberto Tono
Heyaojing Huang
Ashwin Agrawal
Martin Fischer
265
6
0
24 Oct 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELM CLL
255
3
0
24 Oct 2022
High-Resolution Image Editing via Multi-Stage Blended Diffusion
J. Ackermann
Minjun Li
DiffM
145
16
0
24 Oct 2022
Instance-Aware Image Completion
Ji-Ho Cho
Minguk Kang
Vibhav Vineet
Jaesik Park
ISeg VLM
195
2
0
22 Oct 2022
Tools for Extracting Spatio-Temporal Patterns in Meteorological Image Sequences: From Feature Engineering to Attention-Based Neural Networks
A. S. Bansal
Yoonjin Lee
Kyle Hilburn
I. Ebert‐Uphoff
AI4TS
294
2
0
22 Oct 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yue Yang
Wenlin Yao
Hongming Zhang
Xiaoyang Wang
Dong Yu
Jianshu Chen
VLM
224
24
0
21 Oct 2022
Conditional Diffusion with Less Explicit Guidance via Model Predictive Control
Max W. Shen
Ehsan Hajiramezanali
Gabriele Scalia
Alex Tseng
N. Diamant
Tommaso Biancalani
Andreas Loukas
175
1
0
21 Oct 2022
Boomerang: Local sampling on image manifolds using diffusion models
Lorenzo Luzi
P. Mayer
Josue Casco-Rodriguez
Ali Siahkoohi
Richard G. Baraniuk
DiffM
356
21
0
21 Oct 2022
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
Vivian Liu
Jo Vermeulen
G. Fitzmaurice
Justin Matejka
HAI
279
156
0
20 Oct 2022
Composing Ensembles of Pre-trained Models via Iterative Consensus
International Conference on Learning Representations (ICLR), 2022
Shuang Li
Yilun Du
J. Tenenbaum
Antonio Torralba
Igor Mordatch
MoMe
162
31
0
20 Oct 2022
DiffEdit: Diffusion-based semantic image editing with mask guidance
International Conference on Learning Representations (ICLR), 2022
Guillaume Couairon
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
395
661
0
20 Oct 2022
OCR-VQGAN: Taming Text-within-Image Generation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Juan A. Rodriguez
David Vazquez
I. Laradji
M. Pedersoli
Pau Rodríguez López
276
30
0
19 Oct 2022
Language Models Understand Us, Poorly
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jared Moore
LRM
169
5
0
19 Oct 2022
DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022
Royi Rassin
Shauli Ravfogel
Yoav Goldberg
201
66
0
19 Oct 2022
Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models
Ricardo Kleinlein
Cristina Luna Jiménez
Fernando Fernández-Martínez
DiffM
146
2
0
19 Oct 2022
Differentially Private Diffusion Models
Tim Dockhorn
Tianshi Cao
Arash Vahdat
Karsten Kreis
DiffM
486
129
0
18 Oct 2022
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Rui Li
Weihua Li
Yi Yang
Hanyu Wei
Jianhua Jiang
Quan-wei Bai
DiffM
369
17
0
18 Oct 2022
UniTune: Text-Driven Image Editing by Fine Tuning a Diffusion Model on a Single Image
ACM Transactions on Graphics (TOG), 2022
Dani Valevski
Matan Kalman
Eyal Molad
Eyal Segalis
Yossi Matias
Yaniv Leviathan
DiffM
256
54
0
17 Oct 2022
Imagic: Text-Based Real Image Editing with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Bahjat Kawar
Shiran Zada
Oran Lang
Omer Tov
Hui-Tang Chang
Tali Dekel
Inbar Mosseri
Michal Irani
586
1,340
0
17 Oct 2022
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
International Conference on Learning Representations (ICLR), 2022
Shansan Gong
Mukai Li
Jiangtao Feng
Zhiyong Wu
Lingpeng Kong
429
459
0
17 Oct 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Neural Information Processing Systems (NeurIPS), 2022
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM MLLM CLIP
1.0K
4,555
0
16 Oct 2022
TransFusion: Transcribing Speech with Multinomial Diffusion
Matthew Baas
Kevin Eloff
Herman Kamper
DiffM
96
6
0
14 Oct 2022
Is synthetic data from generative models ready for image recognition?
International Conference on Learning Representations (ICLR), 2022
Ruifei He
Shuyang Sun
Xin Yu
Chuhui Xue
Wenqing Zhang
Juil Sock
Song Bai
Xiaojuan Qi
497
379
0
14 Oct 2022
MTEB: Massive Text Embedding Benchmark
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Niklas Muennighoff
Nouamane Tazi
L. Magne
Nils Reimers
1.0K
686
0
13 Oct 2022
The Hidden Uniform Cluster Prior in Self-Supervised Learning
International Conference on Learning Representations (ICLR), 2022
Mahmoud Assran
Randall Balestriero
Quentin Duval
Florian Bordes
Ishan Misra
Piotr Bojanowski
Pascal Vincent
Michael G. Rabbat
Nicolas Ballas
SSL
231
62
0
13 Oct 2022
DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models
Conference on Computer and Communications Security (CCS), 2022
Zeyang Sha
Zheng Li
Ning Yu
Yang Zhang
DiffM
218
202
0
13 Oct 2022
ImaginaryNet: Learning Object Detectors without Real Images and Annotations
International Conference on Learning Representations (ICLR), 2022
Minheng Ni
Zitong Huang
Kai-Hua Feng
W. Zuo
VLM
242
19
0
13 Oct 2022
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities
Journal of Machine Learning Research (JMLR), 2022
Brian Bartoldson
B. Kailkhura
Davis W. Blalock
317
64
0
13 Oct 2022
Self-Guided Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Vincent Tao Hu
David W. Zhang
Yuki M. Asano
Gertjan J. Burghouts
Cees G. M. Snoek
391
42
0
12 Oct 2022
LION: Latent Point Diffusion Models for 3D Shape Generation
Neural Information Processing Systems (NeurIPS), 2022
Fangyin Wei
Arash Vahdat
Francis Williams
Zan Gojcic
Or Litany
Sanja Fidler
Karsten Kreis
DiffM
358
626
0
12 Oct 2022
Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Chaerin Kong
D. Jeon
Oh-Hun Kwon
Nojun Kwak
DiffM
163
19
0
12 Oct 2022
Underspecification in Scene Description-to-Depiction Tasks
Ben Hutchinson
Jason Baldridge
Vinodkumar Prabhakaran
DiffM
221
39
0
11 Oct 2022
A generic diffusion-based approach for 3D human pose prediction in the wild
IEEE International Conference on Robotics and Automation (ICRA), 2022
Saeed Saadatnejad
Ali-Ahmad Rasekh
Mohammadreza Mofayezi
Yasamin Medghalchi
Sara Rajabzadeh
Taylor Mordan
Alexandre Alahi
DiffM
287
45
0
11 Oct 2022
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu
Fernando de la Torre
DiffM
376
79
0
11 Oct 2022
GENIE: Higher-Order Denoising Diffusion Solvers
Neural Information Processing Systems (NeurIPS), 2022
Tim Dockhorn
Arash Vahdat
Karsten Kreis
DiffM
345
141
0
11 Oct 2022
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
Spoken Language Technology Workshop (SLT), 2022
Matthew Baas
Herman Kamper
DiffM
176
10
0
11 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling
International Conference on Learning Representations (ICLR), 2022
Yuntian Deng
Noriyuki Kojima
Alexander M. Rush
DiffM
190
6
0
11 Oct 2022
f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation
Jiatao Gu
Shuangfei Zhai
Yizhe Zhang
Miguel Angel Bautista
J. Susskind
DiffM
228
32
0
10 Oct 2022
What the DAAM: Interpreting Stable Diffusion Using Cross Attention
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Raphael Tang
Linqing Liu
Akshat Pandey
Zhiying Jiang
Gefei Yang
K. Kumar
Pontus Stenetorp
Jimmy J. Lin
Ferhan Ture
587
229
0
10 Oct 2022
CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning
Shi-You Xu
VLM DiffM
203
18
0
10 Oct 2022