Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2205.11487
Cited By
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Neural Information Processing Systems (NeurIPS), 2022
23 May 2022
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
Emily L. Denton
Seyed Kamyar Seyed Ghasemipour
Burcu Karagol Ayan
S. S. Mahdavi
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding"
50 / 5,041 papers shown
MagicMix: Semantic Mixing with Diffusion Models
Jun Hao Liew
Hanshu Yan
Daquan Zhou
Jiashi Feng
DiffM
375
76
0
28 Oct 2022
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance
Wei Li
Xue Xu
Xinyan Xiao
Jiacheng Liu
Hu Yang
...
Zhanpeng Wang
Zhifan Feng
Qiaoqiao She
Yajuan Lyu
Hua Wu
505
31
0
28 Oct 2022
Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation with Wordless Training
Computer Vision and Pattern Recognition (CVPR), 2022
Junfan Lin
Jianlong Chang
Lingbo Liu
Guanbin Li
Guanbin Li
Qi Tian
Changan Chen
VGen
386
57
0
28 Oct 2022
Deep Generative Models on 3D Representations: A Survey
Zifan Shi
Sida Peng
Yinghao Xu
Andreas Geiger
Yiyi Liao
Yujun Shen
MedIm
3DV
322
0
0
27 Oct 2022
How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hritik Bansal
Da Yin
Masoud Monajatipoor
Kai-Wei Chang
212
126
0
27 Oct 2022
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Zijie J. Wang
Evan Montoya
David Munechika
Haoyang Yang
Benjamin Hoover
Duen Horng Chau
534
397
0
26 Oct 2022
Categorical SDEs with Simplex Diffusion
Pierre Harvey Richemond
Sander Dieleman
Arnaud Doucet
DiffM
209
32
0
26 Oct 2022
Full-band General Audio Synthesis with Score-based Diffusion
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Santiago Pascual
Gautam Bhattacharya
Chunghsin Yeh
Jordi Pons
Joan Serrà
DiffM
225
39
0
26 Oct 2022
Towards the Detection of Diffusion Model Deepfakes
Jonas Ricker
Simon Damm
Thorsten Holz
Asja Fischer
DiffM
382
138
0
26 Oct 2022
Lafite2: Few-shot Text-to-Image Generation
Jiuxiang Gu
Chunyuan Li
Changyou Chen
Jianfeng Gao
Jinhui Xu
DiffM
204
14
0
25 Oct 2022
Vitruvio: 3D Building Meshes via Single Perspective Sketches
Alberto Tono
Heyaojing Huang
Ashwin Agrawal
Martin Fischer
265
6
0
24 Oct 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELM
CLL
255
3
0
24 Oct 2022
High-Resolution Image Editing via Multi-Stage Blended Diffusion
J. Ackermann
Minjun Li
DiffM
145
16
0
24 Oct 2022
Instance-Aware Image Completion
Ji-Ho Cho
Minguk Kang
Vibhav Vineet
Jaesik Park
ISeg
VLM
195
2
0
22 Oct 2022
Tools for Extracting Spatio-Temporal Patterns in Meteorological Image Sequences: From Feature Engineering to Attention-Based Neural Networks
A. S. Bansal
Yoonjin Lee
Kyle Hilburn
I. Ebert‐Uphoff
AI4TS
294
2
0
22 Oct 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Yue Yang
Wenlin Yao
Hongming Zhang
Xiaoyang Wang
Dong Yu
Jianshu Chen
VLM
224
24
0
21 Oct 2022
Conditional Diffusion with Less Explicit Guidance via Model Predictive Control
Max W. Shen
Ehsan Hajiramezanali
Gabriele Scalia
Alex Tseng
N. Diamant
Tommaso Biancalani
Andreas Loukas
175
1
0
21 Oct 2022
Boomerang: Local sampling on image manifolds using diffusion models
Lorenzo Luzi
P. Mayer
Josue Casco-Rodriguez
Ali Siahkoohi
Richard G. Baraniuk
DiffM
356
21
0
21 Oct 2022
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
Vivian Liu
Jo Vermeulen
G. Fitzmaurice
Justin Matejka
HAI
279
156
0
20 Oct 2022
Composing Ensembles of Pre-trained Models via Iterative Consensus
International Conference on Learning Representations (ICLR), 2022
Shuang Li
Yilun Du
J. Tenenbaum
Antonio Torralba
Igor Mordatch
MoMe
162
31
0
20 Oct 2022
DiffEdit: Diffusion-based semantic image editing with mask guidance
International Conference on Learning Representations (ICLR), 2022
Guillaume Couairon
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
395
661
0
20 Oct 2022
OCR-VQGAN: Taming Text-within-Image Generation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Juan A. Rodriguez
David Vazquez
I. Laradji
M. Pedersoli
Pau Rodríguez López
276
30
0
19 Oct 2022
Language Models Understand Us, Poorly
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jared Moore
LRM
169
5
0
19 Oct 2022
DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022
Royi Rassin
Shauli Ravfogel
Yoav Goldberg
201
66
0
19 Oct 2022
Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models
Ricardo Kleinlein
Cristina Luna Jiménez
Fernando Fernández-Martínez
DiffM
146
2
0
19 Oct 2022
Differentially Private Diffusion Models
Tim Dockhorn
Tianshi Cao
Arash Vahdat
Karsten Kreis
DiffM
486
129
0
18 Oct 2022
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Rui Li
Weihua Li
Yi Yang
Hanyu Wei
Jianhua Jiang
Quan-wei Bai
DiffM
369
17
0
18 Oct 2022
UniTune: Text-Driven Image Editing by Fine Tuning a Diffusion Model on a Single Image
ACM Transactions on Graphics (TOG), 2022
Dani Valevski
Matan Kalman
Eyal Molad
Eyal Segalis
Yossi Matias
Yaniv Leviathan
DiffM
256
54
0
17 Oct 2022
Imagic: Text-Based Real Image Editing with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Bahjat Kawar
Shiran Zada
Oran Lang
Omer Tov
Hui-Tang Chang
Tali Dekel
Inbar Mosseri
Michal Irani
586
1,340
0
17 Oct 2022
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
International Conference on Learning Representations (ICLR), 2022
Shansan Gong
Mukai Li
Jiangtao Feng
Zhiyong Wu
Lingpeng Kong
429
459
0
17 Oct 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Neural Information Processing Systems (NeurIPS), 2022
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
1.0K
4,555
0
16 Oct 2022
TransFusion: Transcribing Speech with Multinomial Diffusion
Matthew Baas
Kevin Eloff
Herman Kamper
DiffM
96
6
0
14 Oct 2022
Is synthetic data from generative models ready for image recognition?
International Conference on Learning Representations (ICLR), 2022
Ruifei He
Shuyang Sun
Xin Yu
Chuhui Xue
Wenqing Zhang
Juil Sock
Song Bai
Xiaojuan Qi
497
379
0
14 Oct 2022
MTEB: Massive Text Embedding Benchmark
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2022
Niklas Muennighoff
Nouamane Tazi
L. Magne
Nils Reimers
1.0K
686
0
13 Oct 2022
The Hidden Uniform Cluster Prior in Self-Supervised Learning
International Conference on Learning Representations (ICLR), 2022
Mahmoud Assran
Randall Balestriero
Quentin Duval
Florian Bordes
Ishan Misra
Piotr Bojanowski
Pascal Vincent
Michael G. Rabbat
Nicolas Ballas
SSL
231
62
0
13 Oct 2022
DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models
Conference on Computer and Communications Security (CCS), 2022
Zeyang Sha
Zheng Li
Ning Yu
Yang Zhang
DiffM
218
202
0
13 Oct 2022
ImaginaryNet: Learning Object Detectors without Real Images and Annotations
International Conference on Learning Representations (ICLR), 2022
Minheng Ni
Zitong Huang
Kai-Hua Feng
W. Zuo
VLM
242
19
0
13 Oct 2022
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities
Journal of machine learning research (JMLR), 2022
Brian Bartoldson
B. Kailkhura
Davis W. Blalock
317
64
0
13 Oct 2022
Self-Guided Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Vincent Tao Hu
David W. Zhang
Yuki M. Asano
Gertjan J. Burghouts
Cees G. M. Snoek
391
42
0
12 Oct 2022
LION: Latent Point Diffusion Models for 3D Shape Generation
Neural Information Processing Systems (NeurIPS), 2022
Fangyin Wei
Arash Vahdat
Francis Williams
Zan Gojcic
Or Litany
Sanja Fidler
Karsten Kreis
DiffM
358
626
0
12 Oct 2022
Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Chaerin Kong
D. Jeon
Oh-Hun Kwon
Nojun Kwak
DiffM
163
19
0
12 Oct 2022
Underspecification in Scene Description-to-Depiction Tasks
Ben Hutchinson
Jason Baldridge
Vinodkumar Prabhakaran
DiffM
221
39
0
11 Oct 2022
A generic diffusion-based approach for 3D human pose prediction in the wild
IEEE International Conference on Robotics and Automation (ICRA), 2022
Saeed Saadatnejad
Ali-Ahmad Rasekh
Mohammadreza Mofayezi
Yasamin Medghalchi
Sara Rajabzadeh
Taylor Mordan
Alexandre Alahi
DiffM
287
45
0
11 Oct 2022
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu
Fernando de la Torre
DiffM
376
79
0
11 Oct 2022
GENIE: Higher-Order Denoising Diffusion Solvers
Neural Information Processing Systems (NeurIPS), 2022
Tim Dockhorn
Arash Vahdat
Karsten Kreis
DiffM
345
141
0
11 Oct 2022
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models
Spoken Language Technology Workshop (SLT), 2022
Matthew Baas
Herman Kamper
DiffM
176
10
0
11 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling
International Conference on Learning Representations (ICLR), 2022
Yuntian Deng
Noriyuki Kojima
Alexander M. Rush
DiffM
190
6
0
11 Oct 2022
f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation
Jiatao Gu
Shuangfei Zhai
Yizhe Zhang
Miguel Angel Bautista
J. Susskind
DiffM
228
32
0
10 Oct 2022
What the DAAM: Interpreting Stable Diffusion Using Cross Attention
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Raphael Tang
Linqing Liu
Akshat Pandey
Zhiying Jiang
Gefei Yang
K. Kumar
Pontus Stenetorp
Jimmy J. Lin
Ferhan Ture
587
229
0
10 Oct 2022
CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning
Shi-You Xu
VLM
DiffM
203
18
0
10 Oct 2022
Previous
1
2
3
...
100
101
97
98
99
Next
Page 98 of 101
Page
of 101
Go