2112.10741
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
20 December 2021
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
Papers citing
"GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models"
50 / 2,594 papers shown
Title
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
28
16
0
05 Oct 2022
PLOT: Prompt Learning with Optimal Transport for Vision-Language Models
Guangyi Chen
Weiran Yao
Xiangchen Song
Xinyue Li
Yongming Rao
Kun Zhang
VPVLM
VLM
6
62
0
03 Oct 2022
Membership Inference Attacks Against Text-to-image Generation Models
Yixin Wu
Ning Yu
Zheng Li
Michael Backes
Yang Zhang
DiffM
8
65
0
03 Oct 2022
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance
Susung Hong
Gyuseong Lee
Wooseok Jang
Seung Wook Kim
DiffM
19
97
0
03 Oct 2022
AudioGen: Textually Guided Audio Generation
Felix Kreuk
Gabriel Synnaeve
Adam Polyak
Uriel Singer
Alexandre Défossez
Jade Copet
Devi Parikh
Yaniv Taigman
Yossi Adi
DiffM
17
288
0
30 Sep 2022
Understanding Pure CLIP Guidance for Voxel Grid NeRF Models
Han-Hung Lee
Angel X. Chang
14
63
0
30 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
47
2,304
0
29 Sep 2022
Human Motion Diffusion Model
Guy Tevet
Sigal Raab
Brian Gordon
Yonatan Shafir
Daniel Cohen-Or
Amit H. Bermano
DiffM
VGen
188
723
0
29 Sep 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffM
VGen
22
1,345
0
29 Sep 2022
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
114
161
0
29 Sep 2022
Compositional Score Modeling for Simulation-based Inference
Tomas Geffner
George Papamakarios
A. Mnih
60
24
0
28 Sep 2022
What Does DALL-E 2 Know About Radiology?
Lisa Christine Adams
Felix Busch
Daniel Truhn
Marcus R. Makowski
Hugo J. W. L. Aerts
Keno K. Bressem
MedIm
34
57
0
27 Sep 2022
Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided Diffusion
Nisha Huang
Fan Tang
Weiming Dong
Changsheng Xu
DiffM
64
40
0
27 Sep 2022
A Collaborative, Interactive and Context-Aware Drawing Agent for Co-Creative Design
F. Ibarrola
Tomas Lawton
Kazjon Grace
35
13
0
26 Sep 2022
Personalizing Text-to-Image Generation via Aesthetic Gradients
Víctor Gallego
42
16
0
25 Sep 2022
All are Worth Words: A ViT Backbone for Diffusion Models
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
14
314
0
25 Sep 2022
MIDMs: Matching Interleaved Diffusion Models for Exemplar-based Image Translation
Junyoung Seo
Gyuseong Lee
Seokju Cho
Jiyoung Lee
Seung Wook Kim
DiffM
21
27
0
22 Sep 2022
Implementing and Experimenting with Diffusion Models for Text-to-Image Generation
Robin Zbinden
22
3
0
22 Sep 2022
Text2Light: Zero-Shot Text-Driven HDR Panorama Generation
Zhaoxi Chen
Guangcong Wang
Ziwei Liu
88
30
0
20 Sep 2022
Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis
Lukas Struppek
Dominik Hintersdorf
Felix Friedrich
Manuel Brack
P. Schramowski
Kristian Kersting
66
26
0
19 Sep 2022
Does CLIP Know My Face?
Dominik Hintersdorf
Lukas Struppek
Manuel Brack
Felix Friedrich
P. Schramowski
Kristian Kersting
VLM
13
9
0
15 Sep 2022
Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models
Chen Henry Wu
Saman Motamed
Shaunak Srivastava
Fernando De la Torre
VLM
DiffM
6
34
0
14 Sep 2022
Diffusion Models in Vision: A Survey
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
M. Shah
DiffM
VLM
MedIm
191
1,133
0
10 Sep 2022
ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu
Peng Dai
Ruihui Li
Xiaojuan Qi
Chi-Wing Fu
DiffM
176
25
0
09 Sep 2022
FETA: Towards Specializing Foundation Models for Expert Task Applications
Amit Alfassy
Assaf Arbelle
Oshri Halimi
Sivan Harary
Roei Herzig
...
Christoph Auer
Kate Saenko
Peter W. J. Staar
Rogerio Feris
Leonid Karlinsky
21
19
0
08 Sep 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
30
838
0
07 Sep 2022
Statistical Foundation Behind Machine Learning and Its Impact on Computer Vision
Lei Zhang
H. Shum
VLM
SSL
8
2
0
06 Sep 2022
A Survey on Generative Diffusion Model
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
37
205
0
06 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Bin Cui
Ming-Hsuan Yang
DiffM
MedIm
224
1,296
0
02 Sep 2022
FLAME: Free-form Language-based Motion Synthesis & Editing
Jihoon Kim
Jiseob Kim
Sungjoon Choi
VGen
17
195
0
01 Sep 2022
SketchBetween: Video-to-Video Synthesis for Sprite Animation via Sketches
Dagmar Lukka Loftsdóttir
Matthew J. Guzdial
VGen
15
3
0
01 Sep 2022
MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model
Mingyuan Zhang
Zhongang Cai
Liang Pan
Fangzhou Hong
Xinying Guo
Lei Yang
Ziwei Liu
DiffM
VGen
24
538
0
31 Aug 2022
LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data
Jihye Park
Sunwoo Kim
Soohyun Kim
Seokju Cho
Jaejun Yoo
Youngjung Uh
Seung Wook Kim
VLM
28
9
0
31 Aug 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
15
90
0
29 Aug 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
14
2,688
0
25 Aug 2022
Diffusion Models Beat GANs on Topology Optimization
François Mazé
Faez Ahmed
DiffM
AI4CE
22
57
0
20 Aug 2022
Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise
Arpit Bansal
Eitan Borgnia
Hong-Min Chu
Jie S. Li
Hamid Kazemi
Furong Huang
Micah Goldblum
Jonas Geiping
Tom Goldstein
VLM
DiffM
30
262
0
19 Aug 2022
Text to Image Generation: Leaving no Language Behind
Pedro Reviriego
Elena Merino-Gómez
VLM
8
13
0
19 Aug 2022
Pathway to Future Symbiotic Creativity
Yi-Ting Guo
Qi-fei Liu
Jie Chen
Wei Xue
Jie Fu
...
Fernando Rosas
Jeffrey Shaw
Xing Wu
Jiji Zhang
Jianliang Xu
21
0
0
18 Aug 2022
Language-guided Semantic Style Transfer of 3D Indoor Scenes
Bu Jin
Beiwen Tian
Hao Zhao
Guyue Zhou
3DV
13
11
0
16 Aug 2022
Memory-Driven Text-to-Image Generation
Bowen Li
Philip H. S. Torr
Thomas Lukasiewicz
DiffM
19
12
0
15 Aug 2022
Layout-Bridging Text-to-Image Synthesis
Jiadong Liang
Wenjie Pei
Feng Lu
EGVM
14
15
0
12 Aug 2022
Language-Guided Face Animation by Recurrent StyleGAN-based Generator
Tiankai Hang
Huan Yang
Bei Liu
Jianlong Fu
Xin Geng
B. Guo
VGen
18
13
0
11 Aug 2022
Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP
Thao Nguyen
Gabriel Ilharco
Mitchell Wortsman
Sewoong Oh
Ludwig Schmidt
CLIP
VLM
38
97
0
10 Aug 2022
Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
Ting Chen
Ruixiang Zhang
Geoffrey E. Hinton
DiffM
27
209
0
08 Aug 2022
Pyramidal Denoising Diffusion Probabilistic Models
Dohoon Ryu
Jong Chul Ye
18
25
0
03 Aug 2022
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
11
17
0
03 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
38
1,689
0
02 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Y. Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
34
1,777
0
02 Aug 2022
Exploring the GLIDE model for Human Action-effect Prediction
Fangjun Li
David C. Hogg
Anthony G. Cohn
26
0
0
01 Aug 2022