Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2208.12242
Cited By
v1
v2 (latest)
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Computer Vision and Pattern Recognition (CVPR), 2022
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (12 upvotes)
Papers citing
"DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"
27 / 2,527 papers shown
Title
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLM
MoE
505
973
0
02 Nov 2022
DiffEdit: Diffusion-based semantic image editing with mask guidance
International Conference on Learning Representations (ICLR), 2022
Guillaume Couairon
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
353
642
0
20 Oct 2022
UniTune: Text-Driven Image Editing by Fine Tuning a Diffusion Model on a Single Image
ACM Transactions on Graphics (TOG), 2022
Dani Valevski
Matan Kalman
Eyal Molad
Eyal Segalis
Yossi Matias
Yaniv Leviathan
DiffM
228
53
0
17 Oct 2022
Imagic: Text-Based Real Image Editing with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2022
Bahjat Kawar
Shiran Zada
Oran Lang
Omer Tov
Hui-Tang Chang
Tali Dekel
Inbar Mosseri
Michal Irani
475
1,315
0
17 Oct 2022
DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models
Conference on Computer and Communications Security (CCS), 2022
Zeyang Sha
Zheng Li
Ning Yu
Yang Zhang
DiffM
180
192
0
13 Oct 2022
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu
Fernando de la Torre
DiffM
338
78
0
11 Oct 2022
Bridging CLIP and StyleGAN through Latent Alignment for Image Editing
Wanfeng Zheng
Qiang Li
Xiaoyan Guo
Pengfei Wan
Zhong-ming Wang
233
15
0
10 Oct 2022
Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains
Pierre J. Chambon
Christian Blüthgen
C. Langlotz
Akshay S. Chaudhari
DiffM
MedIm
LM&MA
154
133
0
09 Oct 2022
Efficient Diffusion Models for Vision: A Survey
Anwaar Ulhaq
Naveed Akhtar
MedIm
344
85
0
07 Oct 2022
Content-Based Search for Deep Generative Models
ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2022
Daohan Lu
Sheng-Yu Wang
Nupur Kumari
Rohan Agarwal
Mia Tang
David Bau
Jun-Yan Zhu
DiffM
SyDa
290
8
0
06 Oct 2022
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics
IEEE Robotics and Automation Letters (RA-L), 2022
Ivan Kapelyukh
Vitalis Vosylius
Edward Johns
LM&Ro
DiffM
436
174
0
05 Oct 2022
NeRF: Neural Radiance Field in 3D Vision, A Comprehensive Review
K. Gao
Yina Gao
Hongjie He
Dening Lu
Linlin Xu
Jonathan Li
572
53
0
01 Oct 2022
Re-Imagen: Retrieval-Augmented Text-to-Image Generator
International Conference on Learning Representations (ICLR), 2022
Wenhu Chen
Hexiang Hu
Chitwan Saharia
William W. Cohen
VLM
433
226
0
29 Sep 2022
Personalizing Text-to-Image Generation via Aesthetic Gradients
Víctor Gallego
166
17
0
25 Sep 2022
A Survey on Generative Diffusion Model
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
673
397
0
06 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
ACM Computing Surveys (ACM CSUR), 2022
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Tengjiao Wang
Ming-Hsuan Yang
DiffM
MedIm
1.2K
1,839
0
02 Sep 2022
LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data
Computer Vision and Pattern Recognition (CVPR), 2022
Jihye Park
Sunwoo Kim
Soohyun Kim
Seokju Cho
Jaejun Yoo
Youngjung Uh
Seung Wook Kim
VLM
390
12
0
31 Aug 2022
Pathway to Future Symbiotic Creativity
Yi-Ting Guo
Qi-fei Liu
Jie Chen
Wei Xue
Jie Fu
...
Fernando Rosas
Jeffrey Shaw
Xing Wu
Jiji Zhang
Jianliang Xu
229
0
0
18 Aug 2022
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
211
26
0
03 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
International Conference on Learning Representations (ICLR), 2022
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
429
2,395
0
02 Aug 2022
Complex Scene Image Editing by Scene Graph Comprehension
British Machine Vision Conference (BMVC), 2022
Zhongping Zhang
Huiwen He
Bryan A. Plummer
Z. Liao
Huayan Wang
DiffM
183
7
0
24 Mar 2022
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU
Image and Vision Computing (IVC), 2022
Junseok Oh
Donghwee Yoon
Injung Kim
304
2
0
28 Feb 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
554
77
0
27 Dec 2021
Making Images Real Again: A Comprehensive Survey on Deep Image Composition
Li Niu
Wenyan Cong
Liu Liu
Yan Hong
Bo Zhang
Jing Liang
Liqing Zhang
VLM
DiffM
CoGe
426
95
0
28 Jun 2021
Creativity and Machine Learning: A Survey
ACM Computing Surveys (CSUR), 2021
Giorgio Franceschelli
Mirco Musolesi
VLM
AI4CE
461
54
0
06 Apr 2021
Generating Novel Scene Compositions from Single Images and Videos
Computer Vision and Image Understanding (CVIU), 2021
V. Sushko
Dan Zhang
Juergen Gall
Anna Khoreva
GAN
257
16
0
24 Mar 2021
Semantic Object Accuracy for Generative Text-to-Image Synthesis
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Tobias Hinz
Stefan Heinrich
S. Wermter
EGVM
351
177
0
29 Oct 2019
Previous
1
2
3
...
49
50
51