ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.12242
  4. Cited By
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
v1v2 (latest)

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Computer Vision and Pattern Recognition (CVPR), 2022
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
ArXiv (abs)PDFHTMLHuggingFace (12 upvotes)

Papers citing "DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"

50 / 2,538 papers shown
Training-Free Layout Control with Cross-Attention Guidance
Training-Free Layout Control with Cross-Attention GuidanceIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Minghao Chen
Iro Laina
Andrea Vedaldi
DiffM
440
311
0
06 Apr 2023
Taming Encoder for Zero Fine-tuning Image Customization with
  Text-to-Image Diffusion Models
Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models
Xuhui Jia
Yang Zhao
Kelvin C. K. Chan
Yandong Li
Han-Ying Zhang
Boqing Gong
Tingbo Hou
Jian Shu
Yu-Chuan Su
DiffM
217
123
0
05 Apr 2023
Few-shot Semantic Image Synthesis with Class Affinity Transfer
Few-shot Semantic Image Synthesis with Class Affinity TransferComputer Vision and Pattern Recognition (CVPR), 2023
Marlene Careil
Jakob Verbeek
Stéphane Lathuilière
DiffM
172
6
0
05 Apr 2023
JPEG Compressed Images Can Bypass Protections Against AI Editing
JPEG Compressed Images Can Bypass Protections Against AI Editing
Pedro Sandoval-Segura
Jonas Geiping
Tom Goldstein
DiffM
172
16
0
05 Apr 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image
  Generation
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image GenerationComputer Vision and Pattern Recognition (CVPR), 2023
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shiníchi Satoh
224
77
0
04 Apr 2023
The Work Avatar Face-Off: Knowledge Worker Preferences for Realism in
  Meetings
The Work Avatar Face-Off: Knowledge Worker Preferences for Realism in MeetingsInternational Symposium on Mixed and Augmented Reality (ISMAR), 2023
Vrushank Phadnis
Kristin Moore
Mar Gonzalez-Franco
CVBM
140
5
0
03 Apr 2023
DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via
  Diffusion Models
DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Yukang Cao
Yan-Pei Cao
Kai Han
Ying Shan
Kwan-Yee K. Wong
DiffM
290
171
0
03 Apr 2023
Subject-driven Text-to-Image Generation via Apprenticeship Learning
Subject-driven Text-to-Image Generation via Apprenticeship LearningNeural Information Processing Systems (NeurIPS), 2023
Wenhu Chen
Hexiang Hu
Yandong Li
Nataniel Rui
Xuhui Jia
Ming-Wei Chang
William W. Cohen
DiffM
919
227
0
01 Apr 2023
A Closer Look at Parameter-Efficient Tuning in Diffusion Models
A Closer Look at Parameter-Efficient Tuning in Diffusion Models
Chendong Xiang
Fan Bao
Chongxuan Li
Hang Su
Jun Zhu
DiffM
127
17
0
31 Mar 2023
One-shot Unsupervised Domain Adaptation with Personalized Diffusion
  Models
One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models
Yasser Benigmim
Subhankar Roy
S. Essid
Vicky Kalogeiton
Stéphane Lathuilière
DiffM
268
44
0
31 Mar 2023
Reference-based Image Composition with Sketch via Structure-aware
  Diffusion Model
Reference-based Image Composition with Sketch via Structure-aware Diffusion Model
Kangyeol Kim
S. Park
Junsoo Lee
Jaegul Choo
DiffM
111
18
0
31 Mar 2023
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models
Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models
Wen Wang
Yan Jiang
K. Xie
Zide Liu
Hao Chen
Yue Cao
Xinlong Wang
Chunhua Shen
DiffMVGen
267
136
0
30 Mar 2023
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models
Eric Zhang
Kai Wang
Xingqian Xu
Zinan Lin
Humphrey Shi
DiffM
350
260
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image EditorComputer Vision and Pattern Recognition (CVPR), 2023
Vidit Goel
E. Peruzzo
Lezhi Li
Dejia Xu
Xingqian Xu
Andrii Zadaianchuk
Trevor Darrell
Zinan Lin
Humphrey Shi
DiffM
284
17
0
30 Mar 2023
Discriminative Class Tokens for Text-to-Image Diffusion Models
Discriminative Class Tokens for Text-to-Image Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Idan Schwartz
Vésteinn Snaebjarnarson
Hila Chefer
Robert Bamler
Serge Belongie
Lior Wolf
Sagie Benaim
398
12
0
30 Mar 2023
DiffCollage: Parallel Generation of Large Content with Diffusion Models
DiffCollage: Parallel Generation of Large Content with Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Qinsheng Zhang
Jiaming Song
Xun Huang
Yongxin Chen
Xuan Li
DiffM
256
107
0
30 Mar 2023
Bi-directional Training for Composed Image Retrieval via Text Prompt
  Learning
Bi-directional Training for Composed Image Retrieval via Text Prompt LearningIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Zheyuan Liu
Weixuan Sun
Yicong Hong
Damien Teney
Stephen Gould
305
54
0
29 Mar 2023
Your Diffusion Model is Secretly a Zero-Shot Classifier
Your Diffusion Model is Secretly a Zero-Shot ClassifierIEEE International Conference on Computer Vision (ICCV), 2023
Alexander C. Li
Mihir Prabhudesai
Shivam Duggal
Ellis L Brown
Deepak Pathak
DiffMVLM
697
309
0
28 Mar 2023
StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing
Senmao Li
Joost van de Weijer
Taihang Hu
Fahad Shahbaz Khan
Qibin Hou
Yaxing Wang
Jian Yang
DiffM
396
75
0
28 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
The Stable Signature: Rooting Watermarks in Latent Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Pierre Fernandez
Guillaume Couairon
Edouard Grave
Matthijs Douze
Teddy Furon
WIGM
331
298
0
27 Mar 2023
Anti-DreamBooth: Protecting users from personalized text-to-image
  synthesis
Anti-DreamBooth: Protecting users from personalized text-to-image synthesisIEEE International Conference on Computer Vision (ICCV), 2023
T. Le
Hao Phung
Thuan Hoang Nguyen
Quan Dao
Ngoc N. Tran
Anh Tran
353
134
0
27 Mar 2023
Training-free Content Injection using h-space in Diffusion Models
Training-free Content Injection using h-space in Diffusion ModelsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jaeseok Jeong
Mingi Kwon
Youngjung Uh
DiffM
276
39
0
27 Mar 2023
Zero-Shot Composed Image Retrieval with Textual Inversion
Zero-Shot Composed Image Retrieval with Textual InversionIEEE International Conference on Computer Vision (ICCV), 2023
Alberto Baldrati
Lorenzo Agnolucci
Marco Bertini
Marco Bertini
278
160
0
27 Mar 2023
DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion
DiffTAD: Temporal Action Detection with Proposal Denoising DiffusionIEEE International Conference on Computer Vision (ICCV), 2023
Sauradip Nag
Xiatian Zhu
Jiankang Deng
Yi-Zhe Song
Tao Xiang
DiffMVGen
273
31
0
27 Mar 2023
Human Preference Score: Better Aligning Text-to-Image Models with Human
  Preference
Human Preference Score: Better Aligning Text-to-Image Models with Human PreferenceIEEE International Conference on Computer Vision (ICCV), 2023
Xiaoshi Wu
Keqiang Sun
Feng Zhu
Rui Zhao
Jiaming Song
243
264
0
25 Mar 2023
Freestyle Layout-to-Image Synthesis
Freestyle Layout-to-Image SynthesisComputer Vision and Pattern Recognition (CVPR), 2023
Han Xue
Z. Huang
Qianru Sun
Li Song
Wenjun Zhang
DiffM
321
82
0
25 Mar 2023
CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D
  Scene Layout
CompoNeRF: Text-guided Multi-object Compositional NeRF with Editable 3D Scene Layout
Haotian Bai
Yiqi Lin
Hui Xiong
Sijia Li
H. Lu
Xiaodong Lin
Lin Wang
DiffM
296
62
0
24 Mar 2023
End-to-End Diffusion Latent Optimization Improves Classifier Guidance
End-to-End Diffusion Latent Optimization Improves Classifier GuidanceIEEE International Conference on Computer Vision (ICCV), 2023
Bram Wallace
Akash Gokul
Stefano Ermon
Nikhil Naik
459
104
0
23 Mar 2023
NOPE: Novel Object Pose Estimation from a Single Image
NOPE: Novel Object Pose Estimation from a Single ImageComputer Vision and Pattern Recognition (CVPR), 2023
Van Nguyen Nguyen
Thibault Groueix
Yinlin Hu
Mathieu Salzmann
Vincent Lepetit
206
40
0
23 Mar 2023
Ablating Concepts in Text-to-Image Diffusion Models
Ablating Concepts in Text-to-Image Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Nupur Kumari
Bin Zhang
Sheng-Yu Wang
Eli Shechtman
Richard Y. Zhang
Jun-Yan Zhu
VLM
478
282
0
23 Mar 2023
DreamBooth3D: Subject-Driven Text-to-3D Generation
DreamBooth3D: Subject-Driven Text-to-3D GenerationIEEE International Conference on Computer Vision (ICCV), 2023
Amit Raj
S. Kaza
Ben Poole
Michael Niemeyer
Nataniel Ruiz
...
Kfir Aberman
Michael Rubinstein
Jonathan T. Barron
Yuanzhen Li
Varun Jampani
DiffM
317
268
0
23 Mar 2023
ReVersion: Diffusion-Based Relation Inversion from Images
ReVersion: Diffusion-Based Relation Inversion from ImagesACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2023
Ziqi Huang
Tianxing Wu
Yuming Jiang
Kelvin C. K. Chan
Ziwei Liu
275
89
0
23 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video
  Generators
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video GeneratorsIEEE International Conference on Computer Vision (ICCV), 2023
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zinan Lin
Shant Navasardyan
Humphrey Shi
VGen
308
730
0
23 Mar 2023
Medical diffusion on a budget: textual inversion for medical image
  generation
Medical diffusion on a budget: textual inversion for medical image generationInternational Conference on Medical Imaging with Deep Learning (MIDL), 2023
B. D. Wilde
A. Saha
R. T. Broek
Henkjan Huisman
DiffMMedIm
204
22
0
23 Mar 2023
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing
  Diffusion Models
MagicFusion: Boosting Text-to-Image Generation Performance by Fusing Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Jing Zhao
Heliang Zheng
Chaoyue Wang
L. Lan
Wenjing Yang
VLM
330
24
0
23 Mar 2023
Democratising AI: Multiple Meanings, Goals, and Methods
Democratising AI: Multiple Meanings, Goals, and MethodsAAAI/ACM Conference on AI, Ethics, and Society (AIES), 2023
Elizabeth Seger
Aviv Ovadya
Ben Garfinkel
Divya Siddarth
Allan Dafoe
145
68
0
22 Mar 2023
Affordance Diffusion: Synthesizing Hand-Object Interactions
Affordance Diffusion: Synthesizing Hand-Object InteractionsComputer Vision and Pattern Recognition (CVPR), 2023
Yufei Ye
Xueting Li
Abhi Gupta
Shalini De Mello
Stan Birchfield
Jiaming Song
Shubham Tulsiani
Sifei Liu
DiffM
340
105
0
21 Mar 2023
Localizing Object-level Shape Variations with Text-to-Image Diffusion
  Models
Localizing Object-level Shape Variations with Text-to-Image Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Or Patashnik
Daniel Garibi
Idan Azuri
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
398
142
0
20 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
SVDiff: Compact Parameter Space for Diffusion Fine-TuningIEEE International Conference on Computer Vision (ICCV), 2023
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
668
367
0
20 Mar 2023
Discovering Interpretable Directions in the Semantic Latent Space of
  Diffusion Models
Discovering Interpretable Directions in the Semantic Latent Space of Diffusion ModelsIEEE International Conference on Automatic Face & Gesture Recognition (FG), 2023
René Haas
Inbar Huberman-Spiegelglas
Rotem Mulayoff
Stella Graßhof
Sami S. Brandt
T. Michaeli
DiffM
300
62
0
20 Mar 2023
Deep Image Fingerprint: Towards Low Budget Synthetic Image Detection and
  Model Lineage Analysis
Deep Image Fingerprint: Towards Low Budget Synthetic Image Detection and Model Lineage AnalysisIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Sergey Sinitsa
Ohad Fried
238
27
0
19 Mar 2023
A Recipe for Watermarking Diffusion Models
A Recipe for Watermarking Diffusion Models
Yunqing Zhao
Tianyu Pang
Chao Du
Xiao Yang
Ngai-Man Cheung
Min Lin
WIGM
341
152
0
17 Mar 2023
DialogPaint: A Dialog-based Image Editing Model
DialogPaint: A Dialog-based Image Editing Model
Jingxuan Wei
Shiyu Wu
Xin Jiang
Yequan Wang
KELMDiffM
202
6
0
17 Mar 2023
P+: Extended Textual Conditioning in Text-to-Image Generation
P+: Extended Textual Conditioning in Text-to-Image Generation
A. Voynov
Qinghao Chu
Daniel Cohen-Or
Kfir Aberman
VLMDiffM
364
244
0
16 Mar 2023
Unified Multi-Modal Latent Diffusion for Joint Subject and Text
  Conditional Image Generation
Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation
Yi Ma
Huan Yang
Wenjing Wang
Jianlong Fu
Jiaying Liu
136
71
0
16 Mar 2023
DIRE for Diffusion-Generated Image Detection
DIRE for Diffusion-Generated Image DetectionIEEE International Conference on Computer Vision (ICCV), 2023
Zhendong Wang
Jianmin Bao
Wen-gang Zhou
Weilun Wang
Hezhen Hu
Hong Chen
Houqiang Li
220
381
0
16 Mar 2023
Automatic Geo-alignment of Artwork in Children's Story Books
Automatic Geo-alignment of Artwork in Children's Story Books
Jakub J Dylag
V. Suarez
James Wald
Aneesha Amodini Uvara
DiffM
150
0
0
16 Mar 2023
Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a
  Single Image using Diffusion Models
Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models
D. Kothandaraman
Wanrong Zhu
Ming Lin
Dinesh Manocha
229
6
0
15 Mar 2023
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action
  Recognition with Language Knowledge
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language KnowledgeIEEE International Conference on Computer Vision (ICCV), 2023
Wei Lin
Leonid Karlinsky
Nina Shvetsova
Horst Possegger
Mateusz Koziñski
Yikang Shen
Rogerio Feris
Hilde Kuehne
Horst Bischof
VLM
368
49
0
15 Mar 2023
Highly Personalized Text Embedding for Image Manipulation by Stable
  Diffusion
Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion
Inhwa Han
Serin Yang
Taesung Kwon
Jong Chul Ye
DiffM
279
41
0
15 Mar 2023
Previous
123...4748495051
Next