ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.12242
  4. Cited By
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
v1v2 (latest)

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Computer Vision and Pattern Recognition (CVPR), 2022
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
ArXiv (abs)PDFHTMLHuggingFace (12 upvotes)

Papers citing "DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"

50 / 2,539 papers shown
Content-based Unrestricted Adversarial Attack
Content-based Unrestricted Adversarial AttackNeural Information Processing Systems (NeurIPS), 2023
Zhaoyu Chen
Yue Liu
Shuang Wu
Kaixun Jiang
Shouhong Ding
Wenqiang Zhang
DiffM
349
104
0
18 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized
  Attention
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized AttentionInternational Journal of Computer Vision (IJCV), 2023
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGenDiffM
331
349
0
17 May 2023
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
Preserve Your Own Correlation: A Noise Prior for Video Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Songwei Ge
Seungjun Nah
Guilin Liu
Tyler Poon
Andrew Tao
Bryan Catanzaro
David Jacobs
Jia-Bin Huang
Ming-Yuan Liu
Yogesh Balaji
DiffMVGen
387
299
0
17 May 2023
Generating coherent comic with rich story using ChatGPT and Stable
  Diffusion
Generating coherent comic with rich story using ChatGPT and Stable Diffusion
Ze Jin
Zorina Song
DiffM
131
16
0
16 May 2023
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao
Enze Xie
Lanqing Hong
Zhenguo Li
G. Lee
DiffMVGen
196
41
0
15 May 2023
Null-text Guidance in Diffusion Models is Secretly a Cartoon-style
  Creator
Null-text Guidance in Diffusion Models is Secretly a Cartoon-style CreatorACM Multimedia (ACM MM), 2023
Jing Zhao
Heliang Zheng
Chaoyue Wang
Long Lan
Wanrong Huang
Wenjing Yang
DiffM
313
14
0
11 May 2023
Visual Tuning
Visual TuningACM Computing Surveys (ACM Comput. Surv.), 2023
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
438
60
0
10 May 2023
iEdit: Localised Text-guided Image Editing with Weak Supervision
iEdit: Localised Text-guided Image Editing with Weak Supervision
Rumeysa Bodur
Erhan Gundogdu
Binod Bhattarai
Tae-Kyun Kim
M. Donoser
Loris Bazzani
DiffM
196
20
0
10 May 2023
Text-guided High-definition Consistency Texture Model
Text-guided High-definition Consistency Texture Model
Zhibin Tang
Tiantong He
DiffM
118
6
0
10 May 2023
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with
  Large Language Models
SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language ModelsACM Multimedia (ACM MM), 2023
Shan Zhong
Zhongzhan Huang
Wushao Wen
Jinghui Qin
Guanbin Li
376
51
0
09 May 2023
Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion
  Models
Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Wenkai Dong
Song Xue
Xiaoyue Duan
Shumin Han
DiffM
265
93
0
08 May 2023
Text-to-Image Diffusion Models can be Easily Backdoored through
  Multimodal Data Poisoning
Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data PoisoningACM Multimedia (ACM MM), 2023
Shengfang Zhai
Yinpeng Dong
Qingni Shen
Shih-Chieh Pu
Yuejian Fang
Hang Su
233
100
0
07 May 2023
AADiff: Audio-Aligned Video Synthesis with Text-to-Image Diffusion
AADiff: Audio-Aligned Video Synthesis with Text-to-Image Diffusion
Seungwoo Lee
Chaerin Kong
D. Jeon
Nojun Kwak
DiffM
280
24
0
06 May 2023
Towards Prompt-robust Face Privacy Protection via Adversarial Decoupling
  Augmentation Framework
Towards Prompt-robust Face Privacy Protection via Adversarial Decoupling Augmentation Framework
Ruijia Wu
Yuhang Wang
Huafeng Shi
Zhipeng Yu
Yichao Wu
Ding Liang
DiffM
184
11
0
06 May 2023
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven
  Text-to-Image Generation
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image GenerationInternational Conference on Learning Representations (ICLR), 2023
Hong Chen
Yipeng Zhang
Simin Wu
Xin Eric Wang
Xuguang Duan
Yuwei Zhou
Wenwu Zhu
DiffM
351
73
0
05 May 2023
Personalize Segment Anything Model with One Shot
Personalize Segment Anything Model with One ShotInternational Conference on Learning Representations (ICLR), 2023
Renrui Zhang
Zhengkai Jiang
Ziyu Guo
Shilin Yan
Junting Pan
Xianzheng Ma
Hao Dong
Shiyang Feng
Jiaming Song
MLLMVLM
396
295
0
04 May 2023
Multimodal-driven Talking Face Generation via a Unified Diffusion-based
  Generator
Multimodal-driven Talking Face Generation via a Unified Diffusion-based Generator
Chao Xu
Shaoting Zhu
Junwei Zhu
Alexander I. Rudnicky
Jiangning Zhang
Ying Tai
Yong Liu
DiffM
239
16
0
04 May 2023
Few-shot Domain-Adaptive Visually-fused Event Detection from Text
Few-shot Domain-Adaptive Visually-fused Event Detection from TextFusion (Fusion), 2023
Farhad Moghimifar
Fatemeh Shiri
Van Nguyen
Gholamreza Haffari
Yuanyou Li
VLM
223
4
0
04 May 2023
Key-Locked Rank One Editing for Text-to-Image Personalization
Key-Locked Rank One Editing for Text-to-Image PersonalizationInternational Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), 2023
Yoad Tewel
Rinon Gal
Gal Chechik
Yuval Atzmon
DiffM
432
217
0
02 May 2023
DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On
  without 3D Modeling
DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling
M. S. Seyfioglu
Karim Bouyarmane
Suren Kumar
A. Tavanaei
Ismail B. Tutar
DiffM
183
8
0
02 May 2023
In-Context Learning Unlocked for Diffusion Models
In-Context Learning Unlocked for Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Zhendong Wang
Lezhi Li
Yadong Lu
Yelong Shen
Pengcheng He
Weizhu Chen
Zinan Lin
Mingyuan Zhou
VLMDiffM
338
97
0
01 May 2023
Let the Chart Spark: Embedding Semantic Context into Chart with
  Text-to-Image Generative Model
Let the Chart Spark: Embedding Semantic Context into Chart with Text-to-Image Generative ModelIEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Shishi Xiao
Suizi Huang
Yue Lin
Yilin Ye
Weizhen Zeng
346
47
0
28 Apr 2023
Generating images of rare concepts using pre-trained diffusion models
Generating images of rare concepts using pre-trained diffusion modelsAAAI Conference on Artificial Intelligence (AAAI), 2023
Dvir Samuel
Rami Ben-Ari
Simon Raviv
N. Darshan
Gal Chechik
531
67
0
27 Apr 2023
Motion-Conditioned Diffusion Model for Controllable Video Synthesis
Motion-Conditioned Diffusion Model for Controllable Video Synthesis
Tsai-Shien Chen
C. Lin
Hung-Yu Tseng
Nayeon Lee
Ming-Hsuan Yang
DiffMVGen
399
90
0
27 Apr 2023
Seeing is not always believing: Benchmarking Human and Model Perception
  of AI-Generated Images
Seeing is not always believing: Benchmarking Human and Model Perception of AI-Generated ImagesNeural Information Processing Systems (NeurIPS), 2023
Zeyu Lu
Di Huang
Mengwei He
Jingjing Qu
Chengzhi Wu
Xihui Liu
Wanli Ouyang
265
96
0
25 Apr 2023
Exploring Compositional Visual Generation with Latent Classifier
  Guidance
Exploring Compositional Visual Generation with Latent Classifier Guidance
Changhao Shi
Haomiao Ni
Kaican Li
Shaobo Han
Mingfu Liang
Martin Renqiang Min
DiffM
297
13
0
25 Apr 2023
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion
  Models
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Zhendong Wang
Lezhi Li
Huangjie Zheng
Peihao Wang
Pengcheng He
Zinan Lin
Weizhu Chen
Mingyuan Zhou
253
160
0
25 Apr 2023
Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation
Hierarchical Diffusion Autoencoders and Disentangled Image ManipulationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Zeyu Lu
Chengyue Wu
Xinyuan Chen
Yaohui Wang
Junlin Wu
Yu Qiao
Xihui Liu
DiffM
271
21
0
24 Apr 2023
Speed Is All You Need: On-Device Acceleration of Large Diffusion Models
  via GPU-Aware Optimizations
Speed Is All You Need: On-Device Acceleration of Large Diffusion Models via GPU-Aware Optimizations
Yu-Hui Chen
Raman Sarokin
Juhyun Lee
Jiuqiang Tang
Chuo-Ling Chang
Andrei Kulik
Matthias Grundmann
VLM
276
55
0
21 Apr 2023
Building Multimodal AI Chatbots
Building Multimodal AI Chatbots
Mingyu Lee
156
3
0
21 Apr 2023
Collaborative Diffusion for Multi-Modal Face Generation and Editing
Collaborative Diffusion for Multi-Modal Face Generation and EditingComputer Vision and Pattern Recognition (CVPR), 2023
Ziqi Huang
Kelvin C. K. Chan
Yuming Jiang
Ziwei Liu
DiffM
229
154
0
20 Apr 2023
Image retrieval outperforms diffusion models on data augmentation
Image retrieval outperforms diffusion models on data augmentation
Max F. Burg
F. Wenzel
Dominik Zietlow
Max Horn
Osama Makansi
Francesco Locatello
Chris Russell
VLMDiffM
273
22
0
20 Apr 2023
UPGPT: Universal Diffusion Model for Person Image Generation, Editing
  and Pose Transfer
UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer
Soon Yau Cheong
A. Mustafa
Andrew Gilbert
DiffM
230
19
0
18 Apr 2023
Align your Latents: High-Resolution Video Synthesis with Latent
  Diffusion Models
Align your Latents: High-Resolution Video Synthesis with Latent Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2023
A. Blattmann
Robin Rombach
Huan Ling
Tim Dockhorn
Seung Wook Kim
Sanja Fidler
Karsten Kreis
3DGSVGen
610
1,440
0
18 Apr 2023
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image
  Synthesis and Editing
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and EditingIEEE International Conference on Computer Vision (ICCV), 2023
Ming Cao
Xintao Wang
Chen Ma
Ying Shan
Xiaohu Qie
Yinqiang Zheng
DiffM
232
680
0
17 Apr 2023
Identity Encoder for Personalized Diffusion
Identity Encoder for Personalized Diffusion
Yu-Chuan Su
Kelvin C. K. Chan
Yandong Li
Yang Zhao
Han-Ying Zhang
Boqing Gong
Jian Shu
Xuhui Jia
DiffM
208
10
0
14 Apr 2023
Text-Conditional Contextualized Avatars For Zero-Shot Personalization
Text-Conditional Contextualized Avatars For Zero-Shot Personalization
S. Azadi
Thomas Hayes
Akbar Shah
Guan Pang
Devi Parikh
Sonal Gupta
DiffM
145
4
0
14 Apr 2023
Delta Denoising Score
Delta Denoising ScoreIEEE International Conference on Computer Vision (ICCV), 2023
Amir Hertz
Kfir Aberman
Daniel Cohen-Or
DiffM
281
118
0
14 Apr 2023
One-Shot Stylization for Full-Body Human Images
One-Shot Stylization for Full-Body Human Images
Aiyu Cui
Svetlana Lazebnik
3DH
231
0
0
14 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Expressive Text-to-Image Generation with Rich TextIEEE International Conference on Computer Vision (ICCV), 2023
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
482
97
0
13 Apr 2023
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple
  Parameter-Efficient Fine-Tuning
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-TuningIEEE International Conference on Computer Vision (ICCV), 2023
Enze Xie
Lewei Yao
Han Shi
Zhili Liu
Daquan Zhou
Zhaoqiang Liu
Jiawei Li
Zhenguo Li
620
91
0
13 Apr 2023
PATMAT: Person Aware Tuning of Mask-Aware Transformer for Face
  Inpainting
PATMAT: Person Aware Tuning of Mask-Aware Transformer for Face InpaintingIEEE International Conference on Computer Vision (ICCV), 2023
Saman Motamed
Jianjin Xu
Chenhuan Wu
Fernando de la Torre
DiffM
279
4
0
12 Apr 2023
Continual Diffusion: Continual Customization of Text-to-Image Diffusion
  with C-LoRA
Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA
James Smith
Yen-Chang Hsu
Lingyu Zhang
Ting Hua
Z. Kira
Yilin Shen
Hongxia Jin
DiffM
449
144
0
12 Apr 2023
DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion
DreamPose: Fashion Image-to-Video Synthesis via Stable DiffusionIEEE International Conference on Computer Vision (ICCV), 2023
J. Karras
Aleksander Holynski
Ting-Chun Wang
Ira Kemelmacher-Shlizerman
DiffMVGen
363
205
0
12 Apr 2023
Gradient-Free Textual Inversion
Gradient-Free Textual InversionACM Multimedia (ACM MM), 2023
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
260
38
0
12 Apr 2023
CLIP Surgery for Better Explainability with Enhancement in
  Open-Vocabulary Tasks
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary TasksPattern Recognition (Pattern Recogn.), 2023
Yi Li
Hualiang Wang
Yiqun Duan
Xuelong Li
VLMMedImAAML
129
69
0
12 Apr 2023
NeAT: Neural Artistic Tracing for Beautiful Style Transfer
NeAT: Neural Artistic Tracing for Beautiful Style Transfer
Dan Ruta
Andrew Gilbert
John Collomosse
Eli Shechtman
Nicholas I. Kolkin
3DH
219
4
0
11 Apr 2023
EKILA: Synthetic Media Provenance and Attribution for Generative Art
EKILA: Synthetic Media Provenance and Attribution for Generative Art
Kar Balan
S. Agarwal
Simon Jenni
Andy Parsons
Andrew Gilbert
John Collomosse
195
17
0
10 Apr 2023
Defense-Prefix for Preventing Typographic Attacks on CLIP
Defense-Prefix for Preventing Typographic Attacks on CLIP
Hiroki Azuma
Yusuke Matsui
VLMAAML
293
25
0
10 Apr 2023
Towards Real-time Text-driven Image Manipulation with Unconditional
  Diffusion Models
Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models
Nikita Starodubcev
Dmitry Baranchuk
Valentin Khrulkov
Artem Babenko
DiffM
272
5
0
10 Apr 2023
Previous
123...464748495051
Next
Page 47 of 51
Pageof 51