ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.12242
  4. Cited By
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
v1v2 (latest)

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Computer Vision and Pattern Recognition (CVPR), 2022
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
ArXiv (abs)PDFHTMLHuggingFace (12 upvotes)

Papers citing "DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"

50 / 2,538 papers shown
From Competition to Synergy: Unlocking Reinforcement Learning for Subject-Driven Image Generation
From Competition to Synergy: Unlocking Reinforcement Learning for Subject-Driven Image Generation
Ziwei Huang
Ying Shu
Hao Fang
Quanyu Long
Wenya Wang
Qiushi Guo
Tiezheng Ge
Yaoyao Yu
EGVM
190
0
0
21 Oct 2025
ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization
ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization
Yuanhe Guo
Linxi Xie
Zhuoran Chen
Kangrui Yu
Ryan Po
Guandao Yang
Gordon Wetztein
Hongyi Wen
VLM
88
0
0
21 Oct 2025
Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model
Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model
Zhenxing Zhang
Jiayan Teng
Zhuoyi Yang
Tiankun Cao
C. Wang
Xiaohan Zhang
J. Tang
Dan Guo
Meng Wang
VGen
115
0
0
21 Oct 2025
Beyond Real Faces: Synthetic Datasets Can Achieve Reliable Recognition Performance without Privacy Compromise
Beyond Real Faces: Synthetic Datasets Can Achieve Reliable Recognition Performance without Privacy Compromise
Paweł Jakub Borsukiewicz
Fadi Boutros
Iyiola Emmanuel Olatunji
Charles Beumier
Wendkûuni C. Ouedraogo
Jacques Klein
Tegawende F. Bissyande
212
1
0
20 Oct 2025
Chimera: Compositional Image Generation using Part-based Concepting
Chimera: Compositional Image Generation using Part-based Concepting
Shivam Singh
Yiming Chen
Agneet Chatterjee
Amit Raj
James Hays
Yezhou Yang
Chitta Baral
DiffM
296
0
0
20 Oct 2025
Personalized Image Filter: Mastering Your Photographic Style
Personalized Image Filter: Mastering Your Photographic Style
Chengxuan Zhu
Shuchen Weng
Jiacong Fang
Peixuan Zhang
Si Li
Chao Xu
Boxin Shi
DiffM
157
0
0
19 Oct 2025
Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models
Noise Aggregation Analysis Driven by Small-Noise Injection: Efficient Membership Inference for Diffusion Models
Guo Li
Yuyang Yu
Xuemiao Xu
DiffM
139
0
0
18 Oct 2025
DiffusionX: Efficient Edge-Cloud Collaborative Image Generation with Multi-Round Prompt Evolution
DiffusionX: Efficient Edge-Cloud Collaborative Image Generation with Multi-Round Prompt Evolution
Yi Wei
Shunpu Tang
Liang Zhao
Qiangian Yang
120
0
0
18 Oct 2025
TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement
TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement
Haiyue Sun
Qingdong He
Jinlong Peng
Peng Tang
Jiangning Zhang
Junwei Zhu
Xiaobin Hu
Shuicheng Yan
DiffMVGen
116
0
0
18 Oct 2025
Face-MakeUpV2: Facial Consistency Learning for Controllable Text-to-Image Generation
Face-MakeUpV2: Facial Consistency Learning for Controllable Text-to-Image Generation
Dawei Dai
Yinxiu Zhou
Chenghang Li
Guolai Jiang
Chengfang Zhang
140
0
0
17 Oct 2025
The Face of Persuasion: Analyzing Bias and Generating Culture-Aware Ads
The Face of Persuasion: Analyzing Bias and Generating Culture-Aware Ads
Aysan Aghazadeh
Adriana Kovashka
DiffM
97
0
0
17 Oct 2025
LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal
LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal
Shr-Ruei Tsai
Wei-Cheng Chang
Jie-Ying Lee
Chih-Hai Su
Yu-Lun Liu
DiffM
181
5
0
17 Oct 2025
Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI
Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI
Zheng Huang
Enpei Zhang
Yinghao Cai
Weikang Qiu
Carl Yang
Elynn Chen
Xiang Zhang
Rex Ying
Dawei Zhou
Yujun Yan
DiffM
128
0
0
17 Oct 2025
Learning an Image Editing Model without Image Editing Pairs
Learning an Image Editing Model without Image Editing Pairs
Nupur Kumari
Sheng-Yu Wang
Nanxuan Zhao
Yotam Nitzan
Yuheng Li
Krishna Kumar Singh
Richard Zhang
Eli Shechtman
Jun-Yan Zhu
Xun Huang
DiffM
309
3
0
16 Oct 2025
Consistent text-to-image generation via scene de-contextualization
Consistent text-to-image generation via scene de-contextualization
Song Tang
Peihao Gong
Kunyu Li
Kai Guo
Boyu Wang
Mao Ye
Jianwei Zhang
X. Zhu
DiffM
126
0
0
16 Oct 2025
Salient Concept-Aware Generative Data Augmentation
Salient Concept-Aware Generative Data Augmentation
Tianchen Zhao
Xuanbai Chen
Zhihua Li
J. Fang
Dongsheng An
Xiang Xu
Zhuowen Tu
Yifan Xing
DiffM
206
0
0
16 Oct 2025
LoRAverse: A Submodular Framework to Retrieve Diverse Adapters for Diffusion Models
LoRAverse: A Submodular Framework to Retrieve Diverse Adapters for Diffusion Models
Mert Sonmezer
Matthew Zheng
Pinar Yanardag
DiffMMoMe
339
1
0
16 Oct 2025
WithAnyone: Towards Controllable and ID Consistent Image Generation
WithAnyone: Towards Controllable and ID Consistent Image Generation
H. Xu
Wei Cheng
Peng Xing
Yixiao Fang
Shuhan Wu
...
Xianfang Zeng
Daxin Jiang
Gang Yu
Xingjun Ma
Yu-Gang Jiang
DiffM
227
5
0
16 Oct 2025
MVCustom: Multi-View Customized Diffusion via Geometric Latent Rendering and Completion
MVCustom: Multi-View Customized Diffusion via Geometric Latent Rendering and Completion
Minjung Shin
Hyunin Cho
Sooyeon Go
Jin-Hwa Kim
Youngjung Uh
123
1
0
15 Oct 2025
SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion
SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion
Jungbin Cho
Minsu Kim
Jisoo Kim
Ce Zheng
László A. Jeni
Ming-Hsuan Yang
Youngjae Yu
Seonjoo Kim
DiffMVGenTTA
256
0
0
14 Oct 2025
FedMMKT:Co-Enhancing a Server Text-to-Image Model and Client Task Models in Multi-Modal Federated Learning
FedMMKT:Co-Enhancing a Server Text-to-Image Model and Client Task Models in Multi-Modal Federated Learning
Ningxin He
Yang Liu
Wei Sun
Xiaozhou Ye
Ye Ouyang
Tiegang Gao
Z. Zhang
99
0
0
14 Oct 2025
Point Prompting: Counterfactual Tracking with Video Diffusion Models
Point Prompting: Counterfactual Tracking with Video Diffusion Models
Ayush Shrivastava
Sanyam Mehta
Daniel Geng
Andrew Owens
DiffMVGen
129
1
0
13 Oct 2025
CharCom: Composable Identity Control for Multi-Character Story Illustration
CharCom: Composable Identity Control for Multi-Character Story Illustration
Zhongsheng Wang
Ming Lin
Zhedong Lin
Yaser Shakib
Qian Liu
Jiamou Liu
DiffM
130
0
0
11 Oct 2025
ReMix: Towards a Unified View of Consistent Character Generation and Editing
ReMix: Towards a Unified View of Consistent Character Generation and Editing
Benjia Zhou
Bin-Bin Fu
Pei Cheng
Y. Wang
Jiayuan Fan
Tao Chen
DiffM
117
0
0
11 Oct 2025
Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking
Accent-Invariant Automatic Speech Recognition via Saliency-Driven Spectrogram Masking
Mohammad Hossein Sameti
Sepehr Harfi Moridani
Ali Zarean
Hossein Sameti
184
6
0
10 Oct 2025
Cross-Sensor Touch Generation
Cross-Sensor Touch Generation
Samanta Rodriguez
Yiming Dou
Miquel Oller
Andrew Owens
Nima Fazeli
DiffM
104
0
0
10 Oct 2025
Multimodal Policy Internalization for Conversational Agents
Multimodal Policy Internalization for Conversational Agents
Zhenhailong Wang
Jiateng Liu
Amin Fazel
Ritesh Sarkhel
Xing Fan
Xiang Li
Chenlei Guo
Heng Ji
R. Sarikaya
LLMAG
158
1
0
10 Oct 2025
Few-shot multi-token DreamBooth with LoRa for style-consistent character generation
Few-shot multi-token DreamBooth with LoRa for style-consistent character generation
Ruben Pascual
Mikel Sesma-Sara
A. Jurio
D. Paternain
M. Galar
DiffMVGen
101
0
0
10 Oct 2025
InstructX: Towards Unified Visual Editing with MLLM Guidance
InstructX: Towards Unified Visual Editing with MLLM Guidance
Chong Mou
Qichao Sun
Yanze Wu
Pengze Zhang
Xinghui Li
Fulong Ye
Songtao Zhao
Qian He
MLLM
256
7
0
09 Oct 2025
Textual Entailment is not a Better Bias Metric than Token Probability
Textual Entailment is not a Better Bias Metric than Token Probability
Virginia K. Felkner
Allison Lim
Jonathan May
110
0
0
09 Oct 2025
StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance
StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance
Jaeseok Jeong
Junho Kim
Gayoung Lee
Yunjey Choi
Youngjung Uh
DiffM
172
2
0
08 Oct 2025
DreamOmni2: Multimodal Instruction-based Editing and Generation
DreamOmni2: Multimodal Instruction-based Editing and Generation
Bin Xia
Bohao Peng
Yuechen Zhang
Junjia Huang
Jiyang Liu
...
Chengyao Wang
Yitong Wang
Xinglong Wu
Bei Yu
Jiaya Jia
118
9
0
08 Oct 2025
Inconsistent Affective Reaction: Sentiment of Perception and Opinion in Urban Environments
Inconsistent Affective Reaction: Sentiment of Perception and Opinion in Urban EnvironmentsCAADRIA proceedings (CAADRIA), 2025
Jingfei Huang
Han Tu
211
0
0
08 Oct 2025
Teleportraits: Training-Free People Insertion into Any Scene
Teleportraits: Training-Free People Insertion into Any Scene
Jialu Gao
K J Joseph
Fernando de la Torre
DiffM
111
0
0
07 Oct 2025
SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation
SIGMA-GEN: Structure and Identity Guided Multi-subject Assembly for Image Generation
Oindrila Saha
Vojtech Krs
R. Měch
Subhransu Maji
Kevin Blackburn-Matzen
Matheus Gadelha
127
1
0
07 Oct 2025
Sparse deepfake detection promotes better disentanglement
Sparse deepfake detection promotes better disentanglement
Antoine Teissier
Marie Tahon
Nicolas Dugué
Aghilas Sini
213
1
0
07 Oct 2025
REAR: Rethinking Visual Autoregressive Models via Generator-Tokenizer Consistency Regularization
REAR: Rethinking Visual Autoregressive Models via Generator-Tokenizer Consistency Regularization
Qiyuan He
Y. Li
Haotian Ye
Jinghao Wang
Xinyao Liao
Pheng-Ann Heng
Stefano Ermon
James Zou
Angela Yao
DiffMVGen
232
2
0
06 Oct 2025
ConceptSplit: Decoupled Multi-Concept Personalization of Diffusion Models via Token-wise Adaptation and Attention Disentanglement
ConceptSplit: Decoupled Multi-Concept Personalization of Diffusion Models via Token-wise Adaptation and Attention Disentanglement
Habin Lim
Yeongseob Won
Juwon Seo
Gyeong-Moon Park
165
0
0
06 Oct 2025
SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder
SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder
Ronen Kamenetsky
Sara Dorfman
Daniel Garibi
Roni Paiss
Or Patashnik
Daniel Cohen-Or
DiffM
315
0
0
06 Oct 2025
Domain Generalization for Semantic Segmentation: A Survey
Domain Generalization for Semantic Segmentation: A Survey
Manuel Schwonberg
Hanno Gottschalk
OODAI4CE
140
1
0
03 Oct 2025
Latent Diffusion Unlearning: Protecting Against Unauthorized Personalization Through Trajectory Shifted Perturbations
Latent Diffusion Unlearning: Protecting Against Unauthorized Personalization Through Trajectory Shifted Perturbations
Naresh Kumar Devulapally
S. Agarwal
Tejas Gokhale
Vishnu Suresh Lokhande
DiffMAAML
416
0
0
03 Oct 2025
When and Where do Events Switch in Multi-Event Video Generation?
When and Where do Events Switch in Multi-Event Video Generation?
Ruotong Liao
Guowen Huang
Qing Cheng
Thomas Seidl
Daniel Cremers
Volker Tresp
DiffMVGen
213
0
0
03 Oct 2025
Continual Personalization for Diffusion Models
Continual Personalization for Diffusion Models
Yu-Chien Liao
Jr-Jen Chen
Chi-Pin Huang
Ci-Siang Lin
Meng-Lin Wu
Yu-Chun Wang
DiffM
130
0
0
02 Oct 2025
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
DragFlow: Unleashing DiT Priors with Region Based Supervision for Drag Editing
Zihan Zhou
Shilin Lu
Shuli Leng
Shaocong Zhang
Zhuming Lian
Xinlei Yu
A. Kong
DiffM
312
7
0
02 Oct 2025
Image Generation Based on Image Style Extraction
Image Generation Based on Image Style Extraction
Shuochen Chang
128
0
0
01 Oct 2025
IMAGEdit: Let Any Subject Transform
IMAGEdit: Let Any Subject Transform
Fei Shen
Weihao Xu
Rui Yan
Dong Zhang
Xiangbo Shu
Jinhui Tang
VGen
120
1
0
01 Oct 2025
EchoGen: Generating Visual Echoes in Any Scene via Feed-Forward Subject-Driven Auto-Regressive Model
EchoGen: Generating Visual Echoes in Any Scene via Feed-Forward Subject-Driven Auto-Regressive Model
Ruixiao Dong
Z. Wang
Keli Liu
Li Li
Ying Chen
Kai Li
Daowen Li
Houqiang Li
DiffMVGen
143
0
0
30 Sep 2025
GaussEdit: Adaptive 3D Scene Editing with Text and Image Prompts
GaussEdit: Adaptive 3D Scene Editing with Text and Image PromptsIEEE Transactions on Visualization and Computer Graphics (TVCG), 2025
Zhenyu Shu
Junlong Yu
Kai Chao
Shiqing Xin
Ligang Liu
3DGS
202
3
0
30 Sep 2025
OmniDFA: A Unified Framework for Open Set Synthesis Image Detection and Few-Shot Attribution
OmniDFA: A Unified Framework for Open Set Synthesis Image Detection and Few-Shot Attribution
Shiyu Wu
Shuyan Li
Jing Li
Jing Liu
Yequan Wang
156
0
0
30 Sep 2025
dParallel: Learnable Parallel Decoding for dLLMs
dParallel: Learnable Parallel Decoding for dLLMs
Zigeng Chen
Gongfan Fang
Xinyin Ma
Ruonan Yu
Xinchao Wang
115
10
0
30 Sep 2025
Previous
123456...495051
Next