ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.12242
  4. Cited By
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for
  Subject-Driven Generation
v1v2 (latest)

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Computer Vision and Pattern Recognition (CVPR), 2022
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
ArXiv (abs)PDFHTMLHuggingFace (12 upvotes)

Papers citing "DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"

50 / 2,527 papers shown
Title
LoRA meets Riemannion: Muon Optimizer for Parametrization-independent Low-Rank Adapters
LoRA meets Riemannion: Muon Optimizer for Parametrization-independent Low-Rank Adapters
Vladimir Bogachev
Vladimir Aletov
Alexander Molozhavenko
Denis Bobkov
Vera Soboleva
Aibek Alanov
Maxim Rakhuba
121
1
0
16 Jul 2025
TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update
TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update
Jeongyun Kim
Seunghoon Jeong
Giseop Kim
Myung-Hwan Jeon
Eunji Jun
Ayoung Kim
3DGS
154
1
0
15 Jul 2025
Implementing Adaptations for Vision AutoRegressive Model
Implementing Adaptations for Vision AutoRegressive Model
Kaif Shaikh
Franziska Boenisch
Adam Dziedzic
188
0
0
15 Jul 2025
Memory-Efficient Personalization of Text-to-Image Diffusion Models via Selective Optimization Strategies
Memory-Efficient Personalization of Text-to-Image Diffusion Models via Selective Optimization Strategies
Seokeon Choi
S. Park
Hyoungwoo Park
J. Kim
Sungrack Yun
139
1
0
14 Jul 2025
From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation
From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation
J. Kim
S. Park
Hyoungwoo Park
Sungrack Yun
Jaegul Choo
Seokeon Choi
DiffM
239
0
0
14 Jul 2025
Text Embedding Knows How to Quantize Text-Guided Diffusion Models
Text Embedding Knows How to Quantize Text-Guided Diffusion Models
H. Lee
Myungjun Son
Dongjea Kang
Seung-Won Jung
DiffMMQ
207
1
0
14 Jul 2025
SnapMoGen: Human Motion Generation from Expressive Texts
SnapMoGen: Human Motion Generation from Expressive Texts
Chuan Guo
Inwoo Hwang
Jian Wang
Bing Zhou
VGen
152
3
0
12 Jul 2025
Contrastive Conditional-Unconditional Alignment for Long-tailed Diffusion Model
Contrastive Conditional-Unconditional Alignment for Long-tailed Diffusion Model
Fang Chen
Alex Villa
Gongbo Liang
Xiaoyi Lu
Meng Tang
121
1
0
11 Jul 2025
When and Where do Data Poisons Attack Textual Inversion?
When and Where do Data Poisons Attack Textual Inversion?
Jeremy A. Styborski
Mingzhi Lyu
Jiayou Lu
Nupur Kapur
A. Kong
DiffMAAML
375
0
0
11 Jul 2025
QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation
QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation
Jiahui Yang
Yongjia Ma
Donglin Di
Hao Li
Wei Chen
Yan Xie
Jianxun Cui
Xun Yang
W. Zuo
MoMe
252
1
0
07 Jul 2025
CytoDiff: AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics
CytoDiff: AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics
Jan Carreras Boada
Rao Muhammad Umer
Carsten Marr
DiffMMedIm
99
0
0
07 Jul 2025
A Training-Free Style-Personalization via SVD-Based Feature Decomposition
A Training-Free Style-Personalization via SVD-Based Feature Decomposition
Kyoungmin Lee
Jihun Park
Jongmin Gim
Wonhyeok Choi
K. Hwang
Jaeyeul Kim
Sunghoon Im
DiffM
100
0
0
06 Jul 2025
RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation
RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation
Liheng Zhang
Lexi Pang
Hang Ye
Xiaoxuan Ma
Yizhou Wang
DiffM
235
0
0
03 Jul 2025
IC-Custom: Diverse Image Customization via In-Context Learning
IC-Custom: Diverse Image Customization via In-Context Learning
Yaowei Li
Xiaoyu Li
Zhaoyang Zhang
Yuxuan Bian
Gan Liu
...
Lingen Li
Jing Cai
Y. Zou
Yancheng He
Mingyu Ding
143
2
0
02 Jul 2025
Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization
Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization
Barış Batuhan Topal
Umut Özyurt
Zafer Doğan Budak
Ramazan Gokberk Cinbis
328
1
0
01 Jul 2025
Edit360: 2D Image Edits to 3D Assets from Any Angle
Edit360: 2D Image Edits to 3D Assets from Any Angle
Junchao Huang
Xinting Hu
Zhuotao Tian
Shaoshuai Shi
Li Jiang
VGen
240
4
0
01 Jul 2025
Parameter-aware high-fidelity microstructure generation using stable diffusion
Parameter-aware high-fidelity microstructure generation using stable diffusionAdvanced Engineering Informatics (AEI), 2025
Hoang Cuong Phan
Minh Tien Tran
Chihun Lee
Hoheok Kim
Sehyeok Oh
Dong-Kyu Kim
Ho Won Lee
DiffM
124
0
0
01 Jul 2025
Sim2Real Diffusion: Leveraging Foundation Vision Language Models for Adaptive Automated Driving
Sim2Real Diffusion: Leveraging Foundation Vision Language Models for Adaptive Automated Driving
Chinmay Vilas Samak
Tanmay Vilas Samak
Bing Li
Venkat Krovi
153
0
0
30 Jun 2025
OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions
OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions
Yuanhao Cai
Chentao Song
Xi Chen
Jinbo Xing
Yiwei Hu
...
Tianyu Wang
Y. Zhang
Xiaokang Yang
Zhe Lin
Alan Yuille
DiffMVGen
230
3
0
29 Jun 2025
Preserve Anything: Controllable Image Synthesis with Object Preservation
Preserve Anything: Controllable Image Synthesis with Object Preservation
Prasen Kumar Sharma
Neeraj Matiyali
Siddharth Srivastava
Gaurav Sharma
DiffM
170
0
0
27 Jun 2025
Mitigating Semantic Collapse in Generative Personalization with Test-Time Embedding Adjustment
Mitigating Semantic Collapse in Generative Personalization with Test-Time Embedding Adjustment
Anh Tuan Bui
Trang Vu
Trung Le
Junae Kim
Tamas Abraham
Rollin Omari
Amar Kaur
Dinh Q. Phung
164
0
0
27 Jun 2025
Orthogonal Finetuning Made Scalable
Orthogonal Finetuning Made Scalable
Zeju Qiu
Weiyang Liu
Adrian Weller
Bernhard Schölkopf
169
1
0
24 Jun 2025
OmniGen2: Exploration to Advanced Multimodal Generation
OmniGen2: Exploration to Advanced Multimodal Generation
Chenyuan Wu
PengFei Zheng
Ruiran Yan
Shitao Xiao
Xin Luo
...
Defu Lian
X. Wang
Zhongyuan Wang
Tiejun Huang
Zheng Liu
MLLMSyDaVLM
220
150
0
23 Jun 2025
RePIC: Reinforced Post-Training for Personalizing Multi-Modal Language Models
RePIC: Reinforced Post-Training for Personalizing Multi-Modal Language Models
Yeongtak Oh
J. Mok
Juhyeon Shin
Juhyeon Shin
Sangha Park
J. Mok
Sungroh Yoon
VLM
322
1
0
23 Jun 2025
Controllable and Expressive One-Shot Video Head Swapping
Controllable and Expressive One-Shot Video Head Swapping
Chaonan Ji
Jinwei Qi
Peng Zhang
Bang Zhang
Liefeng Bo
DiffMVGen
187
1
0
20 Jun 2025
VS-Singer: Vision-Guided Stereo Singing Voice Synthesis with Consistency Schrödinger Bridge
VS-Singer: Vision-Guided Stereo Singing Voice Synthesis with Consistency Schrödinger Bridge
Zijing Zhao
Kai Wang
Hao-Ming Huang
Ying Hu
Liang He
J. Yang
154
0
0
19 Jun 2025
Break Stylistic Sophon: Are We Really Meant to Confine the Imagination in Style Transfer?
Break Stylistic Sophon: Are We Really Meant to Confine the Imagination in Style Transfer?
Gary Song Yan
Yusen Zhang
Jinyu Zhao
Hao Zhang
Zhangping Yang
...
Tao Zhang
Yujie He
Siyuan Tian
Yao Gou
Min Li
DiffM
300
0
0
18 Jun 2025
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
Anirud Aggarwal
Abhinav Shrivastava
M. Gwilliam
352
0
0
18 Jun 2025
Control and Realism: Best of Both Worlds in Layout-to-Image without Training
Control and Realism: Best of Both Worlds in Layout-to-Image without Training
Bonan li
Yinhan Hu
Songhua Liu
Xinchao Wang
DiffM
212
2
0
18 Jun 2025
FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space
FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space
Black Forest Labs
Stephen Batifol
A. Blattmann
Frederic Boesel
Saksham Consul
...
Dustin Podell
Robin Rombach
Harry Saini
Axel Sauer
Luke Smith
DiffM
293
304
0
17 Jun 2025
Toward Rich Video Human-Motion2D Generation
Toward Rich Video Human-Motion2D Generation
Ruihao Xi
Xuekuan Wang
Yongcheng Li
Shuhua Li
Zichen Wang
Yiwei Wang
Feng Wei
Cairong Zhao
VGen
182
0
0
17 Jun 2025
Sharp Generalization Bounds for Foundation Models with Asymmetric Randomized Low-Rank Adapters
Sharp Generalization Bounds for Foundation Models with Asymmetric Randomized Low-Rank Adapters
Anastasis Kratsios
Tin Sum Cheng
Aurelien Lucchi
Haitz Sáez de Ocáriz Borde
194
1
0
17 Jun 2025
UltraZoom: Generating Gigapixel Images from Regular Photos
UltraZoom: Generating Gigapixel Images from Regular Photos
Jingwei Ma
V. Jayaram
Brian L. Curless
Ira Kemelmacher-Shlizerman
S. M. Seitz
3DGS
143
0
0
16 Jun 2025
Balancing Preservation and Modification: A Region and Semantic Aware Metric for Instruction-Based Image Editing
Balancing Preservation and Modification: A Region and Semantic Aware Metric for Instruction-Based Image Editing
Zhuoying Li
Zhu Xu
Yuxin Peng
Yang Liu
141
3
0
15 Jun 2025
EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction
EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction
Hsi-Che Lin
Yu-Chu Yu
Kai-Po Chang
Y. Wang
234
0
0
13 Jun 2025
Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning
Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter LearningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Chun-Mei Feng
Kai-An Yu
Xinxing Xu
Salman Khan
Rick Siow Mong Goh
Wangmeng Zuo
Yong Liu
VLM
258
0
0
12 Jun 2025
Improving Personalized Search with Regularized Low-Rank Parameter Updates
Improving Personalized Search with Regularized Low-Rank Parameter UpdatesComputer Vision and Pattern Recognition (CVPR), 2025
Fiona Ryan
Josef Sivic
Fabian Caba Heilbron
Judy Hoffman
James M. Rehg
Bryan C. Russell
207
1
0
11 Jun 2025
Geometric Regularity in Deterministic Sampling Dynamics of Diffusion-based Generative Models
Geometric Regularity in Deterministic Sampling Dynamics of Diffusion-based Generative Models
Defang Chen
Zhenyu Zhou
C. Wang
Siwei Lyu
DiffM
275
1
0
11 Jun 2025
Consistent Story Generation: Unlocking the Potential of Zigzag Sampling
Consistent Story Generation: Unlocking the Potential of Zigzag Sampling
Mingxiao Li
Mang Ning
Marie-Francine Moens
DiffM
396
0
0
11 Jun 2025
SPARKE: Scalable Prompt-Aware Diversity and Novelty Guidance in Diffusion Models via RKE Score
SPARKE: Scalable Prompt-Aware Diversity and Novelty Guidance in Diffusion Models via RKE Score
Mohammad Jalali
Haoyu Lei
Amin Gohari
Farzan Farnia
DiffM
287
1
0
11 Jun 2025
Only-Style: Stylistic Consistency in Image Generation without Content Leakage
Tilemachos Aravanis
P. Filntisis
Petros Maragos
George Retsinas
226
0
0
11 Jun 2025
ORIDa: Object-centric Real-world Image Composition DatasetComputer Vision and Pattern Recognition (CVPR), 2025
Jinwoo Kim
Sangmin Han
Jinho Jeong
Jiwoo Choi
Dongyoung Kim
Seon Joo Kim
136
2
0
10 Jun 2025
CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics
Shravan Nayak
Mehar Bhatia
Xiaofeng Zhang
Verena Rieser
Lisa Anne Hendricks
Sjoerd van Steenkiste
Yash Goyal
Karolina Stañczak
Aishwarya Agrawal
EGVM
306
4
0
10 Jun 2025
RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping
Yang Bai
Liudi Yang
George Eskandar
Fengyi Shen
Dong Chen
Mohammad Altillawi
Z. Liu
Gitta Kutyniok
VGen
174
0
0
10 Jun 2025
Dreamland: Controllable World Creation with Simulator and Generative Models
Dreamland: Controllable World Creation with Simulator and Generative Models
Sicheng Mo
Ziyang Leng
Leon Liu
Weizhen Wang
Honglin He
Bolei Zhou
VGen
107
1
0
09 Jun 2025
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy GenerationComputer Vision and Pattern Recognition (CVPR), 2025
H. Kim
Donghyun Kim
Suhyun Kim
DiffM
190
1
0
09 Jun 2025
Diffusion Counterfactual Generation with Semantic Abduction
Diffusion Counterfactual Generation with Semantic Abduction
Rajat Rasal
Avinash Kori
Fabio De Sousa Ribeiro
Tian Xia
Ben Glocker
DiffM
212
3
0
09 Jun 2025
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Teng Hu
Zhentao Yu
Zhengguang Zhou
Jiangning Zhang
Yuan Zhou
Qinglin Lu
Ran Yi
VGen
192
4
0
09 Jun 2025
Gradients: When Markets Meet Fine-tuning -- A Distributed Approach to Model Optimisation
Gradients: When Markets Meet Fine-tuning -- A Distributed Approach to Model Optimisation
Christopher Subia-Waud
142
0
0
09 Jun 2025
Consistent Video Editing as Flow-Driven Image-to-Video Generation
Consistent Video Editing as Flow-Driven Image-to-Video Generation
Ge Wang
Songlin Fan
Hangxu Liu
Quanjian Song
Hewei Wang
Jinfeng Xu
DiffMVGen
217
4
0
09 Jun 2025
Previous
123...678...495051
Next