Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2208.12242
Cited By
v1
v2 (latest)
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Computer Vision and Pattern Recognition (CVPR), 2022
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (12 upvotes)
Papers citing
"DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"
50 / 2,538 papers shown
Title
ORIDa: Object-centric Real-world Image Composition Dataset
Computer Vision and Pattern Recognition (CVPR), 2025
Jinwoo Kim
Sangmin Han
Jinho Jeong
Jiwoo Choi
Dongyoung Kim
Seon Joo Kim
188
2
0
10 Jun 2025
CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics
Shravan Nayak
Mehar Bhatia
Xiaofeng Zhang
Verena Rieser
Lisa Anne Hendricks
Sjoerd van Steenkiste
Yash Goyal
Karolina Stañczak
Aishwarya Agrawal
EGVM
350
4
0
10 Jun 2025
RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping
Yang Bai
Liudi Yang
George Eskandar
Fengyi Shen
Dong Chen
Mohammad Altillawi
Z. Liu
Gitta Kutyniok
VGen
232
0
0
10 Jun 2025
Diffusion Counterfactual Generation with Semantic Abduction
Rajat Rasal
Avinash Kori
Fabio De Sousa Ribeiro
Tian Xia
Ben Glocker
DiffM
220
3
0
09 Jun 2025
Consistent Video Editing as Flow-Driven Image-to-Video Generation
Ge Wang
Songlin Fan
Hangxu Liu
Quanjian Song
Hewei Wang
Jinfeng Xu
DiffM
VGen
233
4
0
09 Jun 2025
Evaluating Robustness in Latent Diffusion Models via Embedding Level Augmentation
Boris Martirosyan
Alexey Karmanov
DiffM
138
0
0
09 Jun 2025
Dreamland: Controllable World Creation with Simulator and Generative Models
Sicheng Mo
Ziyang Leng
Leon Liu
Weizhen Wang
Honglin He
Bolei Zhou
VGen
131
1
0
09 Jun 2025
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Teng Hu
Zhentao Yu
Zhengguang Zhou
Jiangning Zhang
Yuan Zhou
Qinglin Lu
Ran Yi
VGen
208
4
0
09 Jun 2025
Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation
Computer Vision and Pattern Recognition (CVPR), 2025
H. Kim
Donghyun Kim
Suhyun Kim
DiffM
222
1
0
09 Jun 2025
R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation
William Ljungbergh
Bernardo Taveira
Wenzhao Zheng
Adam Tonderski
Chensheng Peng
...
Christoffer Petersson
Michael Felsberg
Kurt Keutzer
Masayoshi Tomizuka
Wei Zhan
220
6
0
09 Jun 2025
Gradients: When Markets Meet Fine-tuning -- A Distributed Approach to Model Optimisation
Christopher Subia-Waud
170
0
0
09 Jun 2025
Self-Adapting Improvement Loops for Robotic Learning
Calvin Luo
Zilai Zeng
Mingxi Jia
Yilun Du
Chen Sun
155
1
0
07 Jun 2025
Noise Consistency Regularization for Improved Subject-Driven Image Synthesis
Yao Ni
Song Wen
Piotr Koniusz
A. Cherian
196
1
0
06 Jun 2025
Come Together, But Not Right Now: A Progressive Strategy to Boost Low-Rank Adaptation
Zhan Zhuang
Xiequn Wang
Wei Li
Yulong Zhang
Qiushi Huang
...
Yanbin Wei
Yuhe Nie
Kede Ma
Yu Zhang
Ying Wei
261
0
0
06 Jun 2025
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development
Zhenran Xu
Xue Yang
Yiyu Wang
Qingli Hu
Zijiao Wu
L. Wang
Weihua Luo
Kaifu Zhang
Baotian Hu
Min Zhang
LLMAG
241
4
0
05 Jun 2025
MARBLE: Material Recomposition and Blending in CLIP-Space
Computer Vision and Pattern Recognition (CVPR), 2025
Ta-Ying Cheng
Prafull Sharma
Mark Boss
Varun Jampani
DiffM
247
4
0
05 Jun 2025
AuthGuard: Generalizable Deepfake Detection via Language Guidance
Guangyu Shen
Zhihua Li
Xiang Xu
Tianchen Zhao
Zheng Zhang
Dongsheng An
Zhuowen Tu
Yifan Xing
Qin Zhang
192
1
0
04 Jun 2025
Is Perturbation-Based Image Protection Disruptive to Image Editing?
International Conference on Information Photonics (ICIP), 2025
Qiuyu Tang
Bonor Ayambem
Mooi Choo Chuah
Aparna Bharati
DiffM
280
1
0
04 Jun 2025
Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation
Chaehun Shin
Jooyoung Choi
J. Mok
Jungbeom Lee
Sungroh Yoon
DiffM
342
0
0
04 Jun 2025
FlexPainter: Flexible and Multi-View Consistent Texture Generation
Dongyu Yan
Leyi Wu
Jiantao Lin
Luozhou Wang
Tianshuo Xu
Zhifei Chen
Zhen Yang
Lie Xu
Shunsi Zhang
Yingcong Chen
DiffM
240
1
0
03 Jun 2025
EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models
Mingzhe Li
Gehao Zhang
Zhenting Wang
Guanhong Tao
Siqi Pan
Richard Cartwright
Juan Zhai
Shiqing Ma
DiffM
235
0
0
03 Jun 2025
Beyond Invisibility: Learning Robust Visible Watermarks for Stronger Copyright Protection
Conference on Uncertainty in Artificial Intelligence (UAI), 2025
Tianci Liu
Tong Yang
Quan Zhang
Qi Lei
WIGM
AAML
297
0
0
03 Jun 2025
PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples
Junyu Liu
R. K. Jones
Daniel E. Ritchie
DiffM
CoGe
310
2
0
03 Jun 2025
RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers
Yan Gong
Yiren Song
Yicheng Li
Chenglin Li
Yin Zhang
KELM
207
13
0
03 Jun 2025
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Yuan Gan
Jiaxu Miao
Yunze Wang
Yi Yang
AAML
DiffM
182
2
0
02 Jun 2025
Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks
Tao Yang
Ruibin Li
Yangming Shi
Yuqi Zhang
Qide Dong
Haoran Cheng
Weiguo Feng
Shilei Wen
Bingyue Peng
Lei Zhang
DiffM
VGen
264
0
0
02 Jun 2025
Efficiency without Compromise: CLIP-aided Text-to-Image GANs with Increased Diversity
Yuya Kobayashi
Yuhta Takida
Takashi Shibuya
Yuki Mitsufuji
DiffM
243
0
0
02 Jun 2025
Minimal Impact ControlNet: Advancing Multi-ControlNet Integration
International Conference on Learning Representations (ICLR), 2025
Shikun Sun
Min Zhou
Zixuan Wang
Xubin Li
Bo Xiao
Zijie Ye
Xiaoyu Qin
Junliang Xing
Bo Zheng
J. Jia
224
0
0
02 Jun 2025
Dual-Process Image Generation
Grace Luo
Jonathan Granskog
Aleksander Holynski
Trevor Darrell
VLM
283
6
0
02 Jun 2025
G4Seg: Generation for Inexact Segmentation Refinement with Diffusion Models
Tianjiao Zhang
Fei Zhang
Jiangchao Yao
Ya Zhang
Yanfeng Wang
DiffM
341
4
0
02 Jun 2025
TaxaDiffusion: Progressively Trained Diffusion Model for Fine-Grained Species Generation
Amin Karimi Monsefi
Mridul Khurana
R. Ramnath
Anuj Karpatne
Wei-Lun Chao
Cheng Zhang
273
3
0
02 Jun 2025
WorldExplorer: Towards Generating Fully Navigable 3D Scenes
Manuel-Andreas Schneider
Lukas Höllein
Matthias Nießner
VGen
249
8
0
02 Jun 2025
Parallel Rescaling: Rebalancing Consistency Guidance for Personalized Diffusion Models
Jungwoo Chae
J. Kim
Sangheum Hwang
DiffM
141
0
0
31 May 2025
MotionPersona: Characteristics-aware Locomotion Control
Mingyi Shi
Wei Liu
Jidong Mei
Wangpok Tse
Rui Chen
Xuelin Chen
Taku Komura
VGen
183
0
0
30 May 2025
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Computer Vision and Pattern Recognition (CVPR), 2025
Jinlu Zhang
Yixin Chen
Zan Wang
Jie Yang
Yizhou Wang
Siyuan Huang
260
6
0
30 May 2025
MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement
Yufan Deng
Xun Guo
Yuanyang Yin
Yizhi Wang
Yiding Yang
...
Shenghai Yuan
Angtian Wang
Bo Liu
Haibin Huang
Chongyang Ma
DiffM
VGen
VOS
286
4
0
29 May 2025
EquiReg: Equivariance Regularized Diffusion for Inverse Problems
Bahareh Tolooshams
Aditi Chandrashekar
Rayhan Zirvi
Abbas Mammadov
Jiachen Yao
Chuwei Wang
Julius Berner
DiffM
238
2
0
29 May 2025
Generating Fit Check Videos with a Handheld Camera
B. Chen
Brian L. Curless
Ira Kemelmacher-Shlizerman
Steven M. Seitz
DiffM
213
0
0
29 May 2025
MAP: Revisiting Weight Decomposition for Low-Rank Adaptation
Chongjie Si
Zhiyi Shi
Yadao Wang
Yunbo Wang
Susanto Rahardja
Wei Shen
268
1
0
29 May 2025
Fooling the Watchers: Breaking AIGC Detectors via Semantic Prompt Attacks
Run Hao
Peng Ying
347
0
0
29 May 2025
LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers
Yusuf Dalva
Hidir Yesiltepe
Pinar Yanardag
OffRL
247
5
0
29 May 2025
GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion
Gwanghyun Kim
Xueting Li
Ye Yuan
Koki Nagano
Tianye Li
Jan Kautz
Se Young Chun
Umar Iqbal
DiffM
206
0
0
29 May 2025
Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis
H. Cao
Yutong Feng
Biao Gong
Yijing Tian
Yunhong Lu
Chuang Liu
Bin Wang
DiffM
VGen
186
3
0
29 May 2025
PALADIN : Robust Neural Fingerprinting for Text-to-Image Diffusion Models
Murthy L
Subarna Tripathi
209
0
0
28 May 2025
What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?
Jinhong Ni
Chang-Bin Zhang
Qiang Zhang
Jing Zhang
MDE
181
5
0
28 May 2025
Identity-Preserving Text-to-Image Generation via Dual-Level Feature Decoupling and Expert-Guided Fusion
Kewen Chen
Xiaobin Hu
Wenqi Ren
DiffM
216
2
0
28 May 2025
One-Way Ticket:Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
S. Li
Lei Wang
Kai Wang
Tao Liu
J. Xie
Joost van de Weijer
Fahad Shahbaz Khan
Shiqi Yang
Yaxing Wang
Zhiqiang Wang
259
4
0
28 May 2025
SineLoRA
Δ
Δ
Δ
: Sine-Activated Delta Compression
Cameron Gordon
Yiping Ji
Hemanth Saratchandran
Paul Albert
Simon Lucey
MQ
332
0
0
28 May 2025
AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment
Yiheng Lin
Shifang Zhao
Ting Liu
Xiaochao Qu
Luoqi Liu
Yao Zhao
Yunchao Wei
DiffM
179
1
0
28 May 2025
Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects
Wei Li
Hebei Li
Yansong Peng
Siying Wu
Yueyi Zhang
Xiaoyan Sun
DiffM
308
1
0
27 May 2025
Previous
1
2
3
...
7
8
9
...
49
50
51
Next