Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2208.12242
Cited By
v1
v2 (latest)
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Computer Vision and Pattern Recognition (CVPR), 2022
25 August 2022
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (12 upvotes)
Papers citing
"DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"
50 / 2,538 papers shown
OrienText: Surface Oriented Textual Image Generation
Shubham Paliwal
Arushi Jain
Monika Sharma
Vikram Jamwal
Lovekesh Vig
DiffM
925
1
0
27 May 2025
ConsiStyle: Style Diversity in Training-Free Consistent T2I Generation
Yohai Mazuz
Janna Bruner
Lior Wolf
DiffM
238
0
0
27 May 2025
DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization
Shamil Ayupov
M. Nakhodnov
Anastasia Yaschenko
Andrey Kuznetsov
Aibek Alanov
219
1
0
27 May 2025
EF-VI: Enhancing End-Frame Injection for Video Inbetweening
Liuhan Chen
Xiaodong Cun
Xiaoyu Li
Xianyi He
Shenghai Yuan
Jie Chen
Mingyu Ding
Lichao Sun
VGen
284
0
0
27 May 2025
Geometry-Editable and Appearance-Preserving Object Compositon
Jianman Lin
Haojie Li
Chunmei Qing
Zhijing Yang
Liang Lin
Tianshui Chen
194
2
0
27 May 2025
DenseLoRA: Dense Low-Rank Adaptation of Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Lin Mu
Xiaoyu Wang
Li Ni
Yang Li
Zhize Wu
Peiquan Jin
Yiwen Zhang
ALM
AI4CE
135
0
0
27 May 2025
StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation
Yi Wu
Lingting Zhu
Shengju Qian
Lei Liu
Wandi Qiao
Lequan Yu
Bin Li
238
3
0
26 May 2025
In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation
Yu Xu
Fan Tang
You Wu
Lin Gao
Oliver Deussen
Hongbin Yan
Jintao Li
Juan Cao
Tong-Yee Lee
DiffM
208
2
0
26 May 2025
PiCa: Parameter-Efficient Fine-Tuning with Column Space Projection
Junseo Hwang
Wonguk Cho
Taesup Kim
271
0
0
26 May 2025
MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models
Hang Hua
Ziyun Zeng
Yizhi Song
Yunlong Tang
Liu He
Daniel G. Aliaga
Wei Xiong
Jiebo Luo
EGVM
393
2
0
26 May 2025
Structure Disruption: Subverting Malicious Diffusion-Based Inpainting via Self-Attention Query Perturbation
Yuhao He
Jinyu Tian
Haiwei Wu
Jianqing Li
DiffM
AAML
255
0
0
26 May 2025
Regularized Personalization of Text-to-Image Diffusion Models without Distributional Drift
Gihoon Kim
Hyungjin Park
Taesup Kim
DiffM
VLM
476
0
0
26 May 2025
Towards Robust Influence Functions with Flat Validation Minima
International Conference on Machine Learning (ICML), 2025
Xichen Ye
Yifan Wu
Weizhong Zhang
Cheng Jin
Yifan Chen
TDI
329
3
0
25 May 2025
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
H. Zhang
Dexiang Hong
Maoke Yang
Yutao Chen
Zhao Zhang
Jie Shao
Xinglong Wu
Zuxuan Wu
Yu Jiang
DiffM
AI4CE
519
12
0
25 May 2025
Jodi: Unification of Visual Generation and Understanding via Joint Modeling
Yifeng Xu
Zhenliang He
Meina Kan
Shiguang Shan
Xilin Chen
VLM
332
1
0
25 May 2025
Affective Image Editing: Shaping Emotional Factors via Text Descriptions
Peixuan Zhang
Shuchen Weng
Chengxuan Zhu
Binghao Tang
Zijian Jia
Si Li
Boxin Shi
DiffM
162
2
0
24 May 2025
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation
Wenchao Zhang
Jiahe Tian
Runze He
Jizhong Han
Jiao Dai
Miaomiao Feng
Wei Mi
Xiaodan Zhang
265
0
0
24 May 2025
StyleGuard: Preventing Text-to-Image-Model-based Style Mimicry Attacks by Style Perturbations
Yanjie Li
Wenxuan Zhang
Xinqi Lyu
Yihao Liu
Bin Xiao
AAML
DiffM
WIGM
529
0
0
24 May 2025
OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data
Yiren Song
Cheng Liu
Mike Zheng Shou
DiffM
408
10
0
24 May 2025
Localizing Knowledge in Diffusion Transformers
Arman Zarei
Samyadeep Basu
Keivan Rezaei
Zihao Lin
Sayan Nag
Soheil Feizi
302
1
0
24 May 2025
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
Yilang Zhang
Bingcong Li
G. Giannakis
613
2
0
24 May 2025
T2VUnlearning: A Concept Erasing Method for Text-to-Video Diffusion Models
Xiaoyu Ye
Songjie Cheng
Yongtao Wang
Yajiao Xiong
Yishen Li
DiffM
420
3
0
23 May 2025
Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation
Zhihua Liu
Amrutha Saseendran
Lei Tong
Xilin He
Fariba Yousefi
...
Dino Oglic
Tom Diethe
Philip Teare
Huiyu Zhou
Chen Jin
VLM
605
3
0
23 May 2025
HOFT: Householder Orthogonal Fine-tuning
Alejandro Moreno Arcas
Albert Sanchis
Jorge Civera
Alfons Juan
316
0
0
22 May 2025
Style Transfer with Diffusion Models for Synthetic-to-Real Domain Adaptation
Computer Vision and Image Understanding (CVIU), 2025
Estelle Chigot
Dennis G. Wilson
Meriem Ghrib
Thomas Oberlin
DiffM
299
6
0
22 May 2025
Forward-only Diffusion Probabilistic Models
Ziwei Luo
Fredrik K. Gustafsson
Jens Sjölund
Thomas B. Schön
405
0
0
22 May 2025
Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On
International Conference on Learning Representations (ICLR), 2025
Siqi Wan
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
505
4
0
22 May 2025
Erased or Dormant? Rethinking Concept Erasure Through Reversibility
Ping Liu
Fangqiu Yi
KELM
415
1
0
22 May 2025
CDST: Color Disentangled Style Transfer for Universal Style Reference Customization
Shiwen Zhang
Zhuowei Chen
Lang Chen
Yanze Wu
186
0
0
22 May 2025
FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion
Kazuaki Mishima
Antoni Bigata Casademunt
Stavros Petridis
Maja Pantic
Kenji Suzuki
DiffM
454
1
0
21 May 2025
OmniStyle: Filtering High Quality Style Transfer Data at Scale
Computer Vision and Pattern Recognition (CVPR), 2025
Ye Wang
Ruiqi Liu
Jiang Lin
Fei Liu
Zili Yi
Yilin Wang
Rui Ma
302
10
0
20 May 2025
Replace in Translation: Boost Concept Alignment in Counterfactual Text-to-Image
Sifan Li
Ming Tao
Hao Zhao
Ling Shao
Hao Tang
DiffM
356
0
0
20 May 2025
Is Artificial Intelligence Generated Image Detection a Solved Problem?
Wandi Qiao
Jiazhen Yan
Ziwen He
Kai Zeng
Weiwei Jiang
Lizhi Xiong
Zhangjie Fu
AAML
276
14
0
18 May 2025
Guiding Diffusion with Deep Geometric Moments: Balancing Fidelity and Variation
Sangmin Jung
Utkarsh Nath
Yezhou Yang
Giulia Pedrielli
Joydeep Biswas
Amy Zhang
Hassan Ghasemzadeh
Pavan Turaga
DiffM
406
0
0
18 May 2025
VoiceCloak: A Multi-Dimensional Defense Framework against Unauthorized Diffusion-based Voice Cloning
Qianyue Hu
Junyan Wu
Wei Lu
Xiangyang Luo
DiffM
AAML
290
0
0
18 May 2025
DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model
Siwei Xia
Li Sun
Tiantian Sun
Qingli Li
DiffM
472
5
0
18 May 2025
SGD-Mix: Enhancing Domain-Specific Image Classification with Label-Preserving Data Augmentation
Yixuan Dong
Fang-Yi Su
Jung-Hsien Chiang
DiffM
224
0
0
17 May 2025
DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
447
3
0
16 May 2025
NeuSEditor: From Multi-View Images to Text-Guided Neural Surface Edits
Nail Ibrahimli
Julian F. P. Kooij
Liangliang Nan
190
0
0
16 May 2025
Style Customization of Text-to-Vector Generation with Image Diffusion Priors
Peiying Zhang
Nanxuan Zhao
Jing Liao
DiffM
222
2
0
15 May 2025
IMAGE-ALCHEMY: Advancing subject fidelity in personalised text-to-image generation
Amritanshu Tiwari
Cherish Puniani
Kaustubh Sharma
Ojasva Nema
DiffM
325
0
0
15 May 2025
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Bingxin Ke
Kevin Qu
Tianfu Wang
Nando Metzger
Shengyu Huang
Bo Li
Anton Obukhov
Konrad Schindler
DiffM
VLM
378
31
0
14 May 2025
Don't Forget your Inverse DDIM for Image Editing
IEEE Computational Intelligence Magazine (IEEE CIM), 2025
Guillermo Gomez-Trenado
Pablo Mesejo
Ó. Cordón
Stéphane Lathuilière
DiffM
217
1
0
14 May 2025
Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation
Guan Gui
Bin-Bin Gao
Jing Liu
Chengjie Wang
Yongpeng Wu
DiffM
233
0
0
14 May 2025
Visual Watermarking in the Era of Diffusion Models: Advances and Challenges
Junxian Duan
Jiyang Guan
Wenkui Yang
Xiao-Yu Zhang
WIGM
569
2
0
13 May 2025
Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models
International Conference on Learning Representations (ICLR), 2025
Donghoon Kim
Minji Bae
Kyuhong Shim
B. Shim
419
5
0
13 May 2025
Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion
Anle Ke
Xu Zhang
Tong Chen
Ming Lu
Chao Zhou
Jiawen Gu
Zhan Ma
DiffM
243
3
0
13 May 2025
ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
Ozgur Kara
Krishna Kumar Singh
Feng Liu
Duygu Ceylan
James M. Rehg
Tobias Hinz
DiffM
VGen
475
12
0
12 May 2025
FLUXSynID: A Framework for Identity-Controlled Synthetic Face Generation with Document and Live Images
Raul Ismayilov
Dzemila Sero
Luuk Spreeuwers
478
1
0
12 May 2025
TokenProber: Jailbreaking Text-to-image Models via Fine-grained Word Impact Analysis
Longtian Wang
Xiaofei Xie
Tianlin Li
Yuhan Zhi
Chao Shen
235
1
0
11 May 2025
Previous
1
2
3
...
8
9
10
...
49
50
51
Next