Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2312.04461
Cited By
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
7 December 2023
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding"
50 / 151 papers shown
Title
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
Junjie He
Yifeng Geng
Liefeng Bo
DiffM
36
20
0
12 Aug 2024
CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models
Kushal Kumar Jain
Steven A. Grosz
A. Namboodiri
Anil K. Jain
DiffM
22
2
0
02 Aug 2024
Add-SD: Rational Generation without Manual Reference
Lingfeng Yang
Xinyu Zhang
Xiang Li
Jinwen Chen
Kun Yao
Gang Zhang
Errui Ding
Ling-Ling Liu
Jingdong Wang
Jian Yang
24
0
0
30 Jul 2024
Towards Localized Fine-Grained Control for Facial Expression Generation
Tuomas Varanka
Huai-Qian Khor
Yante Li
Mengting Wei
Hanwei Kung
N. Sebe
Guoying Zhao
35
3
0
25 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
49
11
0
17 Jul 2024
Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge
Young-Jun Lee
Dokyong Lee
Junyoung Youn
Kyeongjin Oh
ByungSoo Ko
Jonghwan Hyeon
Ho-Jin Choi
23
2
0
04 Jul 2024
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
Jian Ma
Yonglin Deng
Chen Chen
H. Lu
Zhenyu Yang
Zhenyu Yang
VLM
DiffM
79
6
0
02 Jul 2024
Compositional Image Decomposition with Diffusion Models
Jocelin Su
Nan Liu
Yanbo Wang
Joshua B. Tenenbaum
Yilun Du
CoGe
25
5
0
27 Jun 2024
LIPE: Learning Personalized Identity Prior for Non-rigid Image Editing
Aoyang Liu
Qingnan Fan
Shuai Qin
Hong Gu
Yansong Tang
DiffM
40
1
0
25 Jun 2024
Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization
Yuhang Ma
Wenting Xu
Jiji Tang
Qinfeng Jin
Rongsheng Zhang
Zeng Zhao
Changjie Fan
Zhipeng Hu
21
6
0
24 Jun 2024
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts
Yucheng Han
Rui Wang
Chi Zhang
Juntao Hu
Pei Cheng
Bin-Bin Fu
Hanwang Zhang
65
6
0
13 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
34
40
0
11 Jun 2024
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation
Lianyu Pang
Jian Yin
Baoquan Zhao
Feize Wu
Fu Lee Wang
Qing Li
Xudong Mao
DiffM
31
1
0
07 Jun 2024
Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter
Peng-Fei Xing
Ning Wang
Jianbo Ouyang
Zechao Li
DiffM
28
1
0
05 Jun 2024
GenPalm: Contactless Palmprint Generation with Diffusion Models
Steven A. Grosz
Anil K. Jain
29
2
0
01 Jun 2024
RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance
JiaoJiao Fan
Haotian Xue
Qinsheng Zhang
Yongxin Chen
30
0
0
27 May 2024
Protect-Your-IP: Scalable Source-Tracing and Attribution against Personalized Generation
Runyi Li
Xuanyu Zhang
Zhipei Xu
Yongbing Zhang
Jian Zhang
WIGM
39
3
0
26 May 2024
Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model
Mingyang Yi
Aoxue Li
Yi Xin
Zhenguo Li
DiffM
29
11
0
24 May 2024
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
Zhicheng Sun
Zhenhao Yang
Yang Jin
Haozhe Chi
Kun Xu
...
Hao Jiang
Di Zhang
Yang Song
Kun Gai
Yadong Mu
18
3
0
23 May 2024
MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation
Yuxiang Wei
Zhilong Ji
Jinfeng Bai
Hongzhi Zhang
Lei Zhang
W. Zuo
DiffM
33
0
0
09 May 2024
A Survey on Personalized Content Synthesis with Diffusion Models
Xu-Lu Zhang
Xiao Wei
Wengyu Zhang
Jinlin Wu
Zhaoxiang Zhang
Zhen Lei
Qing Li
Zhen Lei
Qing Li
EGVM
119
18
0
09 May 2024
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Yupeng Zhou
Daquan Zhou
Ming-Ming Cheng
Jiashi Feng
Qibin Hou
DiffM
VGen
25
86
0
02 May 2024
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Chanran Kim
Jeongin Lee
Shichang Joung
Bongmo Kim
Yeul-Min Baek
98
16
0
30 Apr 2024
Hide and Seek: How Does Watermarking Impact Face Recognition?
Yuguang Yao
Steven Grosz
Sijia Liu
Anil K. Jain
30
1
0
29 Apr 2024
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Zinan Guo
Yanze Wu
Zhuowei Chen
Lang Chen
Qian He
DiffM
33
57
0
24 Apr 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
29
6
0
24 Apr 2024
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Weifeng Chen
Jiacheng Zhang
Jie Wu
Hefeng Wu
Xuefeng Xiao
Liang Lin
23
12
0
23 Apr 2024
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Xuanhua He
Quande Liu
Shengju Qian
Xin Eric Wang
Tao Hu
Ke Cao
K. Yan
Jie Zhang
VGen
21
39
0
23 Apr 2024
From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Zehuan Huang
Hongxing Fan
Lipeng Wang
Lu Sheng
DiffM
16
10
0
23 Apr 2024
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Chenyang Zhu
Kai Li
Yue Ma
Chunming He
Li Xiu
DiffM
92
22
0
22 Apr 2024
Universal Fingerprint Generation: Controllable Diffusion Model with Multimodal Conditions
Steven A. Grosz
Anil K. Jain
32
2
0
21 Apr 2024
MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation
Kuan-Chieh Jackson Wang
Daniil Ostashev
Yuwei Fang
Sergey Tulyakov
Kfir Aberman
17
8
0
17 Apr 2024
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Rinon Gal
Or Lichter
Elad Richardson
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
DiffM
31
29
0
04 Apr 2024
InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Ruoyu Zhao
Qingnan Fan
Fei Kou
Shuai Qin
Hong Gu
Wei Wu
Pengcheng Xu
Mingrui Zhu
Nannan Wang
Xinbo Gao
22
4
0
27 Mar 2024
FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Shilong Zhang
Lianghua Huang
Xi Chen
Yifei Zhang
Zhigang Wu
Yutong Feng
Wei Wang
Yujun Shen
Yu Liu
Ping Luo
35
7
0
25 Mar 2024
MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration
Zhichao Wei
Qingkun Su
Long Qin
Weizhi Wang
DiffM
23
6
0
22 Mar 2024
Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm
Yi Wu
Ziqiang Li
Heliang Zheng
Chaoyue Wang
Bin Li
DiffM
42
17
0
18 Mar 2024
Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Rui Li
Ruihuang Li
Song Guo
Lei Zhang
DiffM
16
7
0
17 Mar 2024
OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models
Zhe Kong
Yong Zhang
Tianyu Yang
Tao Wang
Kaihao Zhang
Bizhu Wu
Guanying Chen
Wei Liu
Wenhan Luo
DiffM
31
8
0
16 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
64
35
0
07 Mar 2024
Transparent Image Layer Diffusion using Latent Transparency
Lvmin Zhang
Maneesh Agrawala
29
41
0
27 Feb 2024
Beyond Inserting: Learning Identity Embedding for Semantic-Fidelity Personalized Diffusion Generation
Yang Li
Songlin Yang
Wei Wang
Jing Dong
DiffM
11
1
0
31 Jan 2024
StableIdentity: Inserting Anybody into Anywhere at First Sight
Qinghe Wang
Xu Jia
Xiaomin Li
Taiqing Li
Liqian Ma
Yunzhi Zhuge
Huchuan Lu
26
20
0
29 Jan 2024
UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion
Wei Li
Xue Xu
Jiachen Liu
Xinyan Xiao
10
5
0
24 Jan 2024
InstantID: Zero-shot Identity-Preserving Generation in Seconds
Qixun Wang
Xu Bai
Haofan Wang
Zekui Qin
Anthony Chen
Huaxia Li
Xu Tang
Yao Hu
16
234
0
15 Jan 2024
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe-nan Lin
H. J. Jung
DiffM
116
272
0
06 Apr 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
General Facial Representation Learning in a Visual-Linguistic Manner
Yinglin Zheng
Hao Yang
Ting Zhang
Jianmin Bao
Dongdong Chen
Yangyu Huang
Lu Yuan
Dong Chen
Ming Zeng
Fang Wen
CVBM
126
161
0
06 Dec 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
283
5,723
0
29 Apr 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
273
845
0
17 Feb 2021
Previous
1
2
3
4
Next