ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.04461
  4. Cited By
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

7 December 2023
Zhen Li
Mingdeng Cao
Xintao Wang
Zhongang Qi
Ming-Ming Cheng
Ying Shan
    DiffM
ArXivPDFHTML

Papers citing "PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding"

50 / 151 papers shown
Title
UniPortrait: A Unified Framework for Identity-Preserving Single- and
  Multi-Human Image Personalization
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
Junjie He
Yifeng Geng
Liefeng Bo
DiffM
36
20
0
12 Aug 2024
CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset
  Augmentation using Diffusion Models
CLIP4Sketch: Enhancing Sketch to Mugshot Matching through Dataset Augmentation using Diffusion Models
Kushal Kumar Jain
Steven A. Grosz
A. Namboodiri
Anil K. Jain
DiffM
22
2
0
02 Aug 2024
Add-SD: Rational Generation without Manual Reference
Add-SD: Rational Generation without Manual Reference
Lingfeng Yang
Xinyu Zhang
Xiang Li
Jinwen Chen
Kun Yao
Gang Zhang
Errui Ding
Ling-Ling Liu
Jingdong Wang
Jian Yang
24
0
0
30 Jul 2024
Towards Localized Fine-Grained Control for Facial Expression Generation
Towards Localized Fine-Grained Control for Facial Expression Generation
Tuomas Varanka
Huai-Qian Khor
Yante Li
Mengting Wei
Hanwei Kung
N. Sebe
Guoying Zhao
35
3
0
25 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
49
11
0
17 Jul 2024
Stark: Social Long-Term Multi-Modal Conversation with Persona
  Commonsense Knowledge
Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge
Young-Jun Lee
Dokyong Lee
Junyoung Youn
Kyeongjin Oh
ByungSoo Ko
Jonghwan Hyeon
Ho-Jin Choi
23
2
0
04 Jul 2024
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models
Jian Ma
Yonglin Deng
Chen Chen
H. Lu
Zhenyu Yang
Zhenyu Yang
VLM
DiffM
79
6
0
02 Jul 2024
Compositional Image Decomposition with Diffusion Models
Compositional Image Decomposition with Diffusion Models
Jocelin Su
Nan Liu
Yanbo Wang
Joshua B. Tenenbaum
Yilun Du
CoGe
25
5
0
27 Jun 2024
LIPE: Learning Personalized Identity Prior for Non-rigid Image Editing
LIPE: Learning Personalized Identity Prior for Non-rigid Image Editing
Aoyang Liu
Qingnan Fan
Shuai Qin
Hong Gu
Yansong Tang
DiffM
40
1
0
25 Jun 2024
Character-Adapter: Prompt-Guided Region Control for High-Fidelity
  Character Customization
Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization
Yuhang Ma
Wenting Xu
Jiji Tang
Qinfeng Jin
Rongsheng Zhang
Zeng Zhao
Changjie Fan
Zhipeng Hu
21
6
0
24 Jun 2024
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal
  Prompts
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts
Yucheng Han
Rui Wang
Chi Zhang
Juntao Hu
Pei Cheng
Bin-Bin Fu
Hanwang Zhang
65
6
0
13 Jun 2024
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
X. Wang
Siming Fu
Qihan Huang
Wanggui He
Hao Jiang
DiffM
34
40
0
11 Jun 2024
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image
  Generation
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation
Lianyu Pang
Jian Yin
Baoquan Zhao
Feize Wu
Fu Lee Wang
Qing Li
Xudong Mao
DiffM
31
1
0
07 Jun 2024
Inv-Adapter: ID Customization Generation via Image Inversion and
  Lightweight Adapter
Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter
Peng-Fei Xing
Ning Wang
Jianbo Ouyang
Zechao Li
DiffM
28
1
0
05 Jun 2024
GenPalm: Contactless Palmprint Generation with Diffusion Models
GenPalm: Contactless Palmprint Generation with Diffusion Models
Steven A. Grosz
Anil K. Jain
29
2
0
01 Jun 2024
RefDrop: Controllable Consistency in Image or Video Generation via
  Reference Feature Guidance
RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance
JiaoJiao Fan
Haotian Xue
Qinsheng Zhang
Yongxin Chen
30
0
0
27 May 2024
Protect-Your-IP: Scalable Source-Tracing and Attribution against
  Personalized Generation
Protect-Your-IP: Scalable Source-Tracing and Attribution against Personalized Generation
Runyi Li
Xuanyu Zhang
Zhipei Xu
Yongbing Zhang
Jian Zhang
WIGM
39
3
0
26 May 2024
Towards Understanding the Working Mechanism of Text-to-Image Diffusion
  Model
Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model
Mingyang Yi
Aoxue Li
Yi Xin
Zhenguo Li
DiffM
29
11
0
24 May 2024
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance
Zhicheng Sun
Zhenhao Yang
Yang Jin
Haozhe Chi
Kun Xu
...
Hao Jiang
Di Zhang
Yang Song
Kun Gai
Yadong Mu
18
3
0
23 May 2024
MasterWeaver: Taming Editability and Face Identity for Personalized
  Text-to-Image Generation
MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation
Yuxiang Wei
Zhilong Ji
Jinfeng Bai
Hongzhi Zhang
Lei Zhang
W. Zuo
DiffM
33
0
0
09 May 2024
A Survey on Personalized Content Synthesis with Diffusion Models
A Survey on Personalized Content Synthesis with Diffusion Models
Xu-Lu Zhang
Xiao Wei
Wengyu Zhang
Jinlin Wu
Zhaoxiang Zhang
Zhen Lei
Qing Li
Zhen Lei
Qing Li
EGVM
119
18
0
09 May 2024
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video
  Generation
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
Yupeng Zhou
Daquan Zhou
Ming-Ming Cheng
Jiashi Feng
Qibin Hou
DiffM
VGen
25
86
0
02 May 2024
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Chanran Kim
Jeongin Lee
Shichang Joung
Bongmo Kim
Yeul-Min Baek
98
16
0
30 Apr 2024
Hide and Seek: How Does Watermarking Impact Face Recognition?
Hide and Seek: How Does Watermarking Impact Face Recognition?
Yuguang Yao
Steven Grosz
Sijia Liu
Anil K. Jain
30
1
0
29 Apr 2024
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Zinan Guo
Yanze Wu
Zhuowei Chen
Lang Chen
Qian He
DiffM
33
57
0
24 Apr 2024
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion
  Models
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
Qinghe Wang
Baolu Li
Xiaomin Li
Bing Cao
Liqian Ma
Huchuan Lu
Xu Jia
DiffM
29
6
0
24 Apr 2024
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with
  Reward Feedback Learning
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Weifeng Chen
Jiacheng Zhang
Jie Wu
Hefeng Wu
Xuefeng Xiao
Liang Lin
23
12
0
23 Apr 2024
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation
Xuanhua He
Quande Liu
Shengju Qian
Xin Eric Wang
Tao Hu
Ke Cao
K. Yan
Jie Zhang
VGen
21
39
0
23 Apr 2024
From Parts to Whole: A Unified Reference Framework for Controllable
  Human Image Generation
From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation
Zehuan Huang
Hongxing Fan
Lipeng Wang
Lu Sheng
DiffM
16
10
0
23 Apr 2024
MultiBooth: Towards Generating All Your Concepts in an Image from Text
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Chenyang Zhu
Kai Li
Yue Ma
Chunming He
Li Xiu
DiffM
92
22
0
22 Apr 2024
Universal Fingerprint Generation: Controllable Diffusion Model with
  Multimodal Conditions
Universal Fingerprint Generation: Controllable Diffusion Model with Multimodal Conditions
Steven A. Grosz
Anil K. Jain
32
2
0
21 Apr 2024
MoA: Mixture-of-Attention for Subject-Context Disentanglement in
  Personalized Image Generation
MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation
Kuan-Chieh Jackson Wang
Daniil Ostashev
Yuwei Fang
Sergey Tulyakov
Kfir Aberman
17
8
0
17 Apr 2024
LCM-Lookahead for Encoder-based Text-to-Image Personalization
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Rinon Gal
Or Lichter
Elad Richardson
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
DiffM
31
29
0
04 Apr 2024
InstructBrush: Learning Attention-based Instruction Optimization for
  Image Editing
InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Ruoyu Zhao
Qingnan Fan
Fei Kou
Shuai Qin
Hong Gu
Wei Wu
Pengcheng Xu
Mingrui Zhu
Nannan Wang
Xinbo Gao
22
4
0
27 Mar 2024
FlashFace: Human Image Personalization with High-fidelity Identity
  Preservation
FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Shilong Zhang
Lianghua Huang
Xi Chen
Yifei Zhang
Zhigang Wu
Yutong Feng
Wei Wang
Yujun Shen
Yu Liu
Ping Luo
35
7
0
25 Mar 2024
MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition
  Integration
MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration
Zhichao Wei
Qingkun Su
Long Qin
Weizhi Wang
DiffM
23
6
0
22 Mar 2024
Infinite-ID: Identity-preserved Personalization via ID-semantics
  Decoupling Paradigm
Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm
Yi Wu
Ziqiang Li
Heliang Zheng
Chaoyue Wang
Bin Li
DiffM
42
17
0
18 Mar 2024
Source Prompt Disentangled Inversion for Boosting Image Editability with
  Diffusion Models
Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models
Rui Li
Ruihuang Li
Song Guo
Lei Zhang
DiffM
16
7
0
17 Mar 2024
OMG: Occlusion-friendly Personalized Multi-concept Generation in
  Diffusion Models
OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models
Zhe Kong
Yong Zhang
Tianyu Yang
Tao Wang
Kaihao Zhang
Bizhu Wu
Guanying Chen
Wei Liu
Wenhan Luo
DiffM
31
8
0
16 Mar 2024
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Controllable Generation with Text-to-Image Diffusion Models: A Survey
Pu Cao
Feng Zhou
Qing-Huang Song
Lu Yang
64
35
0
07 Mar 2024
Transparent Image Layer Diffusion using Latent Transparency
Transparent Image Layer Diffusion using Latent Transparency
Lvmin Zhang
Maneesh Agrawala
29
41
0
27 Feb 2024
Beyond Inserting: Learning Identity Embedding for Semantic-Fidelity
  Personalized Diffusion Generation
Beyond Inserting: Learning Identity Embedding for Semantic-Fidelity Personalized Diffusion Generation
Yang Li
Songlin Yang
Wei Wang
Jing Dong
DiffM
11
1
0
31 Jan 2024
StableIdentity: Inserting Anybody into Anywhere at First Sight
StableIdentity: Inserting Anybody into Anywhere at First Sight
Qinghe Wang
Xu Jia
Xiaomin Li
Taiqing Li
Liqian Ma
Yunzhi Zhuge
Huchuan Lu
26
20
0
29 Jan 2024
UNIMO-G: Unified Image Generation through Multimodal Conditional
  Diffusion
UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion
Wei Li
Xue Xu
Jiachen Liu
Xinyan Xiao
10
5
0
24 Jan 2024
InstantID: Zero-shot Identity-Preserving Generation in Seconds
InstantID: Zero-shot Identity-Preserving Generation in Seconds
Qixun Wang
Xu Bai
Haofan Wang
Zekui Qin
Anthony Chen
Huaxia Li
Xu Tang
Yao Hu
16
234
0
15 Jan 2024
InstantBooth: Personalized Text-to-Image Generation without Test-Time
  Finetuning
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Jing Shi
Wei Xiong
Zhe-nan Lin
H. J. Jung
DiffM
116
272
0
06 Apr 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
General Facial Representation Learning in a Visual-Linguistic Manner
General Facial Representation Learning in a Visual-Linguistic Manner
Yinglin Zheng
Hao Yang
Ting Zhang
Jianmin Bao
Dongdong Chen
Yangyu Huang
Lu Yuan
Dong Chen
Ming Zeng
Fang Wen
CVBM
126
161
0
06 Dec 2021
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
283
5,723
0
29 Apr 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize
  Long-Tail Visual Concepts
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
273
845
0
17 Feb 2021
Previous
1234
Next