arXiv:2304.08465
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
17 April 2023
Ming Cao
Xintao Wang
Zhongang Qi
Ying Shan
Xiaohu Qie
Yinqiang Zheng
DiffM
Papers citing "MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing"
50 / 60 papers shown
Title
Multi-turn Consistent Image Editing
Zijun Zhou
Yingying Deng
Xiangyu He
Weiming Dong
Fan Tang
46
0
0
07 May 2025
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li
Xin Gu
Fan Chen
X. Xing
Longyin Wen
C. L. P. Chen
Sijie Zhu
DiffM
71
1
0
05 May 2025
FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
Jiangtong Tan
Hu Yu
Jie Huang
Jie Xiao
Feng Zhao
57
1
0
02 May 2025
InstructAttribute: Fine-grained Object Attributes editing with Instruction
Xingxi Yin
Jingfeng Zhang
Zhi Li
Y. Li
Y. Zhang
DiffM
75
0
0
01 May 2025
REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models
Gal Almog
Ariel Shamir
Ohad Fried
DiffM
50
0
0
26 Apr 2025
Physics-informed 4D X-ray image reconstruction from ultra-sparse spatiotemporal data
Zisheng Yao
Yuhe Zhang
Zhe Hu
Robert Klöfkorn
Tobias Ritschel
Pablo Villanueva-Perez
AI4CE
61
0
0
04 Apr 2025
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
Hyunjun Lee
Hyunsoo Lee
Sookwan Han
DiffM
44
0
0
27 Mar 2025
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models
Runze He
Bo Cheng
Yuhang Ma
Qingxiang Jia
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Liebucha Wu
Dawei Leng
Yuhui Yin
DiffM
VLM
47
0
0
13 Mar 2025
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
Yucheng Suo
Fan Ma
Kaixin Shen
Linchao Zhu
Yi Yang
VLM
45
0
0
12 Mar 2025
Consistent Image Layout Editing with Diffusion Models
Tao Xia
Yudi Zhang
Ting Liu
Lei Zhang
DiffM
49
1
0
09 Mar 2025
Zero-Shot Head Swapping in Real-World Scenarios
S. Jeong
Taewoong Kang
Hyojin Jang
Jaegul Choo
34
0
0
02 Mar 2025
MagicFace: High-Fidelity Facial Expression Editing with Action-Unit Control
Mengting Wei
Tuomas Varanka
Xingxun Jiang
Huai-Qian Khor
Guoying Zhao
DiffM
39
0
0
10 Jan 2025
GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
Rahul Sajnani
Jeroen Vanbaar
Jie Min
Kapil D. Katyal
Srinath Sridhar
DiffM
49
10
0
03 Jan 2025
RealCustom++: Representing Images as Real-Word for Real-Time Customization
Zhendong Mao
Mengqi Huang
Fei Ding
Mingcong Liu
Qian He
Xiaojun Chang
DiffM
60
6
0
03 Jan 2025
Edicho: Consistent Image Editing in the Wild
Qingyan Bai
Hao Ouyang
Yinghao Xu
Qiuyu Wang
Ceyuan Yang
Ka Leong Cheng
Yujun Shen
Qifeng Chen
DiffM
62
1
0
30 Dec 2024
Zero-Shot Low Light Image Enhancement with Diffusion Prior
Joshua Cho
Sara Aghajanzadeh
Zhen Zhu
David A. Forsyth
DiffM
94
0
0
18 Dec 2024
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Q. He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Y. Liu
Y. Wang
Chengjie Wang
X. Li
J. Zhang
DiffM
117
1
0
04 Dec 2024
Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects
Weimin Qiu
Jieke Wang
Meng Tang
DiffM
74
0
0
28 Nov 2024
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
P. Xu
Boyuan Jiang
Xiaobin Hu
Donghao Luo
Q. He
J. Zhang
Chengjie Wang
Yunsheng Wu
Charles X. Ling
Boyu Wang
87
2
0
24 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
H. Zhang
Yueting Zhuang
DiffM
95
15
0
24 Nov 2024
SpatialDreamer: Self-supervised Stereo Video Synthesis from Monocular Input
Zhen Lv
Yangqi Long
Congzhentao Huang
Cao Li
Chengfei Lv
Hao Ren
Dian Zheng
DiffM
VGen
MDE
110
5
0
18 Nov 2024
ColorEdit: Training-free Image-Guided Color editing with diffusion model
Xingxi Yin
Zhi Li
Jingfeng Zhang
Chenglin Li
Yin Zhang
DiffM
47
0
0
15 Nov 2024
SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation
Koichi Namekata
Sherwin Bahmani
Ziyi Wu
Yash Kant
Igor Gilitschenski
David B. Lindell
VGen
55
13
0
07 Nov 2024
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control
Weichao Zeng
Yan Shu
Zhenhang Li
Dongbao Yang
Yu Zhou
DiffM
22
7
0
14 Oct 2024
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design
Jiachen Li
Qian Long
Jian Zheng
Xiaofeng Gao
Robinson Piramuthu
Wenhu Chen
William Yang Wang
VGen
25
22
0
08 Oct 2024
TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer
Zihan Su
Junhao Zhuang
Chun Yuan
DiffM
32
0
0
15 Sep 2024
Training-Free Style Consistent Image Synthesis with Condition and Mask Guidance in E-Commerce
Guandong Li
DiffM
19
2
0
07 Sep 2024
Training-Free Sketch-Guided Diffusion with Latent Optimization
Sandra Zhang Ding
Jiafeng Mao
Kiyoharu Aizawa
DiffM
86
1
0
31 Aug 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
52
11
0
17 Jul 2024
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park
Hyeryung Jang
63
0
0
17 Jul 2024
Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance
Kuan Heng Lin
Sicheng Mo
Ben Klingher
Fangzhou Mu
Bolei Zhou
DiffM
16
15
0
11 Jun 2024
I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models
Wenqi Ouyang
Yi Dong
Lei Yang
Jianlou Si
Xingang Pan
VGen
DiffM
37
11
0
26 May 2024
ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion
Ziyue Zhang
Mingbao Lin
Rongrong Ji
Yuxin Zhang
DiffM
42
3
0
26 Apr 2024
Motion Inversion for Video Customization
Luozhou Wang
Guibao Shen
Yixun Liang
Xin Tao
Pengfei Wan
Di Zhang
Yijun Li
Yingcong Chen
VGen
DiffM
32
7
0
29 Mar 2024
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
S. A. Baumann
Felix Krause
Michael Neumayr
Nick Stracke
Vincent Tao Hu
Björn Ommer
DiffM
LM&Ro
66
11
0
25 Mar 2024
VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis
Yumeng Li
William H. Beluch
M. Keuper
Dan Zhang
Anna Khoreva
DiffM
VGen
71
5
0
20 Mar 2024
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model
Yuxuan Zhang
Lifu Wei
Qing Zhang
Yiren Song
DiffM
26
12
0
12 Mar 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
EGVM
63
82
0
27 Feb 2024
AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning
W. Para
Abdelrahman Eldesokey
Zhenyu Li
Pradyumna Reddy
Jiankang Deng
Peter Wonka
DiffM
12
0
0
08 Feb 2024
Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing
Gwanhyeong Koo
Sunjae Yoon
Changdong Yoo
DiffM
6
7
0
18 Jan 2024
PEEKABOO: Interactive Video Generation via Masked-Diffusion
Yash Jain
Anshul Nasery
Vibhav Vineet
Harkirat Singh Behl
VGen
24
30
0
12 Dec 2023
Relightful Harmonization: Lighting-aware Portrait Background Replacement
Mengwei Ren
Wei Xiong
Jae Shin Yoon
Zhixin Shu
Jianming Zhang
HyunJoon Jung
Guido Gerig
He Zhang
DiffM
21
16
0
11 Dec 2023
FaceStudio: Put Your Face Everywhere in Seconds
Yuxuan Yan
C. Zhang
Rui Wang
Yichao Zhou
Gege Zhang
Pei Cheng
Gang Yu
Bin-Bin Fu
DiffM
21
38
0
05 Dec 2023
M$^{2}$Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation
Xiaowei Chi
Rongyu Zhang
Zhengkai Jiang
Yijiang Liu
Ziyi Lin
...
Chaoyou Fu
Peng Gao
Shanghang Zhang
Qi-fei Liu
Yi-Ting Guo
MLLM
33
1
0
29 Nov 2023
Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer
Danah Yatim
Rafail Fridman
Omer Bar-Tal
Yoni Kasten
Tali Dekel
DiffM
VGen
16
50
0
28 Nov 2023
ConTex-Human: Free-View Rendering of Human from a Single Image with Texture-Consistent Synthesis
Xiangjun Gao
Xiaoyu Li
Chaopeng Zhang
Qi Zhang
Yan-Pei Cao
Ying Shan
Long Quan
DiffM
34
9
0
28 Nov 2023
As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors
Seungwoo Yoo
Kunho Kim
Vladimir G. Kim
Minhyuk Sung
DiffM
16
13
0
28 Nov 2023
HiFi-123: Towards High-fidelity One Image to 3D Content Generation
Wangbo Yu
Li-ming Yuan
Yan-Pei Cao
Xiangjun Gao
Xiaoyu Li
Wenbo Hu
Long Quan
Ying Shan
Yonghong Tian
DiffM
8
29
0
10 Oct 2023
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiarui Yao
Yifan Liu
Simon S. Du
Shifeng Chen
DiffM
11
24
0
28 Sep 2023
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing
Kai Wang
Fei Yang
Shiqi Yang
Muhammad Atif Butt
Joost van de Weijer
DiffM
15
50
0
27 Sep 2023