Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2208.01626
Cited By
Prompt-to-Prompt Image Editing with Cross Attention Control
2 August 2022
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Prompt-to-Prompt Image Editing with Cross Attention Control"
50 / 1,376 papers shown
Title
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis
S. Sastry
Subash Khanal
A. Dhakal
Nathan Jacobs
58
6
0
09 Apr 2024
SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions
Xiaoyu Liu
Yuxiang Wei
Ming-Yu Liu
Xianhui Lin
Peiran Ren
Xuansong Xie
Wangmeng Zuo
DiffM
39
5
0
09 Apr 2024
ZeST: Zero-Shot Material Transfer from a Single Image
Ta-Ying Cheng
Prafull Sharma
Andrew Markham
Niki Trigoni
Varun Jampani
41
9
0
09 Apr 2024
SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing
Jing Gu
Yilin Wang
Nanxuan Zhao
Wei Xiong
Qing Liu
Zhifei Zhang
He Zhang
Jianming Zhang
HyunJoon Jung
Xin Eric Wang
DiffM
32
8
0
08 Apr 2024
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Jiacheng Zhang
Jie Wu
Yuxi Ren
Xin Xia
Huafeng Kuang
...
Jiashi Li
Xuefeng Xiao
Min Zheng
Lean Fu
Guanbin Li
37
2
0
08 Apr 2024
Responsible Visual Editing
Minheng Ni
Yeli Shen
Lei Zhang
W. Zuo
DiffM
27
0
0
08 Apr 2024
Investigating the Effectiveness of Cross-Attention to Unlock Zero-Shot Editing of Text-to-Video Diffusion Models
Saman Motamed
Wouter Van Gansbeke
Luc Van Gool
VGen
DiffM
37
1
0
08 Apr 2024
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance
Dazhong Shen
Guanglu Song
Zeyue Xue
Fu-Yun Wang
Yu Liu
DiffM
32
11
0
08 Apr 2024
MC
2
^2
2
: Multi-concept Guidance for Customized Multi-concept Generation
Jiaxiu Jiang
Yabo Zhang
Kailai Feng
Xiaohe Wu
Wangmeng Zuo
DiffM
36
11
0
08 Apr 2024
StyleForge: Enhancing Text-to-Image Synthesis for Any Artistic Styles with Dual Binding
Junseo Park
Beom-Seok Ko
Hyeryung Jang
DiffM
96
1
0
08 Apr 2024
PairAug: What Can Augmented Image-Text Pairs Do for Radiology?
Yutong Xie
Qi Chen
Sinuo Wang
Minh Nguyen Nhat To
Iris Lee
Ee Win Khoo
Kerolos Hendy
Daniel Koh
Yong-quan Xia
Qi Wu
MedIm
LM&MA
37
6
0
07 Apr 2024
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Xiefan Guo
Jinlin Liu
Miaomiao Cui
Jiankai Li
Hongyu Yang
Di Huang
23
25
0
06 Apr 2024
ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing
Alec Helbling
Seongmin Lee
Polo Chau
DiffM
19
1
0
05 Apr 2024
Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models
Sang-Sub Jang
Jaehyeong Jo
Kimin Lee
Sung Ju Hwang
23
15
0
05 Apr 2024
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models
Gihyun Kwon
Simon Jenni
Dingzeyu Li
Joon-Young Lee
Jong Chul Ye
Fabian Caba Heilbron
DiffM
45
13
0
05 Apr 2024
DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling
Haoran Li
Haolin Shi
Wenli Zhang
Wenjun Wu
Yong Liao
Lin Wang
Lik-hang Lee
Pengyuan Zhou
3DGS
32
30
0
04 Apr 2024
DreamWalk: Style Space Exploration using Diffusion Guidance
Michelle Shu
Charles Herrmann
Richard Strong Bowen
Forrester Cole
Ramin Zabih
AI4TS
DiffM
37
2
0
04 Apr 2024
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Fei Chen
Steven G. McDonagh
Gerasimos Lampouras
Ignacio Iacobacci
Sarah Parisot
37
10
0
03 Apr 2024
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation
Haofan Wang
Matteo Spinelli
Qixun Wang
Xu Bai
Zekui Qin
Anthony Chen
DiffM
44
85
0
03 Apr 2024
A Unified Editing Method for Co-Speech Gesture Generation via Diffusion Inversion
Zeyu Zhao
Nan Gao
Zhi Zeng
Guixuan Zhang
Jie Liu
Shuwu Zhang
DiffM
36
0
0
03 Apr 2024
Faster Diffusion via Temporal Attention Decomposition
Haozhe Liu
Wentian Zhang
Jinheng Xie
Francesco Faccio
Mengmeng Xu
Tao Xiang
Mike Zheng Shou
Juan-Manuel Perez-Rua
Jürgen Schmidhuber
DiffM
67
19
0
03 Apr 2024
Fashion Style Editing with Generative Human Prior
Chaerin Kong
Seungyong Lee
Soohyeok Im
Wonsuk Yang
43
0
0
02 Apr 2024
CosmicMan: A Text-to-Image Foundation Model for Humans
Shikai Li
Jianglin Fu
Kaiyuan Liu
Wentao Wang
Kwan-Yee Lin
Wayne Wu
DiffM
35
19
0
01 Apr 2024
Large Motion Model for Unified Multi-Modal Motion Generation
Mingyuan Zhang
Daisheng Jin
Chenyang Gu
Fangzhou Hong
Zhongang Cai
...
Chongzhi Zhang
Xinying Guo
Lei Yang
Ying He
Ziwei Liu
VGen
53
25
0
01 Apr 2024
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance
Simran Khanuja
Sathyanarayanan Ramamoorthy
Yueqi Song
Graham Neubig
DiffM
20
11
0
01 Apr 2024
Getting it Right: Improving Spatial Consistency in Text-to-Image Models
Agneet Chatterjee
Gabriela Ben-Melech Stan
Estelle Aflalo
Sayak Paul
Dhruba Ghosh
...
Ludwig Schmidt
Hanna Hajishirzi
Vasudev Lal
Chitta Baral
Yezhou Yang
EGVM
VLM
59
14
0
01 Apr 2024
Uncovering the Text Embedding in Text-to-Image Diffusion Models
Huikang Yu
Hao Luo
Fan Wang
Feng Zhao
31
10
0
01 Apr 2024
Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation
Haofeng Liu
Chenshu Xu
Yifei Yang
Lihua Zeng
Shengfeng He
DiffM
50
22
0
01 Apr 2024
Benchmarking Counterfactual Image Generation
Thomas Melistas
Nikos Spyrou
Nefeli Gkouti
Pedro Sanchez
Athanasios Vlontzos
Yannis Panagakis
G. Papanastasiou
Sotirios A. Tsaftaris
EGVM
CML
46
7
0
29 Mar 2024
U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation
You Wu
Kean Liu
Xiaoyue Mi
Fan Tang
Juan Cao
Jintao Li
DiffM
32
4
0
29 Mar 2024
Motion Inversion for Video Customization
Luozhou Wang
Guibao Shen
Yixun Liang
Xin Tao
Pengfei Wan
Di Zhang
Yijun Li
Yingcong Chen
VGen
DiffM
42
7
0
29 Mar 2024
CLoRA: A Contrastive Approach to Compose Multiple LoRA Models
Tuna Han Salih Meral
Enis Simsar
Federico Tombari
Pinar Yanardag
MoMe
34
0
0
28 Mar 2024
GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models
Yusuf Dalva
Hidir Yesiltepe
Pinar Yanardag
DiffM
32
2
0
28 Mar 2024
MIST: Mitigating Intersectional Bias with Disentangled Cross-Attention Editing in Text-to-Image Diffusion Models
Hidir Yesiltepe
Kiymet Akdemir
Pinar Yanardag
29
3
0
28 Mar 2024
DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation
Haonan Lin
Mengmeng Wang
Yan Chen
Wenbin An
Yuzhe Yao
Guang Dai
Qianying Wang
Yong-Jin Liu
Jingdong Wang
DiffM
38
4
0
28 Mar 2024
TextCraftor: Your Text Encoder Can be Image Quality Controller
Yanyu Li
Xian Liu
Anil Kag
Ju Hu
Yerlan Idelbayev
Dhritiman Sagar
Yanzhi Wang
Sergey Tulyakov
Jian Ren
45
14
0
27 Mar 2024
CPR: Retrieval Augmented Generation for Copyright Protection
Aditya Golatkar
Alessandro Achille
L. Zancato
Yu-Xiang Wang
Ashwin Swaminathan
Stefano Soatto
DiffM
27
16
0
27 Mar 2024
ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion
Daniel Winter
Matan Cohen
Shlomi Fruchter
Yael Pritch
Alex Rav Acha
Yedid Hoshen
DiffM
40
26
0
27 Mar 2024
InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
Ruoyu Zhao
Qingnan Fan
Fei Kou
Shuai Qin
Hong Gu
Wei Wu
Pengcheng Xu
Mingrui Zhu
Nannan Wang
Xinbo Gao
35
4
0
27 Mar 2024
FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing
Trong-Tung Nguyen
Duc A. Nguyen
Anh Tran
Cuong Pham
DiffM
36
7
0
27 Mar 2024
Attention Calibration for Disentangled Text-to-Image Personalization
Yanbing Zhang
Mengping Yang
Qin Zhou
Zhe Wang
27
15
0
27 Mar 2024
DiffStyler: Diffusion-based Localized Image Style Transfer
Shaoxu Li
DiffM
28
7
0
27 Mar 2024
AID: Attention Interpolation of Text-to-Image Diffusion
Qiyuan He
Jinghao Wang
Ziwei Liu
Angela Yao
DiffM
32
9
0
26 Mar 2024
Bidirectional Consistency Models
Liangchen Li
Jiajun He
DiffM
61
11
0
26 Mar 2024
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Zhongwei Zhang
Fuchen Long
Yingwei Pan
Zhaofan Qiu
Ting Yao
Yang Cao
Tao Mei
VGen
43
22
0
25 Mar 2024
SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Rui Zhu
Yingwei Pan
Yehao Li
Ting Yao
Zhenglong Sun
Tao Mei
C. Chen
50
23
0
25 Mar 2024
Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution
Zhikai Chen
Fuchen Long
Zhaofan Qiu
Ting Yao
Wengang Zhou
Jiebo Luo
Tao Mei
DiffM
28
11
0
25 Mar 2024
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation
Omer Dahary
Or Patashnik
Kfir Aberman
Daniel Cohen-Or
DiffM
29
28
0
25 Mar 2024
Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
S. A. Baumann
Felix Krause
Michael Neumayr
Nick Stracke
Vincent Tao Hu
Bjorn Ommer
Björn Ommer
DiffM
LM&Ro
70
11
0
25 Mar 2024
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery
Siddharth Tourani
Ahmed Alwheibi
Arif Mahmood
Muhammad Haris Khan
DiffM
30
1
0
24 Mar 2024
Previous
1
2
3
...
12
13
14
...
26
27
28
Next