ResearchTrend.AI
Prompt-to-Prompt Image Editing with Cross Attention Control

2 August 2022
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
    DiffM

Papers citing "Prompt-to-Prompt Image Editing with Cross Attention Control"

50 / 1,376 papers shown
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
75
6
0
30 May 2024
Streaming Video Diffusion: Online Video Editing with Diffusion Models
Feng Chen
Zhen Yang
Bohan Zhuang
Qi Wu
DiffM
41
4
0
30 May 2024
Text Guided Image Editing with Automatic Concept Locating and Forgetting
Jia Li
Lijie Hu
Zhixian He
Jingfeng Zhang
Tianhang Zheng
Di Wang
DiffM
41
8
0
30 May 2024
Creating Language-driven Spatial Variations of Icon Images
Xianghao Xu
Aditya Ganeshan
K. Willis
Yewen Pu
Daniel E. Ritchie
42
0
0
30 May 2024
Personalized Interiors at Scale: Leveraging AI for Efficient and Customizable Design Solutions
Kaiwen Zhou
Tianyu Wang
40
2
0
29 May 2024
SketchDeco: Decorating B&W Sketches with Colour
Chaitat Utintu
Pinaki Nath Chowdhury
Aneeshan Sain
Subhadeep Koley
A. Bhunia
Yi-Zhe Song
DiffM
34
3
0
29 May 2024
Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering
Ido Sobol
Chenfeng Xu
Or Litany
DiffM
35
1
0
29 May 2024
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting
Qihang Zhang
Yinghao Xu
Chaoyang Wang
Hsin-Ying Lee
Gordon Wetzstein
Bolei Zhou
Ceyuan Yang
3DGS
38
6
0
28 May 2024
Text Modality Oriented Image Feature Extraction for Detecting Diffusion-based DeepFake
Di Yang
Yihao Huang
Qing-Wu Guo
Felix Juefei Xu
Xiaojun Jia
Run Wang
G. Pu
Yang Liu
DiffM
32
0
0
28 May 2024
AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization
Junjie Shentu
Matthew Watson
Noura Al Moubayed
DiffM
49
0
0
28 May 2024
Diffusion Model Patching via Mixture-of-Prompts
Seokil Ham
Sangmin Woo
Jin-Young Kim
Hyojun Go
Byeongjun Park
Changick Kim
VLM
26
2
0
28 May 2024
From Text to Blueprint: Leveraging Text-to-Image Tools for Floor Plan Creation
Xiaoyu Li
Jonathan Benjamin
Xin Zhang
38
1
0
27 May 2024
Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View
Jin Wang
Shichao Dong
Yapeng Zhu
Kelu Yao
Weidong Zhao
Chao Li
Ping Luo
CoGe
LRM
43
2
0
27 May 2024
Training-free Editioning of Text-to-Image Models
Jinqi Wang
Yunfei Fu
Zhangcan Ding
Bailin Deng
Yu-Kun Lai
Yipeng Qin
DiffM
VLM
34
0
0
27 May 2024
Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection
Gihyun Kwon
Jangho Park
Jong Chul Ye
VGen
DiffM
45
0
0
27 May 2024
TIE: Revolutionizing Text-based Image Editing for Complex-Prompt Following and High-Fidelity Editing
Xinyu Zhang
Mengxue Kang
Fei Wei
Shuang Xu
Yuhe Liu
Lin Ma
MLLM
DiffM
32
2
0
27 May 2024
PromptFix: You Prompt and We Fix the Photo
Yongsheng Yu
Ziyun Zeng
Hang Hua
Jianlong Fu
Jiebo Luo
MLLM
DiffM
VLM
38
20
0
27 May 2024
DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models
Hengkang Wang
Xu Zhang
Taihui Li
Yuxiang Wan
Tiancong Chen
Ju Sun
DiffM
29
12
0
27 May 2024
I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models
Wenqi Ouyang
Yi Dong
Lei Yang
Jianlou Si
Xingang Pan
VGen
DiffM
41
11
0
26 May 2024
LEAST: "Local" text-conditioned image style transfer
Silky Singh
Surgan Jandial
Simra Shahid
Abhinav Java
37
0
0
25 May 2024
ModelLock: Locking Your Model With a Spell
Yifeng Gao
Yuhua Sun
Xingjun Ma
Zuxuan Wu
Yu-Gang Jiang
VLM
40
1
0
25 May 2024
ExactDreamer: High-Fidelity Text-to-3D Content Creation via Exact Score Matching
Yumin Zhang
Xingyu Miao
Haoran Duan
Bo Wei
Tejal Shah
Yang Long
R. Ranjan
27
3
0
24 May 2024
FastDrag: Manipulate Anything in One Step
Xuanjia Zhao
Jian Guan
Congyi Fan
Dongli Xu
Youtian Lin
Haiwei Pan
Pengming Feng
DiffM
30
4
0
24 May 2024
Challenges and Opportunities in 3D Content Generation
Ke Zhao
Andreas Larsen
29
0
0
24 May 2024
Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion
Aoxue Li
Mingyang Yi
Zhenguo Li
DiffM
48
0
0
24 May 2024
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Feng Liang
Akio Kodaira
Chenfeng Xu
M. Tomizuka
Kurt Keutzer
Diana Marculescu
DiffM
VGen
70
7
0
24 May 2024
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Ling Yang
Bo-Wen Zeng
Jiaming Liu
Hong Li
Minghao Xu
Wentao Zhang
Shuicheng Yan
DiffM
34
9
0
23 May 2024
Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation
Shiqi Yang
Zhi-Wei Zhong
Mengjie Zhao
Shusuke Takahashi
Masato Ishii
Takashi Shibuya
Yuki Mitsufuji
43
2
0
23 May 2024
PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control
Yong Zhong
Min Zhao
Zebin You
Xiaofeng Yu
Changwang Zhang
Chongxuan Li
DiffM
31
6
0
23 May 2024
FreeTuner: Any Subject in Any Style with Training-free Diffusion
Youcan Xu
Zhen Wang
Jun Xiao
Wei Liu
Long Chen
DiffM
36
9
0
23 May 2024
Enhancing Image Layout Control with Loss-Guided Diffusion Models
Zakaria Patel
Kirill Serkh
DiffM
36
3
0
23 May 2024
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Ganggui Ding
Canyu Zhao
Wen Wang
Zhen Yang
Zide Liu
Hao Chen
Chunhua Shen
DiffM
33
20
0
22 May 2024
MotionCraft: Physics-based Zero-Shot Video Generation
L. S. Aira
Antonio Montanaro
Emanuele Aiello
D. Valsesia
E. Magli
DiffM
VGen
26
9
0
22 May 2024
Enhanced Creativity and Ideation through Stable Video Synthesis
Elijah Miller
Thomas Dupont
Mingming Wang
VGen
28
0
0
22 May 2024
Personalized Residuals for Concept-Driven Text-to-Image Generation
Cusuh Ham
Matthew Fisher
James Hays
Nicholas I. Kolkin
Yuchen Liu
Richard Y. Zhang
Tobias Hinz
DiffM
46
7
0
21 May 2024
EmoEdit: Evoking Emotions through Image Manipulation
Jingyuan Yang
Jiawei Feng
Weibin Luo
Dani Lischinski
Daniel Cohen-Or
Hui Huang
DiffM
19
1
0
21 May 2024
CustomText: Customized Textual Image Generation using Diffusion Models
Shubham Paliwal
Arushi Jain
Monika Sharma
Vikram Jamwal
L. Vig
35
0
0
21 May 2024
Customize Your Own Paired Data via Few-shot Way
Jinshu Chen
Bingchuan Li
Miao Hua
Panpan Xu
Qian He
DiffM
34
0
0
21 May 2024
Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices
Nathaniel Cohen
Vladimir Kulikov
Matan Kleiner
Inbar Huberman-Spiegelglas
T. Michaeli
VGen
DiffM
30
15
0
20 May 2024
Images that Sound: Composing Images and Sounds on a Single Canvas
Ziyang Chen
Daniel Geng
Andrew Owens
DiffM
48
9
0
20 May 2024
ReasonPix2Pix: Instruction Reasoning Dataset for Advanced Image Editing
Ying Jin
Pengyang Ling
Xiao-wen Dong
Pan Zhang
Jiaqi Wang
Dahua Lin
29
2
0
18 May 2024
ART3D: 3D Gaussian Splatting for Text-Guided Artistic Scenes Generation
Pengzhi Li
Chengshuai Tang
Qinxuan Huang
Zhiheng Li
3DGS
41
12
0
17 May 2024
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
...
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
31
12
0
16 May 2024
Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation
Shengyuan Liu
Bo Wang
Ye Ma
Te Yang
Xipeng Cao
Quan Chen
Han Li
Di Dong
Peng Jiang
EGVM
41
2
0
11 May 2024
Prompt-guided Precise Audio Editing with Diffusion Models
Manjie Xu
Chenxing Li
Duzhen Zhang
Dan Su
Weihan Liang
Dong Yu
DiffM
36
4
0
11 May 2024
Non-confusing Generation of Customized Concepts in Diffusion Models
Wang Lin
Jingyuan Chen
Jiaxin Shi
Yichen Zhu
Chen Liang
...
Tao Jin
Zhou Zhao
Fei Wu
Shuicheng Yan
Hanwang Zhang
DiffM
42
11
0
11 May 2024
Distilling Diffusion Models into Conditional GANs
Minguk Kang
Richard Zhang
Connelly Barnes
Sylvain Paris
Suha Kwak
Jaesik Park
Eli Shechtman
Jun-Yan Zhu
Taesung Park
38
36
0
09 May 2024
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
Peng Gao
Le Zhuo
Ziyi Lin
Ruoyi Du
Xu Luo
...
Weicai Ye
He Tong
Jingwen He
Yu Qiao
Hongsheng Li
VGen
35
82
0
09 May 2024
Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models
Hongjie Wang
Difan Liu
Yan Kang
Yijun Li
Zhe Lin
N. Jha
Yuchen Liu
27
12
0
08 May 2024
FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation
Xuehai He
Jian Zheng
Jacob Zhiyuan Fang
Robinson Piramuthu
Mohit Bansal
Vicente Ordonez
Gunnar A. Sigurdsson
Nanyun Peng
Xin Eric Wang
DiffM
45
1
0
08 May 2024