ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
v1v2 (latest)

InstructPix2Pix: Learning to Follow Image Editing Instructions

Computer Vision and Pattern Recognition (CVPR), 2022
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXiv (abs)PDFHTMLHuggingFace (4 upvotes)

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,733 papers shown
PRINTER:Deformation-Aware Adversarial Learning for Virtual IHC Staining with In Situ Fidelity
PRINTER:Deformation-Aware Adversarial Learning for Virtual IHC Staining with In Situ Fidelity
Yizhe Yuan
Bingsen Xue
Bangzheng Pu
Chengxiang Wang
Cheng Jin
85
1
0
01 Sep 2025
CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation
CompSlider: Compositional Slider for Disentangled Multiple-Attribute Image Generation
Zixin Zhu
Kevin Duarte
Mamshad Nayeem Rizve
Chengyuan Xu
Ratheesh Kalarot
Junsong Yuan
DiffM
230
1
0
31 Aug 2025
Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation
Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation
Xuechao Zou
Shun Zhang
Xing Fu
Y. Li
Kai Li
Yushe Cao
Congyan Lang
Pin Tao
Junliang Xing
DiffM
191
0
0
30 Aug 2025
3D-LATTE: Latent Space 3D Editing from Textual Instructions
3D-LATTE: Latent Space 3D Editing from Textual Instructions
Maria Parelli
Michael Oechsle
Michael Niemeyer
Federico Tombari
Andreas Geiger
DiffM
291
2
0
29 Aug 2025
Describe, Don't Dictate: Semantic Image Editing with Natural Language Intent
Describe, Don't Dictate: Semantic Image Editing with Natural Language Intent
En Ci
Shanyan Guan
Yanhao Ge
Yilin Zhang
Wei-Jang Li
Zhenyu Zhang
Jian Yang
Ying Tai
DiffM
98
2
0
28 Aug 2025
Evaluating Compositional Generalisation in VLMs and Diffusion Models
Evaluating Compositional Generalisation in VLMs and Diffusion Models
Beth Pearson
Bilal Boulbarss
Michael Wray
Martha Lewis
DiffMCoGe
152
1
0
28 Aug 2025
DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes
DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes
Yajiao Xiong
Xiaoyu Zhou
Yongtao Wan
Deqing Sun
Ming-Hsuan Yang
3DGS3DV
130
3
0
28 Aug 2025
CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models
CraftGraffiti: Exploring Human Identity with Custom Graffiti Art via Facial-Preserving Diffusion Models
Ayan Banerjee
Fernando Vilariño
Josep Lladós
DiffM
113
0
0
28 Aug 2025
Articulate3D: Zero-Shot Text-Driven 3D Object Posing
Articulate3D: Zero-Shot Text-Driven 3D Object Posing
Oishi Deb
Anjun Hu
Ashkan Khakzar
Juil Sock
Christian Rupprecht
81
0
0
26 Aug 2025
OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation
OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation
Jianwen Jiang
Weihong Zeng
Zerong Zheng
Jiaqi Yang
Chao Liang
Wang Liao
Han Liang
Yuan Zhang
Mingyuan Gao
VGen
131
12
0
26 Aug 2025
Propose and Rectify: A Forensics-Driven MLLM Framework for Image Manipulation Localization
Propose and Rectify: A Forensics-Driven MLLM Framework for Image Manipulation Localization
Keyang Zhang
Chenqi Kong
Hui Liu
Bo Ding
Xinghao Jiang
Haoliang Li
128
2
0
25 Aug 2025
From Global to Local: Social Bias Transfer in CLIP
From Global to Local: Social Bias Transfer in CLIP
Ryan Ramos
Yusuke Hirota
Yuta Nakashima
Noa Garcia
118
0
0
25 Aug 2025
SpotEdit: Evaluating Visually-Guided Image Editing Methods
SpotEdit: Evaluating Visually-Guided Image Editing Methods
Sara Ghazanfari
Wei-An Lin
Haitong Tian
Ersin Yumer
DiffM
151
0
0
25 Aug 2025
An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing
An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing
Zihan Liang
Jiahao Sun
Haoran Ma
DiffM
87
1
0
24 Aug 2025
PromptFlare: Prompt-Generalized Defense via Cross-Attention Decoy in Diffusion-Based Inpainting
PromptFlare: Prompt-Generalized Defense via Cross-Attention Decoy in Diffusion-Based Inpainting
Hohyun Na
Seunghoo Hong
Simon S. Woo
AAMLDiffM
120
0
0
22 Aug 2025
GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment Design
GenTune: Toward Traceable Prompts to Improve Controllability of Image Refinement in Environment DesignACM Symposium on User Interface Software and Technology (UIST), 2025
Wen-Fan Wang
Ting-Ying Lee
Chien-Ting Lu
Che-Wei Hsu
Nil Ponsa Campany
Yu-Mei Chen
Mike Y. Chen
Bing-Yu Chen
DiffM
168
2
0
21 Aug 2025
Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization
Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization
Canyu Zhao
Xiaoman Li
Tianjian Feng
Zhiyue Zhao
Hao Chen
Chunhua Shen
DiffMVGen
187
2
0
20 Aug 2025
AnchorSync: Global Consistency Optimization for Long Video Editing
AnchorSync: Global Consistency Optimization for Long Video Editing
Zichi Liu
Yinggui Wang
Tao Wei
Chao Ma
DiffMVGen
155
0
0
20 Aug 2025
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Ouroboros: Single-step Diffusion Models for Cycle-consistent Forward and Inverse Rendering
Shanlin Sun
Yifan Wang
Hanwen Zhang
Yifeng Xiong
Qin Ren
Ruogu Fang
Xiaohui Xie
Chenyu You
172
4
0
20 Aug 2025
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications
Omkar Thawakar
Dmitry Demidov
Ritesh Thawkar
Rao Muhammad Anwer
M. Shah
Fahad Shahbaz Khan
Salman Khan
VGen
100
1
0
19 Aug 2025
Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping
Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping
Siddharth Khandelwal
Sridhar Kamath
Arjun Jain
DiffM
214
0
0
18 Aug 2025
Single-Reference Text-to-Image Manipulation with Dual Contrastive Denoising Score
Single-Reference Text-to-Image Manipulation with Dual Contrastive Denoising Score
Syed Muhmmad Israr
Feng Zhao
DiffM
155
0
0
18 Aug 2025
CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion
CoreEditor: Consistent 3D Editing via Correspondence-constrained Diffusion
Zhe Zhu
Honghua Chen
Peng Li
Mingqiang Wei
DiffM
149
1
0
15 Aug 2025
SPG: Style-Prompting Guidance for Style-Specific Content Creation
SPG: Style-Prompting Guidance for Style-Specific Content Creation
Qian Liang
Zichong Chen
Yang Zhou
Hui Huang
DiffM
130
0
0
15 Aug 2025
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale
NextStep Team
Chunrui Han
Guopeng Li
J. Wu
Quan Sun
...
Ziyang Meng
Binxing Jiao
Daxin Jiang
X. Zhang
Yibo Zhu
DiffM
216
22
0
14 Aug 2025
Empowering Multimodal LLMs with External Tools: A Comprehensive Survey
Empowering Multimodal LLMs with External Tools: A Comprehensive Survey
Wenbin An
Jiahao Nie
Yaqiang Wu
Feng Tian
Shijian Lu
Q. Zheng
MLLM
183
1
0
14 Aug 2025
A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation
A Survey on 3D Gaussian Splatting Applications: Segmentation, Editing, and Generation
Shuting He
Peilin Ji
Yitong Yang
Changshuo Wang
Jiayi Ji
Yinglin Wang
Henghui Ding
3DGS
296
9
0
13 Aug 2025
SVG-Head: Hybrid Surface-Volumetric Gaussians for High-Fidelity Head Reconstruction and Real-Time Editing
SVG-Head: Hybrid Surface-Volumetric Gaussians for High-Fidelity Head Reconstruction and Real-Time Editing
Heyi Sun
Cong Wang
Tian-Xing Xu
Jingwei Huang
Di Kang
Chunchao Guo
Song-Hai Zhang
3DGS
140
2
0
13 Aug 2025
Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Junyan Ye
Shihong Deng
Zihao Wang
Leqi Zhu
Zhenghao Hu
...
Zhiyuan Yan
Jinghua Yu
Jiaming Song
Conghui He
Weijia Li
VLM
216
38
0
13 Aug 2025
Stable Diffusion Models are Secretly Good at Visual In-Context Learning
Stable Diffusion Models are Secretly Good at Visual In-Context Learning
Trevine Oorloff
Vishwanath Sindagi
Wele Gedara Chaminda Bandara
Ali Shafahi
Amin Ghiasi
Charan Prakash
R. Ardekani
DiffMVLM
167
3
0
13 Aug 2025
Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer
Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer
Zixin Yin
Xili Dai
Ling Chen
Deyu Zhou
Jianan Wang
Duomin Wang
Gang Yu
Lionel M. Ni
Lei Zhang
H. Shum
DiffM
145
1
0
12 Aug 2025
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
Joonghyuk Shin
Alchan Hwang
Yujin Kim
Daneul Kim
Jaesik Park
DiffM
122
4
0
11 Aug 2025
Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation
Splat4D: Diffusion-Enhanced 4D Gaussian Splatting for Temporally and Spatially Consistent Content Creation
Minghao Yin
Yukang Cao
Songyou Peng
Kai Han
3DGS
103
2
0
11 Aug 2025
Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing
Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing
Weitao Wang
Haoran Xu
Jun Meng
Haoqian Wang
DiffM
68
0
0
11 Aug 2025
TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning
TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning
Junzhe Xu
Yuyang Yin
Xi Chen
233
5
0
11 Aug 2025
CObL: Toward Zero-Shot Ordinal Layering without User Prompting
CObL: Toward Zero-Shot Ordinal Layering without User Prompting
Aneel Damaraju
D. Hazineh
Todd E. Zickler
BDL
127
0
0
11 Aug 2025
WeatherDiffusion: Controllable Weather Editing in Intrinsic Space
WeatherDiffusion: Controllable Weather Editing in Intrinsic Space
Yixin Zhu
Zuoliang Zhu
Jian Yang
Jian Yang
J. Xie
Beibei Wang
184
0
0
09 Aug 2025
CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing
CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing
Weiyan Xie
Han Gao
Didan Deng
Kaican Li
April Hua Liu
Yongxiang Huang
Nevin L. Zhang
DiffM
203
0
0
09 Aug 2025
NEP: Autoregressive Image Editing via Next Editing Token Prediction
NEP: Autoregressive Image Editing via Next Editing Token Prediction
Huimin Wu
Xiaojian Ma
Haozhe Zhao
Yanpeng Zhao
Qing Li
DiffM
146
2
0
08 Aug 2025
A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding
A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding
Mahmoud Chick Zaouali
Todd Charter
Yehor Karpichev
Brandon Haworth
Homayoun Najjjaran
3DGS
287
1
0
07 Aug 2025
Neural Speech Extraction with Human Feedback
Neural Speech Extraction with Human Feedback
Malek Itani
Ashton Graves
Sefik Emre Eskimez
Shyamnath Gollakota
79
1
0
05 Aug 2025
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models
Draw Your Mind: Personalized Generation via Condition-Level Modeling in Text-to-Image Diffusion Models
Hyungjin Kim
Seokho Ahn
Young-Duk Seo
DiffM
130
1
0
05 Aug 2025
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation
P. Wang
Yi Peng
Yimeng Gan
Liang Hu
Tianyidan Xie
...
Hongyang Wei
Eric Li
Xuchen Song
Yang Liu
Yahui Zhou
SyDa
132
10
0
05 Aug 2025
Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation
Zero Shot Domain Adaptive Semantic Segmentation by Synthetic Data Generation and Progressive Adaptation
Jun Luo
Zijing Zhao
Yang Liu
160
2
0
05 Aug 2025
EditGarment: An Instruction-Based Garment Editing Dataset Constructed with Automated MLLM Synthesis and Semantic-Aware Evaluation
EditGarment: An Instruction-Based Garment Editing Dataset Constructed with Automated MLLM Synthesis and Semantic-Aware Evaluation
Deqiang Yin
Junyi Guo
Huanda Lu
Fangyu Wu
Dongming Lu
182
0
0
05 Aug 2025
MILD: Multi-Layer Diffusion Strategy for Complex and Precise Multi-IP Aware Human Erasing
MILD: Multi-Layer Diffusion Strategy for Complex and Precise Multi-IP Aware Human Erasing
Jinghan Yu
Junhao Xiao
Zhiyuan Ma
Yue Ma
Kaiqi Liu
Yuhan Wang
Daizong Liu
Xianghao Meng
Jianjun Li
DiffM
199
0
0
05 Aug 2025
DreamPainter: Image Background Inpainting for E-commerce Scenarios
DreamPainter: Image Background Inpainting for E-commerce Scenarios
Sijie Zhao
Jing Cheng
Yaoyao Wu
Hao Xu
Shaohui Jiao
DiffM
114
0
0
04 Aug 2025
Optimal Transport for Rectified Flow Image Editing: Unifying Inversion-Based and Direct Methods
Optimal Transport for Rectified Flow Image Editing: Unifying Inversion-Based and Direct Methods
Marian Lupascu
Mihai-Sorin Stupariu
DiffM
259
0
0
04 Aug 2025
Qwen-Image Technical Report
Qwen-Image Technical Report
Chenfei Wu
Jiahao Nick Li
Jingren Zhou
Junyang Lin
Kaiyuan Gao
...
Yichang Zhang
Yongqiang Zhu
Y. Wu
Yuxuan Cai
Zenan Liu
DiffMVLM
349
239
0
04 Aug 2025
AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models
AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models
Die Chen
Zhongjie Duan
Ruoyao Xiao
Cen Chen
Daoyuan Chen
Yaliang Li
Yinda Chen
142
0
0
04 Aug 2025
Previous
123456...333435
Next
Page 5 of 35
Pageof 35