ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
v1v2 (latest)

InstructPix2Pix: Learning to Follow Image Editing Instructions

Computer Vision and Pattern Recognition (CVPR), 2022
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXiv (abs)PDFHTMLHuggingFace (4 upvotes)

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,733 papers shown
UniSER: A Foundation Model for Unified Soft Effects Removal
UniSER: A Foundation Model for Unified Soft Effects Removal
Jingdong Zhang
Lingzhi Zhang
Qing Liu
M. Chiu
Connelly Barnes
...
Eli Shechtman
Sohrab Amirghodsi
Xin Li
Wenping Wang
Xiaohang Zhan
DiffM
162
0
0
18 Nov 2025
InstructMix2Mix: Consistent Sparse-View Editing Through Multi-View Model Personalization
InstructMix2Mix: Consistent Sparse-View Editing Through Multi-View Model Personalization
Daniel Gilo
Or Litany
168
0
0
18 Nov 2025
Part-X-MLLM: Part-aware 3D Multimodal Large Language Model
Part-X-MLLM: Part-aware 3D Multimodal Large Language Model
Chunshi Wang
Junliang Ye
Yunhan Yang
Yang Li
Zizhuo Lin
Jun Zhu
Zhuo Chen
Yawei Luo
Chunchao Guo
MLLM
195
0
0
17 Nov 2025
Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting
Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting
Jiangnan Ye
Jiedong Zhuang
Lianrui Mu
Wenjie Zheng
Jiaqi Hu
Xingze Zou
Jing Wang
Haoji Hu
3DGS
184
0
0
17 Nov 2025
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data
Yunxin Li
Xinyu Chen
Shenyuan Jiang
Haoyuan Shi
Zhenyu Liu
...
Zhenran Xu
Yicheng Ma
Meishan Zhang
Baotian Hu
Min Zhang
MLLMMoEOSLMVLM
619
1
0
16 Nov 2025
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation
Haozhe Liu
Ding Liu
Mingchen Zhuge
Zijian Zhou
Tian Xie
...
Juan-Manuel Perez-Rua
Tao Xiang
Wei Liu
Shikun Liu
Jürgen Schmidhuber
105
0
0
15 Nov 2025
Image-POSER: Reflective RL for Multi-Expert Image Generation and Editing
Image-POSER: Reflective RL for Multi-Expert Image Generation and Editing
Hossein Mohebbi
Mohammed Abdulrahman
Yanting Miao
Pascal Poupart
Suraj Kothawade
DiffMOffRL
197
0
0
15 Nov 2025
SimuFreeMark: A Noise-Simulation-Free Robust Watermarking Against Image Editing
SimuFreeMark: A Noise-Simulation-Free Robust Watermarking Against Image Editing
Yichao Tang
Mingyang Li
Di Miao
Sheng Li
Zhenxing Qian
Xinpeng Zhang
92
0
0
14 Nov 2025
Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency Partition
Towards Fine-Grained Interpretability: Counterfactual Explanations for Misclassification with Saliency PartitionComputer Vision and Pattern Recognition (CVPR), 2025
Lintong Zhang
Kang Yin
Seong-Whan Lee
FAtt
464
0
0
11 Nov 2025
VectorSynth: Fine-Grained Satellite Image Synthesis with Structured Semantics
VectorSynth: Fine-Grained Satellite Image Synthesis with Structured Semantics
Daniel Cher
Brian Wei
Srikumar Sastry
Nathan Jacobs
DiffMVGen
164
0
0
11 Nov 2025
LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer Learning
LayerEdit: Disentangled Multi-Object Editing via Conflict-Aware Multi-Layer LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2025
Fengyi Fu
Mengqi Huang
Lei Zhang
Zhendong Mao
220
0
0
11 Nov 2025
Generative AI Meets 6G and Beyond: Diffusion Models for Semantic Communications
Generative AI Meets 6G and Beyond: Diffusion Models for Semantic Communications
Hai-Long Qin
Jincheng Dai
Guo Lu
Shuo Shao
Sixian Wang
Tongda Xu
Wenjun Zhang
Ping Zhang
Khaled B. Letaief
DiffMVLM
422
0
0
11 Nov 2025
Top2Ground: A Height-Aware Dual Conditioning Diffusion Model for Robust Aerial-to-Ground View Generation
Top2Ground: A Height-Aware Dual Conditioning Diffusion Model for Robust Aerial-to-Ground View Generation
Jae Joong Lee
Bedrich Benes
DiffM
136
0
0
11 Nov 2025
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
DIMO: Diverse 3D Motion Generation for Arbitrary Objects
Linzhan Mou
Jiahui Lei
Chen Wang
Lingjie Liu
Kostas Daniilidis
VGen
182
1
0
10 Nov 2025
FreeControl: Efficient, Training-Free Structural Control via One-Step Attention Extraction
FreeControl: Efficient, Training-Free Structural Control via One-Step Attention Extraction
Jiang Lin
Xinyu Chen
Song Wu
Zhiqiu Zhang
Jizhi Zhang
Ye Wang
Qiang Tang
Qian Wang
Jian Yang
Zili Yi
DiffM
132
0
0
07 Nov 2025
Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization
Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization
Connor Dunlop
Matthew Zheng
Kavana Venkatesh
Pinar Yanardag
DiffM
93
1
0
06 Nov 2025
EVLP:Learning Unified Embodied Vision-Language Planner with Reinforced Supervised Fine-Tuning
EVLP:Learning Unified Embodied Vision-Language Planner with Reinforced Supervised Fine-Tuning
Xinyan Cai
Shiguang Wu
Dafeng Chi
Yuzheng Zhuang
Xingyue Quan
Jianye Hao
Qiang Guan
103
0
0
03 Nov 2025
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation
Yongyuan Liang
Wei Chow
Feng Li
Ziqiao Ma
Xiyao Wang
Jiageng Mao
Jiuhai Chen
Jiatao Gu
Y. Wang
Furong Huang
LRM
240
1
0
03 Nov 2025
Example-Based Feature Painting on Textures
Example-Based Feature Painting on Textures
Andrei-Timotei Ardelean
Tim Weyrich
DiffM
198
0
0
03 Nov 2025
UniREditBench: A Unified Reasoning-based Image Editing Benchmark
UniREditBench: A Unified Reasoning-based Image Editing Benchmark
Feng Han
Y. Wang
Chenglin Li
Zheming Liang
Dianyi Wang
...
Zhipeng Wei
Chao Gong
Cheng Jin
Yue Yu
J. Wang
194
2
0
03 Nov 2025
Med-Banana-50K: A Cross-modality Large-Scale Dataset for Text-guided Medical Image Editing
Med-Banana-50K: A Cross-modality Large-Scale Dataset for Text-guided Medical Image Editing
Zhihui Chen
Mengling Feng
MedImLM&MA
386
0
0
02 Nov 2025
BlurGuard: A Simple Approach for Robustifying Image Protection Against AI-Powered Editing
BlurGuard: A Simple Approach for Robustifying Image Protection Against AI-Powered Editing
J. Kim
Yunhun Nam
Minseon Kim
Sangpil Kim
Jongheon Jeong
AAMLDiffM
220
0
0
31 Oct 2025
Understanding the Implicit User Intention via Reasoning with Large Language Model for Image Editing
Understanding the Implicit User Intention via Reasoning with Large Language Model for Image Editing
Yijia Wang
Yiqing Shen
Weiming Chen
Z. He
DiffM
145
0
0
31 Oct 2025
FreeSliders: Training-Free, Modality-Agnostic Concept Sliders for Fine-Grained Diffusion Control in Images, Audio, and Video
FreeSliders: Training-Free, Modality-Agnostic Concept Sliders for Fine-Grained Diffusion Control in Images, Audio, and Video
Rotem Ezra
Hedi Zisling
Nimrod Berman
Ilan Naiman
Alexey Gorkor
Liran Nochumsohn
Eliya Nachmani
Omri Azencot
DiffM
162
0
0
30 Oct 2025
Emu3.5: Native Multimodal Models are World Learners
Emu3.5: Native Multimodal Models are World Learners
Yufeng Cui
Honghao Chen
Haoge Deng
X. Y. Huang
Xinghang Li
...
Zhuo Chen
Yulong Ao
Tiejun Huang
Zhongyuan Wang
Xinlong Wang
MLLMVGen
460
18
0
30 Oct 2025
Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification
Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification
William Yang
Xindi Wu
Zhiwei Deng
Esin Tureci
Olga Russakovsky
DiffM
150
0
0
28 Oct 2025
Group Relative Attention Guidance for Image Editing
Group Relative Attention Guidance for Image Editing
Xuanpu Zhang
Xuesong Niu
Ruidong Chen
Dan Song
Jianhao Zeng
Penghui Du
Haoxiang Cao
Kai Wu
An-an Liu
DiffM
211
0
0
28 Oct 2025
Neural USD: An object-centric framework for iterative editing and control
Neural USD: An object-centric framework for iterative editing and control
Alejandro Escontrela
Shrinu Kushagra
Sjoerd van Steenkiste
Yulia Rubanova
Aleksander Holynski
Kelsey R. Allen
Kevin Murphy
Thomas Kipf
DiffM
148
0
0
28 Oct 2025
UniAIDet: A Unified and Universal Benchmark for AI-Generated Image Content Detection and Localization
UniAIDet: A Unified and Universal Benchmark for AI-Generated Image Content Detection and Localization
Huixuan Zhang
Xiaojun Wan
EGVM
173
0
0
27 Oct 2025
LightFusion: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation
LightFusion: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation
Zeyu Wang
Z. Chen
Chenhui Gou
Feng Li
Chaorui Deng
...
Kunchang Li
Weihao Yu
Haoqin Tu
Haoqi Fan
Cihang Xie
364
0
0
27 Oct 2025
SAO-Instruct: Free-form Audio Editing using Natural Language Instructions
SAO-Instruct: Free-form Audio Editing using Natural Language Instructions
Michael Ungersböck
Florian Grötschla
Luca A. Lanzendörfer
June Young Yi
Changho Choi
Roger Wattenhofer
AuLLM
165
1
0
26 Oct 2025
GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation
GeoDiffusion: A Training-Free Framework for Accurate 3D Geometric Conditioning in Image Generation
Phillip Mueller
Talip Uenlue
Sebastian Schmidt
Marcel Kollovieh
Jiajie Fan
Stephan Guennemann
Lars Mikelsons
116
0
0
25 Oct 2025
Bridging the gap to real-world language-grounded visual concept learning
Bridging the gap to real-world language-grounded visual concept learning
Whie Jung
Semin Kim
Junee Kim
Seunghoon Hong
152
0
0
24 Oct 2025
EditInfinity: Image Editing with Binary-Quantized Generative Models
EditInfinity: Image Editing with Binary-Quantized Generative Models
Jiahuan Wang
Yuxin Chen
Jun Yu
Guangming Lu
Wenjie Pei
218
1
0
23 Oct 2025
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
AutoScape: Geometry-Consistent Long-Horizon Scene Generation
Jiacheng Chen
Ziyu Jiang
Mingfu Liang
Bingbing Zhuang
Jong-Chyi Su
Sparsh Garg
Ying Wu
Manmohan Chandraker
VGen
154
1
0
23 Oct 2025
[De|Re]constructing VLMs' Reasoning in Counting
[De|Re]constructing VLMs' Reasoning in Counting
Simone Alghisi
Gabriel Roccabruna
Massimo Rizzoli
Seyed Mahed Mousavi
Giuseppe Riccardi
ReLMLRMVLM
206
1
0
22 Oct 2025
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
Yusu Qian
Eli Bocek-Rivele
Liangchen Song
Jialing Tong
Yinfei Yang
Jiasen Lu
Wenze Hu
Zhe Gan
118
9
0
22 Oct 2025
ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization
ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization
Yuanhe Guo
Linxi Xie
Zhuoran Chen
Kangrui Yu
Ryan Po
Guandao Yang
Gordon Wetztein
Hongyi Wen
VLM
88
0
0
21 Oct 2025
Beyond Frequency: Scoring-Driven Debiasing for Object Detection via Blueprint-Prompted Image Synthesis
Beyond Frequency: Scoring-Driven Debiasing for Object Detection via Blueprint-Prompted Image Synthesis
Xinhao Cai
Liulei Li
Gensheng Pei
Tao Chen
Jinshan Pan
Yazhou Yao
Wenguan Wang
170
0
0
21 Oct 2025
HIDISC: A Hyperbolic Framework for Domain Generalization with Generalized Category Discovery
HIDISC: A Hyperbolic Framework for Domain Generalization with Generalized Category Discovery
Vaibhav Rathore
Divyam Gupta
Biplab Banerjee
127
0
0
20 Oct 2025
UniRL-Zero: Reinforcement Learning on Unified Models with Joint Language Model and Diffusion Model Experts
UniRL-Zero: Reinforcement Learning on Unified Models with Joint Language Model and Diffusion Model Experts
Fu-Yun Wang
Han Zhang
Michael Gharbi
Hongsheng Li
Taesung Park
150
0
0
20 Oct 2025
Personalized Image Filter: Mastering Your Photographic Style
Personalized Image Filter: Mastering Your Photographic Style
Chengxuan Zhu
Shuchen Weng
Jiacong Fang
Peixuan Zhang
Si Li
Chao Xu
Boxin Shi
DiffM
157
0
0
19 Oct 2025
From Mannequin to Human: A Pose-Aware and Identity-Preserving Video Generation Framework for Lifelike Clothing Display
From Mannequin to Human: A Pose-Aware and Identity-Preserving Video Generation Framework for Lifelike Clothing Display
Xiangyu Mu
Dongliang Zhou
Jie Hou
Haijun Zhang
Weili Guan
DiffM
193
1
0
19 Oct 2025
Region in Context: Text-condition Image editing with Human-like semantic reasoning
Region in Context: Text-condition Image editing with Human-like semantic reasoning
Thuy Phuong Vu
Dinh-Cuong Hoang
Minhhuy Le
Phan Xuan Tan
DiffM
125
0
0
19 Oct 2025
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback
Zongjian Li
Zheyuan Liu
Qihui Zhang
Bin Lin
Feize Wu
...
Wangbo Yu
Yuwei Niu
Shaodong Wang
Xinhua Cheng
Li Yuan
406
13
0
19 Oct 2025
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
Qingyan Bai
Qiuyu Wang
Hao Ouyang
Yue Yu
Hanlin Wang
...
Yanhong Zeng
Zichen Liu
Yinghao Xu
Yujun Shen
Qifeng Chen
VGen
375
11
0
17 Oct 2025
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery
Jie-Ying Lee
Yi-Ruei Liu
Shr-Ruei Tsai
Wei-Cheng Chang
Chung-Ho Wu
Jiewen Chan
Zhenjun Zhao
Chieh Hubert Lin
Yu-Lun Liu
3DGS
278
6
0
17 Oct 2025
LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal
LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal
Shr-Ruei Tsai
Wei-Cheng Chang
Jie-Ying Lee
Chih-Hai Su
Yu-Lun Liu
DiffM
181
5
0
17 Oct 2025
BLIP3o-NEXT: Next Frontier of Native Image Generation
BLIP3o-NEXT: Next Frontier of Native Image Generation
Jiuhai Chen
Le Xue
Zhiyang Xu
Xichen Pan
Shusheng Yang
...
Tianyi Zhou
Junnan Li
Silvio Savarese
Caiming Xiong
Ran Xu
113
13
0
17 Oct 2025
Salient Concept-Aware Generative Data Augmentation
Salient Concept-Aware Generative Data Augmentation
Tianchen Zhao
Xuanbai Chen
Zhihua Li
J. Fang
Dongsheng An
Xiang Xu
Zhuowen Tu
Yifan Xing
DiffM
206
0
0
16 Oct 2025
Previous
12345...333435
Next
Page 2 of 35
Pageof 35