ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions

InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXivPDFHTML

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 290 papers shown
Title
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Q. He
Jinlong Peng
P. Xu
Boyuan Jiang
Xiaobin Hu
...
Y. Liu
Y. Wang
Chengjie Wang
X. Li
J. Zhang
DiffM
120
1
0
04 Dec 2024
OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking
OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking
X. Zhang
Zecheng Tang
Zhipei Xu
Runyi Li
Youmin Xu
Bin Chen
Feng Gao
Jian Andrew Zhang
WIGM
93
4
0
02 Dec 2024
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
Sudarshan Rajagopalan
Nithin Gopalakrishnan Nair
Jay N. Paranjape
Vishal M. Patel
DiffM
90
0
0
26 Nov 2024
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
Hanhui Wang
Yihua Zhang
Ruizheng Bai
Yue Zhao
Sijia Liu
Z. Tu
AAML
PICV
95
2
0
25 Nov 2024
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
P. Xu
Boyuan Jiang
Xiaobin Hu
Donghao Luo
Q. He
J. Zhang
Chengjie Wang
Yunsheng Wu
Charles X. Ling
Boyu Wang
87
2
0
24 Nov 2024
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
Qifan Yu
Wei Chow
Zhongqi Yue
Kaihang Pan
Yang Wu
Xiaoyang Wan
Juncheng Billy Li
Siliang Tang
H. Zhang
Yueting Zhuang
DiffM
95
15
0
24 Nov 2024
GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
GIFT: A Framework for Global Interpretable Faithful Textual Explanations of Vision Classifiers
Éloi Zablocki
Valentin Gerard
Amaia Cardiel
Eric Gaussier
Matthieu Cord
Eduardo Valle
69
0
0
23 Nov 2024
FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video
Jiawei Zhang
Zijian Wu
Zhiyang Liang
Yicheng Gong
Dongfang Hu
Yao Yao
Xun Cao
Hao Zhu
3DGS
82
1
0
23 Nov 2024
ColorEdit: Training-free Image-Guided Color editing with diffusion model
ColorEdit: Training-free Image-Guided Color editing with diffusion model
Xingxi Yin
Zhi Li
Jingfeng Zhang
Chenglin Li
Yin Zhang
DiffM
47
0
0
15 Nov 2024
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision
Cong Wei
Zheyang Xiong
Weiming Ren
Xinrun Du
Ge Zhang
Wenhu Chen
99
18
0
11 Nov 2024
Extreme Rotation Estimation in the Wild
Extreme Rotation Estimation in the Wild
Hana Bezalel
Dotan Ankri
Ruojin Cai
Hadar Averbuch-Elor
18
2
0
11 Nov 2024
NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields
NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields
Eric Zhu
Mara Levy
M. Gwilliam
Abhinav Shrivastava
40
0
0
04 Nov 2024
X-Drive: Cross-modality consistent multi-sensor data synthesis for
  driving scenarios
X-Drive: Cross-modality consistent multi-sensor data synthesis for driving scenarios
Yichen Xie
Chenfeng Xu
C-T.John Peng
Shuqi Zhao
Nhat Ho
Alexander T. Pham
Mingyu Ding
M. Tomizuka
W. Zhan
DiffM
31
2
0
02 Nov 2024
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances
Shilin Lu
Zihan Zhou
Jiayou Lu
Yuanzhi Zhu
A. Kong
WIGM
78
10
0
24 Oct 2024
Progressive Compositionality in Text-to-Image Generative Models
Progressive Compositionality in Text-to-Image Generative Models
Xu Han
Linghao Jin
Xiaofeng Liu
Paul Pu Liang
CoGe
93
2
0
22 Oct 2024
MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical
  Image Applications
MedDiff-FM: A Diffusion-based Foundation Model for Versatile Medical Image Applications
Yongrui Yu
Yannian Gu
S. Zhang
Xiaofan Zhang
MedIm
23
2
0
20 Oct 2024
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Jinbin Bai
Tian-Chun Ye
Wei Chow
Enxin Song
Qing-Guo Chen
Xiangtai Li
Zhen Dong
Lei Zhu
50
13
0
10 Oct 2024
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image
  Editing
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing
June Suk Choi
Kyungmin Lee
Jongheon Jeong
Saining Xie
Jinwoo Shin
Kimin Lee
DiffM
AAML
23
2
0
08 Oct 2024
Revealing Directions for Text-guided 3D Face Editing
Revealing Directions for Text-guided 3D Face Editing
Zhuo Chen
Yichao Yan
Sehngqi Liu
Yuhao Cheng
Weiming Zhao
Lincheng Li
Mengxiao Bi
Xiaokang Yang
DiffM
30
0
0
07 Oct 2024
FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic Manipulation
FoAM: Foresight-Augmented Multi-Task Imitation Policy for Robotic Manipulation
Litao Liu
Wentao Wang
Yifan Han
Zhuoli Xie
Pengfei Yi
Junyan Li
Yi Qin
Wenzhao Lian
32
2
0
29 Sep 2024
Word2Wave: Language Driven Mission Programming for Efficient Subsea Deployments of Marine Robots
Word2Wave: Language Driven Mission Programming for Efficient Subsea Deployments of Marine Robots
Ruo Chen
David Blow
Adnan Abdullah
Md Jahidul Islam
38
1
0
27 Sep 2024
MIO: A Foundation Model on Multimodal Tokens
MIO: A Foundation Model on Multimodal Tokens
Zekun Wang
King Zhu
Chunpu Xu
Wangchunshu Zhou
Jiaheng Liu
...
Yuanxing Zhang
Ge Zhang
Ke Xu
Jie Fu
Wenhao Huang
MLLM
AuLLM
48
11
0
26 Sep 2024
Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model
Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model
Hongliang Zhong
Can Wang
Jingbo Zhang
Jing Liao
3DGS
DiffM
33
2
0
25 Sep 2024
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
Weifeng Lin
Xinyu Wei
Renrui Zhang
Le Zhuo
Shitian Zhao
...
Junlin Xie
Junlin Xie
Yu Qiao
Peng Gao
Hongsheng Li
MLLM
DiffM
50
10
0
23 Sep 2024
MaterialFusion: Enhancing Inverse Rendering with Material Diffusion Priors
MaterialFusion: Enhancing Inverse Rendering with Material Diffusion Priors
Yehonathan Litman
Or Patashnik
Kangle Deng
Aviral Agrawal
Rushikesh Zawar
Fernando de la Torre
Shubham Tulsiani
31
5
0
23 Sep 2024
Dormant: Defending against Pose-driven Human Image Animation
Dormant: Defending against Pose-driven Human Image Animation
Jiachen Zhou
Mingsi Wang
Tianlin Li
Guozhu Meng
Kai Chen
44
3
0
22 Sep 2024
DNI: Dilutional Noise Initialization for Diffusion Video Editing
DNI: Dilutional Noise Initialization for Diffusion Video Editing
Sunjae Yoon
Gwanhyeong Koo
Ji Woo Hong
Chang D. Yoo
DiffM
21
2
0
19 Sep 2024
ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation
ORB-SfMLearner: ORB-Guided Self-supervised Visual Odometry with Selective Online Adaptation
Yanlin Jin
Rui-Yang Ju
Haojun Liu
Yuzhong Zhong
19
0
0
18 Sep 2024
TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer
TextureDiffusion: Target Prompt Disentangled Editing for Various Texture Transfer
Zihan Su
Junhao Zhuang
Chun Yuan
DiffM
34
0
0
15 Sep 2024
Data Augmentation via Latent Diffusion for Saliency Prediction
Data Augmentation via Latent Diffusion for Saliency Prediction
Bahar Aydemir
Deblina Bhattacharjee
Tong Zhang
Mathieu Salzmann
Sabine Süsstrunk
20
1
0
11 Sep 2024
Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records
Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records
Daeun Kyung
J. Kim
Tackeun Kim
E. Choi
MedIm
DiffM
34
1
0
11 Sep 2024
NeIn: Telling What You Don't Want
NeIn: Telling What You Don't Want
Nhat-Tan Bui
Dinh-Hieu Hoang
Quoc-Huy Trinh
Minh-Triet Tran
Truong Nguyen
Susan Gauch
29
2
0
09 Sep 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Jiaxin Cheng
Zixu Zhao
Tong He
Tianjun Xiao
Yicong Zhou
Zheng Zhang
DiffM
34
0
0
07 Sep 2024
Training-Free Sketch-Guided Diffusion with Latent Optimization
Training-Free Sketch-Guided Diffusion with Latent Optimization
Sandra Zhang Ding
Jiafeng Mao
Kiyoharu Aizawa
DiffM
86
1
0
31 Aug 2024
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language
  Models
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Qirui Jiao
Daoyuan Chen
Yilun Huang
Yaliang Li
Ying Shen
VLM
27
5
0
08 Aug 2024
InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian
  Splatting
InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting
Xin-Yi Yu
Jun-Xin Yu
Li-Bo Zhou
Yan Wei
Lin-Lin Ou
3DGS
22
4
0
08 Aug 2024
Learning Feature-Preserving Portrait Editing from Generated Pairs
Learning Feature-Preserving Portrait Editing from Generated Pairs
Bowei Chen
Tiancheng Zhi
Peihao Zhu
Shen Sang
Jing Liu
Linjie Luo
DiffM
17
0
0
29 Jul 2024
Answerability Fields: Answerable Location Estimation via Diffusion
  Models
Answerability Fields: Answerable Location Estimation via Diffusion Models
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
M. Kawanabe
DiffM
29
0
0
26 Jul 2024
StylusAI: Stylistic Adaptation for Robust German Handwritten Text
  Generation
StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation
Nauman Riaz
S. Saifullah
S. Agne
Andreas Dengel
Sheraz Ahmed
DiffM
26
0
0
22 Jul 2024
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models
Zheng Chong
Xiao Dong
Haoxiang Li
Shiyue Zhang
Wenqing Zhang
Xujie Zhang
Hanqing Zhao
D. Jiang
Xiaodan Liang
DiffM
48
17
0
21 Jul 2024
Controlling Space and Time with Diffusion Models
Controlling Space and Time with Diffusion Models
Daniel Watson
Saurabh Saxena
Lala Li
Andrea Tagliasacchi
David J. Fleet
VGen
56
27
0
10 Jul 2024
Sketch-Guided Scene Image Generation
Sketch-Guided Scene Image Generation
Tianyu Zhang
Xiaoxuan Xie
Xusheng Du
H. Xie
DiffM
33
2
0
09 Jul 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and
  Editing
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
Zhenyu Wang
Aoxue Li
Zhenguo Li
Xihui Liu
MLLM
DiffM
36
25
0
08 Jul 2024
GenderBias-\emph{VL}: Benchmarking Gender Bias in Vision Language Models
  via Counterfactual Probing
GenderBias-\emph{VL}: Benchmarking Gender Bias in Vision Language Models via Counterfactual Probing
Yisong Xiao
Aishan Liu
QianJia Cheng
Zhenfei Yin
Siyuan Liang
Jiapeng Li
Jing Shao
Xianglong Liu
Dacheng Tao
26
4
0
30 Jun 2024
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
Yicheng Chen
Xiangtai Li
Yining Li
Yanhong Zeng
Jianzong Wu
Xiangyu Zhao
Kai Chen
VLM
DiffM
54
3
0
28 Jun 2024
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
Ivan Villa-Renteria
Mason L. Wang
Zachary Shah
Zhe Li
Soohyun Kim
Neelesh Ramachandran
Mert Pilanci
29
0
0
27 Jun 2024
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
Yuang Peng
Yuxin Cui
Haomiao Tang
Zekun Qi
Runpei Dong
Jing Bai
Chunrui Han
Zheng Ge
Xiangyu Zhang
Shu-Tao Xia
EGVM
72
30
0
24 Jun 2024
A3D: Does Diffusion Dream about 3D Alignment?
A3D: Does Diffusion Dream about 3D Alignment?
Savva Ignatyev
Nina Konovalova
Daniil Selikhanovych
Nikolay Patakin
Nikolay Patakin
...
Anton Konushin
Peter Wonka
Alexander Filippov
Peter Wonka
Evgeny Burnaev
DiffM
58
0
0
21 Jun 2024
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data
V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data
Rotem Shalev-Arkushin
Aharon Azulay
Tavi Halperin
Eitan Richardson
Amit H. Bermano
Ohad Fried
DiffM
34
0
0
20 Jun 2024
Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation
Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation
Eyal Michaeli
Ohad Fried
44
1
0
20 Jun 2024
Previous
123456
Next