ResearchTrend.AI
InstructPix2Pix: Learning to Follow Image Editing Instructions

17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 290 papers shown
SynCDR : Training Cross Domain Retrieval Models with Synthetic Data
Samarth Mishra
Carlos D. Castillo
Hongcheng Wang
Kate Saenko
Venkatesh Saligrama
35
1
0
31 Dec 2023
ZONE: Zero-Shot Instruction-Guided Local Editing
Shanglin Li
Bo-Wen Zeng
Yutang Feng
Sicheng Gao
Xuhui Liu
...
Li Lin
Xu Tang
Yao Hu
Jianzhuang Liu
Baochang Zhang
DiffM
18
30
0
28 Dec 2023
Optimizing Diffusion Noise Can Serve As Universal Motion Priors
Korrawe Karunratanakul
Konpat Preechakul
Emre Aksan
Thabo Beeler
Supasorn Suwajanakorn
Siyu Tang
DiffM
14
37
0
19 Dec 2023
Scene-Conditional 3D Object Stylization and Composition
Jinghao Zhou
Tomas Jakab
Philip H. S. Torr
Christian Rupprecht
DiffM
65
2
0
19 Dec 2023
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Zeyinzi Jiang
Chaojie Mao
Yulin Pan
Zhen Han
Jingfeng Zhang
12
28
0
18 Dec 2023
SPIRE: Semantic Prompt-Driven Image Restoration
Chenyang Qi
Zhengzhong Tu
Keren Ye
M. Delbracio
P. Milanfar
Qifeng Chen
Hossein Talebi
DiffM
19
11
0
18 Dec 2023
Tell Me What You See: Text-Guided Real-World Image Denoising
E. Yosef
Raja Giryes
DiffM
45
2
0
15 Dec 2023
Relightful Harmonization: Lighting-aware Portrait Background Replacement
Mengwei Ren
Wei Xiong
Jae Shin Yoon
Zhixin Shu
Jianming Zhang
HyunJoon Jung
Guido Gerig
He Zhang
DiffM
23
16
0
11 Dec 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Ka Leong Cheng
Qiuyu Wang
Zifan Shi
Kecheng Zheng
Yinghao Xu
Ouyang Hao
Qifeng Chen
Yujun Shen
3DH
53
4
0
11 Dec 2023
Reality's Canvas, Language's Brush: Crafting 3D Avatars from Monocular Video
Yuchen Rao
Eduardo Pérez-Pellitero
Benjamin Busam
Yiren Zhou
Jifei Song
21
0
0
08 Dec 2023
Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images
Yiqun Zhang
Zhen Qin
Yang Liu
Dylan Campbell
9
2
0
07 Dec 2023
DiffusionSat: A Generative Foundation Model for Satellite Imagery
Samar Khanna
Patrick Liu
Linqi Zhou
Chenlin Meng
Robin Rombach
Marshall Burke
David B. Lobell
Stefano Ermon
14
55
0
06 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
23
150
0
05 Dec 2023
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Zhuoran Yu
Chenchen Zhu
Sean Culatana
Raghuraman Krishnamoorthi
Fanyi Xiao
Yong Jae Lee
109
13
0
04 Dec 2023
Text-Guided 3D Face Synthesis -- From Generation to Editing
Yunjie Wu
Yapeng Meng
Zhipeng Hu
Lincheng Li
Haoqian Wu
Kun Zhou
Weiwei Xu
Xin Yu
DiffM
35
9
0
01 Dec 2023
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Zhen Xing
Qi Dai
Zihao Zhang
Hui Zhang
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
30
17
0
30 Nov 2023
S2ST: Image-to-Image Translation in the Seed Space of Latent Diffusion
V. Kolmogorov
Rustem Takhanov
Dani Lischinski
DiffM
23
3
0
30 Nov 2023
CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt
Haiyao Xiao
Chenglai Zhong
Xuan Gao
Yudong Guo
Juyong Zhang
27
0
0
30 Nov 2023
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
Haoyu Zhao
Tianyi Lu
Jiaxi Gu
Xing Zhang
Qingping Zheng
Zuxuan Wu
Hang Xu
Yu-Gang Jiang
VGen
DiffM
21
10
0
29 Nov 2023
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
Cong Wei
Yang Chen
Haonan Chen
Hexiang Hu
Ge Zhang
Jie Fu
Alan Ritter
Wenhu Chen
28
50
0
28 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
21
43
0
28 Nov 2023
GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions
Jiemin Fang
Junjie Wang
Xiaopeng Zhang
Lingxi Xie
Qi Tian
3DGS
DiffM
20
107
0
27 Nov 2023
Image Super-Resolution with Text Prompt Diffusion
Zheng Chen
Yulun Zhang
Jinjin Gu
Xin Yuan
Linghe Kong
Guihai Chen
Xiaokang Yang
DiffM
26
19
0
24 Nov 2023
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Jiaxi Lv
Yi Huang
Mingfu Yan
Jiancheng Huang
Jianzhuang Liu
Yifan Liu
Yafei Wen
Xiaoxin Chen
Shifeng Chen
VGen
DiffM
23
23
0
21 Nov 2023
3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models
Haibo Yang
Yang Chen
Yingwei Pan
Ting Yao
Zhineng Chen
Tao Mei
16
19
0
09 Nov 2023
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
17
38
0
09 Nov 2023
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Shilong Liu
Hao Cheng
Haotian Liu
Hao Zhang
Feng Li
...
Hang Su
Jun Zhu
Lei Zhang
Jianfeng Gao
Chun-yue Li
MLLM
VLM
47
102
0
09 Nov 2023
A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization
Xingzhe He
Zhiwen Cao
Nicholas I. Kolkin
Lantao Yu
Kun Wan
Helge Rhodin
Ratheesh Kalarot
16
12
0
07 Nov 2023
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
43
8
0
25 Oct 2023
Audio Editing with Non-Rigid Text Prompts
Francesco Paissan
Luca Della Libera
Zhepei Wang
Mirco Ravanelli
Paris Smaragdis
Cem Subakan
DiffM
19
5
0
19 Oct 2023
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiarui Yao
Yifan Liu
Simon S. Du
Shifeng Chen
DiffM
11
24
0
28 Sep 2023
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing
Kai Wang
Fei Yang
Shiqi Yang
Muhammad Atif Butt
Joost van de Weijer
DiffM
15
50
0
27 Sep 2023
FEC: Three Finetuning-free Methods to Enhance Consistency for Real Image Editing
Songyan Chen
Jiancheng Huang
DiffM
14
13
0
26 Sep 2023
Forgedit: Text Guided Image Editing via Learning and Forgetting
Shiwen Zhang
Shuai Xiao
Weilin Huang
DiffM
11
17
0
19 Sep 2023
My Art My Choice: Adversarial Protection Against Unruly AI
Anthony Rhodes
Ram Bhagat
U. Ciftci
Ilke Demir
DiffM
20
4
0
06 Sep 2023
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization
Tao Yang
Rongyuan Wu
Peiran Ren
Xuansong Xie
Lei Zhang
DiffM
22
133
0
28 Aug 2023
Instruction Position Matters in Sequence Generation with Large Language Models
Yanjun Liu
Xianfeng Zeng
Fandong Meng
Jie Zhou
LRM
35
8
0
23 Aug 2023
Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection
Junsheng Zhou
Baorui Ma
Shujuan Li
Yu-Shen Liu
Zhizhong Han
30
36
0
22 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
19
13
0
19 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
28
81
0
15 Aug 2023
Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation
Yuki Endo
22
8
0
11 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
24
33
0
02 Aug 2023
Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Yong-Hyun Park
Mingi Kwon
J. Choi
Junghyo Jo
Youngjung Uh
DiffM
23
60
0
24 Jul 2023
Interpolating between Images with Diffusion Models
Clinton Jia Wang
Polina Golland
DiffM
9
21
0
24 Jul 2023
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou
Xintao Wang
Jie Song
Ying Shan
Jian Andrew Zhang
DiffM
22
142
0
05 Jul 2023
Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim
Kyungmin Lee
June Suk Choi
Jongheon Jeong
Kihyuk Sohn
Jinwoo Shin
DiffM
19
21
0
04 Jul 2023
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
Linoy Tsaban
Apolinário Passos
DiffM
4
36
0
02 Jul 2023
UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video
Zhi-Hao Lin
Bohan Liu
Yi-Ting Chen
David A. Forsyth
Jia-Bin Huang
Anand Bhattad
Shenlong Wang
VGen
34
10
0
15 Jun 2023
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Seoyeon Kim
Minguk Kang
Dongwon Kim
Jaesik Park
Suha Kwak
VLM
10
10
0
14 Jun 2023
GeneCIS: A Benchmark for General Conditional Image Similarity
S. Vaze
Nicolas Carion
Ishan Misra
VLM
DiffM
22
26
0
13 Jun 2023