Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.09800
Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"InstructPix2Pix: Learning to Follow Image Editing Instructions"
50 / 290 papers shown
Title
SynCDR : Training Cross Domain Retrieval Models with Synthetic Data
Samarth Mishra
Carlos D. Castillo
Hongcheng Wang
Kate Saenko
Venkatesh Saligrama
35
1
0
31 Dec 2023
ZONE: Zero-Shot Instruction-Guided Local Editing
Shanglin Li
Bo-Wen Zeng
Yutang Feng
Sicheng Gao
Xuhui Liu
...
Li Lin
Xu Tang
Yao Hu
Jianzhuang Liu
Baochang Zhang
DiffM
18
30
0
28 Dec 2023
Optimizing Diffusion Noise Can Serve As Universal Motion Priors
Korrawe Karunratanakul
Konpat Preechakul
Emre Aksan
Thabo Beeler
Supasorn Suwajanakorn
Siyu Tang
DiffM
14
37
0
19 Dec 2023
Scene-Conditional 3D Object Stylization and Composition
Jinghao Zhou
Tomas Jakab
Philip H. S. Torr
Christian Rupprecht
DiffM
65
2
0
19 Dec 2023
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Zeyinzi Jiang
Chaojie Mao
Yulin Pan
Zhen Han
Jingfeng Zhang
12
28
0
18 Dec 2023
SPIRE: Semantic Prompt-Driven Image Restoration
Chenyang Qi
Zhengzhong Tu
Keren Ye
M. Delbracio
P. Milanfar
Qifeng Chen
Hossein Talebi
DiffM
19
11
0
18 Dec 2023
Tell Me What You See: Text-Guided Real-World Image Denoising
E. Yosef
Raja Giryes
DiffM
45
2
0
15 Dec 2023
Relightful Harmonization: Lighting-aware Portrait Background Replacement
Mengwei Ren
Wei Xiong
Jae Shin Yoon
Zhixin Shu
Jianming Zhang
HyunJoon Jung
Guido Gerig
He Zhang
DiffM
23
16
0
11 Dec 2023
Learning Naturally Aggregated Appearance for Efficient 3D Editing
Ka Leong Cheng
Qiuyu Wang
Zifan Shi
Kecheng Zheng
Yinghao Xu
Ouyang Hao
Qifeng Chen
Yujun Shen
3DH
53
4
0
11 Dec 2023
Reality's Canvas, Language's Brush: Crafting 3D Avatars from Monocular Video
Yuchen Rao
Eduardo Pérez-Pellitero
Benjamin Busam
Yiren Zhou
Jifei Song
21
0
0
08 Dec 2023
Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images
Yiqun Zhang
Zhen Qin
Yang Liu
Dylan Campbell
9
2
0
07 Dec 2023
DiffusionSat: A Generative Foundation Model for Satellite Imagery
Samar Khanna
Patrick Liu
Linqi Zhou
Chenlin Meng
Robin Rombach
Marshall Burke
David B. Lobell
Stefano Ermon
14
55
0
06 Dec 2023
Analyzing and Improving the Training Dynamics of Diffusion Models
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine
23
150
0
05 Dec 2023
Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images
Zhuoran Yu
Chenchen Zhu
Sean Culatana
Raghuraman Krishnamoorthi
Fanyi Xiao
Yong Jae Lee
109
13
0
04 Dec 2023
Text-Guided 3D Face Synthesis -- From Generation to Editing
Yunjie Wu
Yapeng Meng
Zhipeng Hu
Lincheng Li
Haoqian Wu
Kun Zhou
Weiwei Xu
Xin Yu
DiffM
35
9
0
01 Dec 2023
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Zhen Xing
Qi Dai
Zihao Zhang
Hui Zhang
Hang-Rui Hu
Zuxuan Wu
Yu-Gang Jiang
VGen
30
17
0
30 Nov 2023
S2ST: Image-to-Image Translation in the Seed Space of Latent Diffusion
V. Kolmogorov
Rustem Takhanov
Dani Lischinski
DiffM
23
3
0
30 Nov 2023
CosAvatar: Consistent and Animatable Portrait Video Tuning with Text Prompt
Haiyao Xiao
Chenglai Zhong
Xuan Gao
Yudong Guo
Juyong Zhang
27
0
0
30 Nov 2023
MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing
Haoyu Zhao
Tianyi Lu
Jiaxi Gu
Xing Zhang
Qingping Zheng
Zuxuan Wu
Hang Xu
Yu-Gang Jiang
VGen
DiffM
21
10
0
29 Nov 2023
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
Cong Wei
Yang Chen
Haonan Chen
Hexiang Hu
Ge Zhang
Jie Fu
Alan Ritter
Wenhu Chen
28
50
0
28 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
21
43
0
28 Nov 2023
GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions
Jiemin Fang
Junjie Wang
Xiaopeng Zhang
Lingxi Xie
Qi Tian
3DGS
DiffM
20
107
0
27 Nov 2023
Image Super-Resolution with Text Prompt Diffusion
Zheng Chen
Yulun Zhang
Jinjin Gu
Xin Yuan
Linghe Kong
Guihai Chen
Xiaokang Yang
DiffM
26
19
0
24 Nov 2023
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Jiaxi Lv
Yi Huang
Mingfu Yan
Jiancheng Huang
Jianzhuang Liu
Yifan Liu
Yafei Wen
Xiaoxin Chen
Shifeng Chen
VGen
DiffM
23
23
0
21 Nov 2023
3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models
Haibo Yang
Yang Chen
Yingwei Pan
Ting Yao
Zhineng Chen
Tao Mei
16
19
0
09 Nov 2023
ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Jingwen Chen
Yingwei Pan
Ting Yao
Tao Mei
DiffM
17
38
0
09 Nov 2023
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Shilong Liu
Hao Cheng
Haotian Liu
Hao Zhang
Feng Li
...
Hang Su
Jun Zhu
Lei Zhang
Jianfeng Gao
Chun-yue Li
MLLM
VLM
47
102
0
09 Nov 2023
A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization
Xingzhe He
Zhiwen Cao
Nicholas I. Kolkin
Lantao Yu
Kun Wan
Helge Rhodin
Ratheesh Kalarot
16
12
0
07 Nov 2023
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
43
8
0
25 Oct 2023
Audio Editing with Non-Rigid Text Prompts
Francesco Paissan
Luca Della Libera
Zhepei Wang
Mirco Ravanelli
Paris Smaragdis
Cem Subakan
DiffM
19
5
0
19 Oct 2023
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiarui Yao
Yifan Liu
Simon S. Du
Shifeng Chen
DiffM
11
24
0
28 Sep 2023
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing
Kai Wang
Fei Yang
Shiqi Yang
Muhammad Atif Butt
Joost van de Weijer
DiffM
15
50
0
27 Sep 2023
FEC: Three Finetuning-free Methods to Enhance Consistency for Real Image Editing
Songyan Chen
Jiancheng Huang
DiffM
14
13
0
26 Sep 2023
Forgedit: Text Guided Image Editing via Learning and Forgetting
Shiwen Zhang
Shuai Xiao
Weilin Huang
DiffM
11
17
0
19 Sep 2023
My Art My Choice: Adversarial Protection Against Unruly AI
Anthony Rhodes
Ram Bhagat
U. Ciftci
Ilke Demir
DiffM
20
4
0
06 Sep 2023
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization
Tao Yang
Rongyuan Wu
Peiran Ren
Xuansong Xie
Lei Zhang
DiffM
22
133
0
28 Aug 2023
Instruction Position Matters in Sequence Generation with Large Language Models
Yanjun Liu
Xianfeng Zeng
Fandong Meng
Jie Zhou
LRM
35
8
0
23 Aug 2023
Learning a More Continuous Zero Level Set in Unsigned Distance Fields through Level Set Projection
Junsheng Zhou
Baorui Ma
Shujuan Li
Yu-Shen Liu
Zhizhong Han
30
36
0
22 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
19
13
0
19 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
28
81
0
15 Aug 2023
Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation
Yuki Endo
22
8
0
11 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
24
33
0
02 Aug 2023
Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Yong-Hyun Park
Mingi Kwon
J. Choi
Junghyo Jo
Youngjung Uh
DiffM
23
60
0
24 Jul 2023
Interpolating between Images with Diffusion Models
Clinton Jia Wang
Polina Golland
DiffM
9
21
0
24 Jul 2023
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Chong Mou
Xintao Wang
Jie Song
Ying Shan
Jian Andrew Zhang
DiffM
22
142
0
05 Jul 2023
Collaborative Score Distillation for Consistent Visual Synthesis
Subin Kim
Kyungmin Lee
June Suk Choi
Jongheon Jeong
Kihyuk Sohn
Jinwoo Shin
DiffM
19
21
0
04 Jul 2023
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
Linoy Tsaban
Apolinário Passos
DiffM
4
36
0
02 Jul 2023
UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video
Zhi-Hao Lin
Bohan Liu
Yi-Ting Chen
David A. Forsyth
Jia-Bin Huang
Jia-Bin Huang
Anand Bhattad
Shenlong Wang
VGen
34
10
0
15 Jun 2023
Extending CLIP's Image-Text Alignment to Referring Image Segmentation
Seoyeon Kim
Minguk Kang
Dongwon Kim
Jaesik Park
Suha Kwak
VLM
10
10
0
14 Jun 2023
GeneCIS: A Benchmark for General Conditional Image Similarity
S. Vaze
Nicolas Carion
Ishan Misra
VLM
DiffM
22
26
0
13 Jun 2023
Previous
1
2
3
4
5
6
Next