ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.09800
  4. Cited By
InstructPix2Pix: Learning to Follow Image Editing Instructions
v1v2 (latest)

InstructPix2Pix: Learning to Follow Image Editing Instructions

Computer Vision and Pattern Recognition (CVPR), 2022
17 November 2022
Tim Brooks
Aleksander Holynski
Alexei A. Efros
    DiffM
ArXiv (abs)PDFHTMLHuggingFace (4 upvotes)

Papers citing "InstructPix2Pix: Learning to Follow Image Editing Instructions"

50 / 1,733 papers shown
HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks
HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks
Zhuo Chen
Xudong Xu
Manwen Liao
Ye Pan
Wenhan Zhu
Wayne Wu
Bo Dai
Xiaokang Yang
3DH
208
12
0
19 Apr 2023
Visual Instruction Tuning
Visual Instruction TuningNeural Information Processing Systems (NeurIPS), 2023
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDaVLMMLLM
1.2K
7,615
0
17 Apr 2023
Delta Denoising Score
Delta Denoising ScoreIEEE International Conference on Computer Vision (ICCV), 2023
Amir Hertz
Kfir Aberman
Daniel Cohen-Or
DiffM
281
118
0
14 Apr 2023
One-Shot Stylization for Full-Body Human Images
One-Shot Stylization for Full-Body Human Images
Aiyu Cui
Svetlana Lazebnik
3DH
241
0
0
14 Apr 2023
Expressive Text-to-Image Generation with Rich Text
Expressive Text-to-Image Generation with Rich TextIEEE International Conference on Computer Vision (ICCV), 2023
Songwei Ge
Taesung Park
Jun-Yan Zhu
Jia-Bin Huang
DiffM
482
98
0
13 Apr 2023
Segment Everything Everywhere All at Once
Segment Everything Everywhere All at OnceNeural Information Processing Systems (NeurIPS), 2023
Xueyan Zou
Jianwei Yang
Hao Zhang
Feng Li
Linjie Li
Jianfeng Wang
Lijuan Wang
Jianfeng Gao
Yong Jae Lee
MLLMVLM
433
683
0
13 Apr 2023
An Edit Friendly DDPM Noise Space: Inversion and Manipulations
An Edit Friendly DDPM Noise Space: Inversion and ManipulationsComputer Vision and Pattern Recognition (CVPR), 2023
Inbar Huberman-Spiegelglas
Vladimir Kulikov
T. Michaeli
DiffM
416
239
0
12 Apr 2023
DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion
DreamPose: Fashion Image-to-Video Synthesis via Stable DiffusionIEEE International Conference on Computer Vision (ICCV), 2023
J. Karras
Aleksander Holynski
Ting-Chun Wang
Ira Kemelmacher-Shlizerman
DiffMVGen
363
205
0
12 Apr 2023
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Improving Diffusion Models for Scene Text Editing with Dual Encoders
Jiabao Ji
Guanhua Zhang
Zhaowen Wang
Bairu Hou
Zhifei Zhang
Brian L. Price
Shiyu Chang
DiffM
220
45
0
12 Apr 2023
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into
  3D, alleviate Janus problem and Beyond
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Mohammadreza Armandpour
A. Sadeghian
Huangjie Zheng
Amir Sadeghian
Mingyuan Zhou
DiffM
406
148
0
11 Apr 2023
Leveraging Neural Representations for Audio Manipulation
Leveraging Neural Representations for Audio Manipulation
Scott H. Hawley
C. Steinmetz
111
3
0
10 Apr 2023
Towards Real-time Text-driven Image Manipulation with Unconditional
  Diffusion Models
Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models
Nikita Starodubcev
Dmitry Baranchuk
Valentin Khrulkov
Artem Babenko
DiffM
272
6
0
10 Apr 2023
InstantBooth: Personalized Text-to-Image Generation without Test-Time
  Finetuning
InstantBooth: Personalized Text-to-Image Generation without Test-Time FinetuningComputer Vision and Pattern Recognition (CVPR), 2023
Jing Shi
Wei Xiong
Zhe Lin
H. J. Jung
DiffM
367
372
0
06 Apr 2023
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Inst-Inpaint: Instructing to Remove Objects with Diffusion Models
Ahmet Burak Yildirim
Vedat Baday
Erkut Erdem
Aykut Erdem
Aysegül Dündar
DiffM
310
81
0
06 Apr 2023
Taming Encoder for Zero Fine-tuning Image Customization with
  Text-to-Image Diffusion Models
Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models
Xuhui Jia
Yang Zhao
Kelvin C. K. Chan
Yandong Li
Han-Ying Zhang
Boqing Gong
Tingbo Hou
Jian Shu
Yu-Chuan Su
DiffM
231
124
0
05 Apr 2023
AUDIT: Audio Editing by Following Instructions with Latent Diffusion
  Models
AUDIT: Audio Editing by Following Instructions with Latent Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Yuancheng Wang
Zeqian Ju
Xuejiao Tan
Lei He
Zhizheng Wu
Jiang Bian
Sheng Zhao
DiffM
333
91
0
03 Apr 2023
Subject-driven Text-to-Image Generation via Apprenticeship Learning
Subject-driven Text-to-Image Generation via Apprenticeship LearningNeural Information Processing Systems (NeurIPS), 2023
Wenhu Chen
Hexiang Hu
Yandong Li
Nataniel Rui
Xuhui Jia
Ming-Wei Chang
William W. Cohen
DiffM
922
232
0
01 Apr 2023
Going Beyond Nouns With Vision & Language Models Using Synthetic Data
Going Beyond Nouns With Vision & Language Models Using Synthetic DataIEEE International Conference on Computer Vision (ICCV), 2023
Paola Cascante-Bonilla
Khaled Shehada
James Smith
Sivan Doveh
Donghyun Kim
...
Gül Varol
A. Oliva
Vicente Ordonez
Rogerio Feris
Leonid Karlinsky
VLMSyDa
468
48
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image EditorComputer Vision and Pattern Recognition (CVPR), 2023
Vidit Goel
E. Peruzzo
Lezhi Li
Dejia Xu
Xingqian Xu
Andrii Zadaianchuk
Trevor Darrell
Zinan Lin
Humphrey Shi
DiffM
294
17
0
30 Mar 2023
MDP: A Generalized Framework for Text-Guided Image Editing by
  Manipulating the Diffusion Path
MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path
Qian Wang
Biao Zhang
Michael Birsak
Peter Wonka
DiffM
258
24
0
29 Mar 2023
Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion
Instruct 3D-to-3D: Text Instruction Guided 3D-to-3D conversion
Hiromichi Kamata
Yuiko Sakuma
Akio Hayakawa
Masato Ishii
T. Narihira
DiffM
192
52
0
28 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
The Stable Signature: Rooting Watermarks in Latent Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Pierre Fernandez
Guillaume Couairon
Edouard Grave
Matthijs Douze
Teddy Furon
WIGM
335
312
0
27 Mar 2023
Training-free Content Injection using h-space in Diffusion Models
Training-free Content Injection using h-space in Diffusion ModelsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jaeseok Jeong
Mingi Kwon
Youngjung Uh
DiffM
279
41
0
27 Mar 2023
Guiding AI-Generated Digital Content with Wireless Perception
Guiding AI-Generated Digital Content with Wireless PerceptionIEEE wireless communications (IEEE Wireless Commun.), 2023
Jiacheng Wang
Hongyang Du
Dusit Niyato
Zehui Xiong
Jiawen Kang
Shiwen Mao
Xuemin
X. Shen
113
17
0
26 Mar 2023
Human Preference Score: Better Aligning Text-to-Image Models with Human
  Preference
Human Preference Score: Better Aligning Text-to-Image Models with Human PreferenceIEEE International Conference on Computer Vision (ICCV), 2023
Xiaoshi Wu
Keqiang Sun
Feng Zhu
Rui Zhao
Jiaming Song
245
266
0
25 Mar 2023
DreamBooth3D: Subject-Driven Text-to-3D Generation
DreamBooth3D: Subject-Driven Text-to-3D GenerationIEEE International Conference on Computer Vision (ICCV), 2023
Amit Raj
S. Kaza
Ben Poole
Michael Niemeyer
Nataniel Ruiz
...
Kfir Aberman
Michael Rubinstein
Jonathan T. Barron
Yuanzhen Li
Varun Jampani
DiffM
319
268
0
23 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video
  Generators
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video GeneratorsIEEE International Conference on Computer Vision (ICCV), 2023
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zinan Lin
Shant Navasardyan
Humphrey Shi
VGen
311
739
0
23 Mar 2023
Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions
Instruct-NeRF2NeRF: Editing 3D Scenes with InstructionsIEEE International Conference on Computer Vision (ICCV), 2023
Ayaan Haque
Matthew Tancik
Alexei A. Efros
Aleksander Holynski
Angjoo Kanazawa
VGenDiffM
431
496
0
22 Mar 2023
Pix2Video: Video Editing using Image Diffusion
Pix2Video: Video Editing using Image DiffusionIEEE International Conference on Computer Vision (ICCV), 2023
Duygu Ceylan
C. Huang
Niloy J. Mitra
DiffMVGen
416
340
0
22 Mar 2023
LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
LD-ZNet: A Latent Diffusion Approach for Text-Based Image SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
K. Pnvr
Bharat Singh
P. Ghosh
Behjat Siddiquie
David Jacobs
DiffM
304
34
0
22 Mar 2023
Vox-E: Text-guided Voxel Editing of 3D Objects
Vox-E: Text-guided Voxel Editing of 3D ObjectsIEEE International Conference on Computer Vision (ICCV), 2023
Etai Sella
Gal Fiebelman
Peter Hedman
Hadar Averbuch-Elor
DiffM
351
107
0
21 Mar 2023
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Lukas Höllein
Ang Cao
Andrew Owens
Justin Johnson
Matthias Nießner
DiffM
503
243
0
21 Mar 2023
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
Geonmo Gu
Sanghyuk Chun
Wonjae Kim
HeeJae Jun
Yoohoon Kang
Sangdoo Yun
DiffM
553
84
0
21 Mar 2023
Zero-1-to-3: Zero-shot One Image to 3D Object
Zero-1-to-3: Zero-shot One Image to 3D ObjectIEEE International Conference on Computer Vision (ICCV), 2023
Ruoshi Liu
Rundi Wu
Basile Van Hoorick
P. Tokmakov
Sergey Zakharov
Carl Vondrick
DiffM
401
1,497
0
20 Mar 2023
Localizing Object-level Shape Variations with Text-to-Image Diffusion
  Models
Localizing Object-level Shape Variations with Text-to-Image Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Or Patashnik
Daniel Garibi
Idan Azuri
Hadar Averbuch-Elor
Daniel Cohen-Or
DiffM
409
144
0
20 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
SVDiff: Compact Parameter Space for Diffusion Fine-TuningIEEE International Conference on Computer Vision (ICCV), 2023
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
668
371
0
20 Mar 2023
DialogPaint: A Dialog-based Image Editing Model
DialogPaint: A Dialog-based Image Editing Model
Jingxuan Wei
Shiyu Wu
Xin Jiang
Yequan Wang
KELMDiffM
202
6
0
17 Mar 2023
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
GlueGen: Plug and Play Multi-modal Encoders for X-to-image GenerationIEEE International Conference on Computer Vision (ICCV), 2023
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
387
27
0
17 Mar 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
HIVE: Harnessing Human Feedback for Instructional Visual EditingComputer Vision and Pattern Recognition (CVPR), 2023
Shu Zhen Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
...
Haiquan Wang
Silvio Savarese
Stefano Ermon
Caiming Xiong
Ran Xu
331
164
0
16 Mar 2023
Efficient Diffusion Training via Min-SNR Weighting Strategy
Efficient Diffusion Training via Min-SNR Weighting StrategyIEEE International Conference on Computer Vision (ICCV), 2023
Tiankai Hang
Shuyang Gu
Chen Li
Jianmin Bao
Dong Chen
Han Hu
Xin Geng
B. Guo
312
224
0
16 Mar 2023
P+: Extended Textual Conditioning in Text-to-Image Generation
P+: Extended Textual Conditioning in Text-to-Image Generation
A. Voynov
Qinghao Chu
Daniel Cohen-Or
Kfir Aberman
VLMDiffM
370
245
0
16 Mar 2023
Automatic Geo-alignment of Artwork in Children's Story Books
Automatic Geo-alignment of Artwork in Children's Story Books
Jakub J Dylag
V. Suarez
James Wald
Aneesha Amodini Uvara
DiffM
153
0
0
16 Mar 2023
Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a
  Single Image using Diffusion Models
Aerial Diffusion: Text Guided Ground-to-Aerial View Translation from a Single Image using Diffusion Models
D. Kothandaraman
Wanrong Zhu
Ming Lin
Dinesh Manocha
240
6
0
15 Mar 2023
Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield
  Images with Class Labels
Class-Guided Image-to-Image Diffusion: Cell Painting from Brightfield Images with Class Labels
J. Cross-Zamirski
P. Anand
Guy B. Williams
E. Mouchet
Yinhai Wang
Carola-Bibiane Schönlieb
VLMDiffMMedIm
240
14
0
15 Mar 2023
Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style
  Transfer
Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style TransferIEEE International Conference on Computer Vision (ICCV), 2023
Serin Yang
Hyunmin Hwang
Jong Chul Ye
DiffM
430
85
0
15 Mar 2023
Text-to-image Diffusion Models in Generative AI: A Survey
Text-to-image Diffusion Models in Generative AI: A Survey
Chenshuang Zhang
Chaoning Zhang
Mengchun Zhang
In So Kweon
VLM
336
385
0
14 Mar 2023
Accountable Textual-Visual Chat Learns to Reject Human Instructions in
  Image Re-creation
Accountable Textual-Visual Chat Learns to Reject Human Instructions in Image Re-creation
Zhiwei Zhang
Yuliang Liu
MLLM
373
0
0
10 Mar 2023
Video-P2P: Video Editing with Cross-attention Control
Video-P2P: Video Editing with Cross-attention ControlComputer Vision and Pattern Recognition (CVPR), 2023
Shaoteng Liu
Yuechen Zhang
Wenbo Li
Zhe Lin
Jiaya Jia
DiffMVGen
391
309
0
08 Mar 2023
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation
  Models
Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
Chenfei Wu
Sheng-Kai Yin
Weizhen Qi
Xiaodong Wang
Zecheng Tang
Nan Duan
MLLMLRM
359
771
0
08 Mar 2023
ELODIN: Naming Concepts in Embedding Spaces
ELODIN: Naming Concepts in Embedding Spaces
Rodrigo Mello
Filipe Calegario
Geber Ramalho
DiffM
312
1
0
07 Mar 2023
Previous
123...333435
Next
Page 34 of 35
Pageof 35