Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.13333
Cited By
Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model
26 November 2021
Zipeng Xu
Tianwei Lin
Hao Tang
Fu Li
Dongliang He
N. Sebe
Radu Timofte
Luc Van Gool
Errui Ding
EGVM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model"
38 / 38 papers shown
Title
DisCoM-KD: Cross-Modal Knowledge Distillation via Disentanglement Representation and Adversarial Learning
Dino Ienco
C. Dantas
22
1
0
05 Aug 2024
Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment
Fei Zhou
Zhicong Huang
Tianhao Gu
Guoping Qiu
CoGe
VLM
43
0
0
14 Jun 2024
Closed-Loop Unsupervised Representation Disentanglement with
β
β
β
-VAE Distillation and Diffusion Probabilistic Feedback
Xin Jin
Bo Li
Baao Xie
Wenyao Zhang
Jinming Liu
Ziqiang Li
Tao Yang
Wenjun Zeng
DRL
DiffM
CoGe
24
7
0
04 Feb 2024
Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation
Hao Tang
Ling Shao
N. Sebe
Luc Van Gool
13
5
0
15 Jan 2024
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Qin Guo
Tianwei Lin
DiffM
13
28
0
15 Dec 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
Yue Jiang
Yingya Zhang
Jing Dong
17
2
0
12 Oct 2023
Subjective Face Transform using Human First Impressions
Chaitanya Roygaga
Joshua Krinsky
Kai Zhang
Kenny Kwok
Aparna Bharati
CVBM
34
0
0
27 Sep 2023
Interactive Neural Painting
E. Peruzzo
Willi Menapace
Vidit Goel
F. Arrigoni
H. Tang
...
Nikita Orlov
Yuxiao Hu
Humphrey Shi
N. Sebe
Elisa Ricci
11
2
0
31 Jul 2023
Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic Image Synthesis
H. Tang
Guolei Sun
N. Sebe
Luc Van Gool
GAN
16
8
0
22 Jul 2023
FashionTex: Controllable Virtual Try-on with Text and Texture
Anran Lin
Nanxuan Zhao
Shuliang Ning
Yuda Qiu
Baoyuan Wang
Xiaoguang Han
DiffM
17
12
0
08 May 2023
Text-guided Eyeglasses Manipulation with Spatial Constraints
Jiacheng Wang
Ping Liu
Jingen Liu
Wei-ping Xu
DiffM
16
6
0
25 Apr 2023
Not Only Generative Art: Stable Diffusion for Content-Style Disentanglement in Art Analysis
Yankun Wu
Yuta Nakashima
Noa Garcia
CoGe
DiffM
13
26
0
20 Apr 2023
Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP
Tsuyoshi Baba
Kosuke Nishida
Kyosuke Nishida
CLIP
30
1
0
03 Apr 2023
SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective
Zipeng Xu
Songlong Xing
E. Sangineto
N. Sebe
CLIP
9
2
0
16 Mar 2023
DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation
Yueming Lyu
Tianwei Lin
Fu Li
Dongliang He
Jing Dong
Tien-Ping Tan
31
38
0
11 Mar 2023
Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation
Rui Zhao
Wei Li
Zhipeng Hu
Lincheng Li
Zhengxia Zou
Z. Shi
Changjie Fan
21
18
0
02 Mar 2023
Text-driven Visual Synthesis with Latent Diffusion Prior
Tingbo Liao
Songwei Ge
Yiran Xu
Yao-Chih Lee
Badour Albahar
Jia-Bin Huang
DiffM
15
6
0
16 Feb 2023
Eliminating Contextual Prior Bias for Semantic Image Editing via Dual-Cycle Diffusion
Zuopeng Yang
Tianshu Chu
Xin Lin
Erdun Gao
Daqing Liu
J. Yang
Chaoyue Wang
DiffM
13
16
0
05 Feb 2023
Shape-aware Text-driven Layered Video Editing
Yao-Chih Lee
Ji-Ze Jang
Yi-Ting Chen
Elizabeth Qiu
Jia-Bin Huang
VGen
DiffM
23
51
0
30 Jan 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffM
VLM
58
99
0
30 Jan 2023
CREPE: Can Vision-Language Foundation Models Reason Compositionally?
Zixian Ma
Jerry Hong
Mustafa Omer Gul
Mona Gandhi
Irena Gao
Ranjay Krishna
CoGe
18
124
0
13 Dec 2022
Disentangled Representation Learning
Xin Eric Wang
Hong Chen
Siao Tang
Zihao Wu
Wenwu Zhu
DRL
13
77
0
21 Nov 2022
Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis
Hao Tang
Ling Shao
Philip H. S. Torr
N. Sebe
13
12
0
12 Nov 2022
Diffusion Models already have a Semantic Latent Space
Mingi Kwon
Jaeseok Jeong
Youngjung Uh
10
172
0
20 Oct 2022
CLIP-PAE: Projection-Augmentation Embedding to Extract Relevant Features for a Disentangled, Interpretable, and Controllable Text-Guided Face Manipulation
Chenliang Zhou
Fangcheng Zhong
Cengiz Öztireli
CLIP
40
19
0
08 Oct 2022
clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIP
Justin N. M. Pinkney
Chuan Li
CLIP
VLM
34
19
0
05 Oct 2022
Facial Expression Translation using Landmark Guided GANs
Hao Tang
N. Sebe
CVBM
8
3
0
05 Sep 2022
Exploring CLIP for Assessing the Look and Feel of Images
Jianyi Wang
Kelvin C. K. Chan
Chen Change Loy
VLM
6
512
0
25 Jul 2022
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
12
47
0
15 Jun 2022
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
Ming Tao
Bingkun Bao
Hao Tang
Fei Wu
Longhui Wei
Qi Tian
DiffM
9
12
0
02 Jun 2022
End-to-End Visual Editing with a Generatively Pre-Trained Artist
A. Brown
Cheng-Yang Fu
Omkar M. Parkhi
Tamara L. Berg
Andrea Vedaldi
DiffM
9
8
0
03 May 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
Designing an Encoder for StyleGAN Image Manipulation
Omer Tov
Yuval Alaluf
Yotam Nitzan
Or Patashnik
Daniel Cohen-Or
188
651
0
04 Feb 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
Pengchuan Zhang
Xiujun Li
Xiaowei Hu
Jianwei Yang
Lei Zhang
Lijuan Wang
Yejin Choi
Jianfeng Gao
ObjD
VLM
252
157
0
02 Jan 2021
Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation
Bowen Li
Xiaojuan Qi
Philip H. S. Torr
Thomas Lukasiewicz
GAN
102
68
0
23 Oct 2020
Semi-Supervised StyleGAN for Disentanglement Learning
Weili Nie
Tero Karras
Animesh Garg
Shoubhik Debhath
Anjul Patney
Ankit B. Patel
Anima Anandkumar
DRL
81
72
0
06 Mar 2020
Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation
Hao Tang
Philip H. S. Torr
N. Sebe
17
31
0
03 Feb 2020
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
262
10,183
0
12 Dec 2018
1