Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model

26 November 2021
Zipeng Xu
Tianwei Lin
Hao Tang
Fu Li
Dongliang He
N. Sebe
Radu Timofte
Luc Van Gool
Errui Ding
    EGVM

Papers citing "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model"

38 / 38 papers shown
Title
DisCoM-KD: Cross-Modal Knowledge Distillation via Disentanglement Representation and Adversarial Learning
Dino Ienco
C. Dantas
22
1
0
05 Aug 2024
Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment
Fei Zhou
Zhicong Huang
Tianhao Gu
Guoping Qiu
CoGe
VLM
43
0
0
14 Jun 2024
Closed-Loop Unsupervised Representation Disentanglement with β-VAE Distillation and Diffusion Probabilistic Feedback
Xin Jin
Bo Li
Baao Xie
Wenyao Zhang
Jinming Liu
Ziqiang Li
Tao Yang
Wenjun Zeng
DRL
DiffM
CoGe
24
7
0
04 Feb 2024
Graph Transformer GANs with Graph Masked Modeling for Architectural Layout Generation
Hao Tang
Ling Shao
N. Sebe
Luc Van Gool
13
5
0
15 Jan 2024
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Qin Guo
Tianwei Lin
DiffM
13
28
0
15 Dec 2023
DeltaSpace: A Semantic-aligned Feature Space for Flexible Text-guided Image Editing
Yueming Lyu
Kang Zhao
Bo Peng
Yue Jiang
Yingya Zhang
Jing Dong
17
2
0
12 Oct 2023
Subjective Face Transform using Human First Impressions
Chaitanya Roygaga
Joshua Krinsky
Kai Zhang
Kenny Kwok
Aparna Bharati
CVBM
34
0
0
27 Sep 2023
Interactive Neural Painting
E. Peruzzo
Willi Menapace
Vidit Goel
F. Arrigoni
H. Tang
...
Nikita Orlov
Yuxiao Hu
Humphrey Shi
N. Sebe
Elisa Ricci
11
2
0
31 Jul 2023
Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic Image Synthesis
H. Tang
Guolei Sun
N. Sebe
Luc Van Gool
GAN
16
8
0
22 Jul 2023
FashionTex: Controllable Virtual Try-on with Text and Texture
Anran Lin
Nanxuan Zhao
Shuliang Ning
Yuda Qiu
Baoyuan Wang
Xiaoguang Han
DiffM
17
12
0
08 May 2023
Text-guided Eyeglasses Manipulation with Spatial Constraints
Jiacheng Wang
Ping Liu
Jingen Liu
Wei-ping Xu
DiffM
16
6
0
25 Apr 2023
Not Only Generative Art: Stable Diffusion for Content-Style Disentanglement in Art Analysis
Yankun Wu
Yuta Nakashima
Noa Garcia
CoGe
DiffM
13
26
0
20 Apr 2023
Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP
Tsuyoshi Baba
Kosuke Nishida
Kyosuke Nishida
CLIP
30
1
0
03 Apr 2023
SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective
Zipeng Xu
Songlong Xing
E. Sangineto
N. Sebe
CLIP
9
2
0
16 Mar 2023
DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation
Yueming Lyu
Tianwei Lin
Fu Li
Dongliang He
Jing Dong
Tieniu Tan
31
38
0
11 Mar 2023
Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation
Rui Zhao
Wei Li
Zhipeng Hu
Lincheng Li
Zhengxia Zou
Z. Shi
Changjie Fan
21
18
0
02 Mar 2023
Text-driven Visual Synthesis with Latent Diffusion Prior
Tingbo Liao
Songwei Ge
Yiran Xu
Yao-Chih Lee
Badour Albahar
Jia-Bin Huang
DiffM
15
6
0
16 Feb 2023
Eliminating Contextual Prior Bias for Semantic Image Editing via Dual-Cycle Diffusion
Zuopeng Yang
Tianshu Chu
Xin Lin
Erdun Gao
Daqing Liu
J. Yang
Chaoyue Wang
DiffM
13
16
0
05 Feb 2023
Shape-aware Text-driven Layered Video Editing
Yao-Chih Lee
Ji-Ze Jang
Yi-Ting Chen
Elizabeth Qiu
Jia-Bin Huang
VGen
DiffM
23
51
0
30 Jan 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffM
VLM
58
99
0
30 Jan 2023
CREPE: Can Vision-Language Foundation Models Reason Compositionally?
Zixian Ma
Jerry Hong
Mustafa Omer Gul
Mona Gandhi
Irena Gao
Ranjay Krishna
CoGe
18
124
0
13 Dec 2022
Disentangled Representation Learning
Xin Eric Wang
Hong Chen
Siao Tang
Zihao Wu
Wenwu Zhu
DRL
13
77
0
21 Nov 2022
Bipartite Graph Reasoning GANs for Person Pose and Facial Image Synthesis
Hao Tang
Ling Shao
Philip H. S. Torr
N. Sebe
13
12
0
12 Nov 2022
Diffusion Models already have a Semantic Latent Space
Mingi Kwon
Jaeseok Jeong
Youngjung Uh
10
172
0
20 Oct 2022
CLIP-PAE: Projection-Augmentation Embedding to Extract Relevant Features for a Disentangled, Interpretable, and Controllable Text-Guided Face Manipulation
Chenliang Zhou
Fangcheng Zhong
Cengiz Öztireli
CLIP
40
19
0
08 Oct 2022
clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIP
Justin N. M. Pinkney
Chuan Li
CLIP
VLM
34
19
0
05 Oct 2022
Facial Expression Translation using Landmark Guided GANs
Hao Tang
N. Sebe
CVBM
8
3
0
05 Sep 2022
Exploring CLIP for Assessing the Look and Feel of Images
Jianyi Wang
Kelvin C. K. Chan
Chen Change Loy
VLM
6
512
0
25 Jul 2022
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
12
47
0
15 Jun 2022
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
Ming Tao
Bingkun Bao
Hao Tang
Fei Wu
Longhui Wei
Qi Tian
DiffM
9
12
0
02 Jun 2022
End-to-End Visual Editing with a Generatively Pre-Trained Artist
A. Brown
Cheng-Yang Fu
Omkar M. Parkhi
Tamara L. Berg
Andrea Vedaldi
DiffM
9
8
0
03 May 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
Designing an Encoder for StyleGAN Image Manipulation
Omer Tov
Yuval Alaluf
Yotam Nitzan
Or Patashnik
Daniel Cohen-Or
188
651
0
04 Feb 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
Pengchuan Zhang
Xiujun Li
Xiaowei Hu
Jianwei Yang
Lei Zhang
Lijuan Wang
Yejin Choi
Jianfeng Gao
ObjD
VLM
252
157
0
02 Jan 2021
Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation
Bowen Li
Xiaojuan Qi
Philip H. S. Torr
Thomas Lukasiewicz
GAN
102
68
0
23 Oct 2020
Semi-Supervised StyleGAN for Disentanglement Learning
Weili Nie
Tero Karras
Animesh Garg
Shoubhik Debnath
Anjul Patney
Ankit B. Patel
Anima Anandkumar
DRL
81
72
0
06 Mar 2020
Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation
Hao Tang
Philip H. S. Torr
N. Sebe
17
31
0
03 Feb 2020
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
262
10,183
0
12 Dec 2018