Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2102.01645
Cited By
Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search
2 February 2021
Federico A. Galatolo
M. G. Cimino
G. Vaglini
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search"
50 / 53 papers shown
Title
Studying Classifier(-Free) Guidance From a Classifier-Centric Perspective
Xiaoming Zhao
Alexander Schwing
FaML
63
0
0
13 Mar 2025
From Creation to Curriculum: Examining the role of generative AI in Arts Universities
Atticus Sims
68
0
0
21 Dec 2024
Posterior sampling via Langevin dynamics based on generative priors
Vishal Purohit
Matthew Repasky
Jianfeng Lu
Qiang Qiu
Yao Xie
Xiuyuan Cheng
DiffM
28
1
0
02 Oct 2024
Contrastive Abstraction for Reinforcement Learning
Vihang Patil
M. Hofmarcher
Elisabeth Rumetshofer
Sepp Hochreiter
OffRL
SSL
24
2
0
01 Oct 2024
Text-guided Explorable Image Super-resolution
Kanchana Vaishnavi Gandikota
Paramanand Chandramouli
40
7
0
02 Mar 2024
Towards Explainable, Safe Autonomous Driving with Language Embeddings for Novelty Identification and Active Learning: Framework and Experimental Analysis with Real-World Data Sets
Ross Greer
Mohan M. Trivedi
32
19
0
11 Feb 2024
Cross-Modal Coordination Across a Diverse Set of Input Modalities
Jorge Sánchez
Rodrigo Laguna
VLM
30
0
0
29 Jan 2024
CLIPDrawX: Primitive-based Explanations for Text Guided Sketch Synthesis
Nityanand Mathur
Shyam Marjit
Abhra Chaudhuri
Anjan Dutta
CLIP
15
0
0
04 Dec 2023
VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search
Shuting He
Hao Luo
Wei Jiang
Xudong Jiang
Henghui Ding
11
38
0
13 Nov 2023
Reference-based Restoration of Digitized Analog Videotapes
Lorenzo Agnolucci
L. Galteri
Marco Bertini
A. Bimbo
VGen
28
1
0
20 Oct 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
45
29
0
03 Oct 2023
Completing Visual Objects via Bridging Generation and Segmentation
Xiang Li
Yinpeng Chen
Chung-Ching Lin
Hao Chen
Kai Hu
Rita Singh
Bhiksha Raj
Lijuan Wang
Zicheng Liu
DiffM
19
4
0
01 Oct 2023
Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning
Guisheng Liu
Yi Li
Zhengcong Fei
Haiyan Fu
Xiangyang Luo
Yanqing Guo
VLM
DiffM
17
7
0
10 Sep 2023
MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask
Yupeng Zhou
Daquan Zhou
Zuo-Liang Zhu
Yaxing Wang
Qibin Hou
Jiashi Feng
27
10
0
08 Sep 2023
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
Fengxiang Bie
Yibo Yang
Zhongzhu Zhou
Adam Ghanem
Minjia Zhang
...
Pareesa Ameneh Golnari
David A. Clifton
Yuxiong He
Dacheng Tao
S. Song
EGVM
25
18
0
02 Sep 2023
Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features
Alberto Baldrati
Marco Bertini
Tiberio Uricchio
A. Bimbo
CLIP
CoGe
11
29
0
22 Aug 2023
ECO: Ensembling Context Optimization for Vision-Language Models
Lorenzo Agnolucci
Alberto Baldrati
Francesco Todino
Federico Becattini
Marco Bertini
A. Bimbo
VLM
15
5
0
26 Jul 2023
Composite Diffusion | whole >= Σparts
Vikram Jamwal
S. Ramaneswaran
DiffM
18
0
0
25 Jul 2023
Safeguarding Data in Multimodal AI: A Differentially Private Approach to CLIP Training
Alyssa Huang
Peihan Liu
Ryumei Nakada
Linjun Zhang
Wanrong Zhang
VLM
68
5
0
13 Jun 2023
PaintSeg: Training-free Segmentation via Painting
Xiang Li
Chung-Ching Lin
Yinpeng Chen
Zicheng Liu
Jinglu Wang
Bhiksha Raj
32
5
0
30 May 2023
GlyphDiffusion: Text Generation as Image Generation
Junyi Li
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
DiffM
23
2
0
25 Apr 2023
RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models
Seulki Park
Daeho Um
Hajung Yoon
Sanghyuk Chun
Sangdoo Yun
Jin Young Choi
25
2
0
21 Apr 2023
Text-guided Image-and-Shape Editing and Generation: A Short Survey
Cheng-Kang Ted Chao
Y. Gingold
30
3
0
18 Apr 2023
MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion
Yizhuo Lu
Changde Du
Dianpeng Wang
Huiguang He
DiffM
128
39
0
24 Mar 2023
IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining
Chihaya Matsuhira
Marc A. Kastner
Takahiro Komamizu
Takatsugu Hirayama
Keisuke Doman
Yasutomo Kawanishi
Ichiro Ide
32
6
0
06 Mar 2023
Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data
Ryumei Nakada
Halil Ibrahim Gulluk
Zhun Deng
Wenlong Ji
James Y. Zou
Linjun Zhang
SSL
VLM
42
34
0
13 Feb 2023
TeTIm-Eval: a novel curated evaluation data set for comparing text-to-image models
Federico A. Galatolo
M. G. Cimino
E. Cogotti
11
4
0
15 Dec 2022
Traditional Classification Neural Networks are Good Generators: They are Competitive with DDPMs and GANs
Guangrun Wang
Philip H. S. Torr
28
8
0
27 Nov 2022
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
15
11
0
14 Nov 2022
Towards Real-Time Text2Video via CLIP-Guided, Pixel-Level Optimization
Peter Schaldenbrand
Zhixuan Liu
Jean Oh
CLIP
11
0
0
23 Oct 2022
FRIDA: A Collaborative Robot Painter with a Differentiable, Real2Sim2Real Planning Environment
Peter Schaldenbrand
James McCann
Jean Oh
18
27
0
03 Oct 2022
Are metrics measuring what they should? An evaluation of image captioning task metrics
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
19
9
0
04 Jul 2022
A Study on the Evaluation of Generative Models
Eyal Betzalel
Coby Penso
Aviv Navon
Ethan Fetaya
EGVM
25
48
0
22 Jun 2022
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs
Tal Shaharabany
Yoad Tewel
Lior Wolf
ObjD
36
15
0
19 Jun 2022
Deep Learning and Synthetic Media
Raphaël Millière
18
18
0
11 May 2022
End-to-End Visual Editing with a Generatively Pre-Trained Artist
A. Brown
Cheng-Yang Fu
Omkar M. Parkhi
Tamara L. Berg
Andrea Vedaldi
DiffM
27
8
0
03 May 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
67
6,627
0
13 Apr 2022
Large-scale Bilingual Language-Image Contrastive Learning
ByungSoo Ko
Geonmo Gu
VLM
19
14
0
28 Mar 2022
StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Translation
Peter Schaldenbrand
Zhixuan Liu
Jean Oh
CLIP
27
44
0
24 Feb 2022
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
64
3,466
0
20 Dec 2021
Semantic Segmentation In-the-Wild Without Seeing Any Segmentation Examples
Nir Zabari
Yedid Hoshen
VLM
25
26
0
06 Dec 2021
FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization
Xingchao Liu
Chengyue Gong
Lemeng Wu
Shujian Zhang
Haoran Su
Qiang Liu
CLIP
25
89
0
02 Dec 2021
LAFITE: Towards Language-Free Training for Text-to-Image Generation
Yufan Zhou
Ruiyi Zhang
Changyou Chen
Chunyuan Li
Chris Tensmeyer
Tong Yu
Jiuxiang Gu
Jinhui Xu
Tong Sun
VLM
21
162
0
27 Nov 2021
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP
Andreas Fürst
Elisabeth Rumetshofer
Johannes Lehner
Viet-Hung Tran
Fei Tang
...
David P. Kreil
Michael K Kopp
G. Klambauer
Angela Bitto-Nemling
Sepp Hochreiter
VLM
CLIP
199
102
0
21 Oct 2021
AffectGAN: Affect-Based Generative Art Driven by Semantics
Theodoros Galanos
Antonios Liapis
Georgios N. Yannakakis
GAN
25
12
0
30 Sep 2021
Modern Evolution Strategies for Creativity: Fitting Concrete Images and Abstract Concepts
Yingtao Tian
David R Ha
63
42
0
18 Sep 2021
CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders
Kevin Frans
Lisa Soros
Olaf Witkowski
CLIP
19
203
0
28 Jun 2021
Differentiable Quality Diversity
Matthew C. Fontaine
S. Nikolaidis
32
89
0
07 Jun 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
72
7,410
0
11 May 2021
StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery
Or Patashnik
Zongze Wu
Eli Shechtman
Daniel Cohen-Or
Dani Lischinski
CLIP
VLM
17
1,190
0
31 Mar 2021
1
2
Next