The CLIP Model is Secretly an Image-to-Prompt Converter

22 May 2023

Yuxuan Ding

Lingqiao Liu

Papers citing "The CLIP Model is Secretly an Image-to-Prompt Converter"

Title | Authors | Tag | Counts | Date
Replay-Based Continual Learning with Dual-Layered Distillation and a Streamlined U-Net for Efficient Text-to-Image Generation | Md. Naimur Asif Borno, Md Sakib Hossain Shovon, Asmaa Soliman Al-Moisheer, Mohammad Ali Moni | - | 29 0 0 | 11 May 2025
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models | Michael Toker, Ido Galil, Hadas Orgad, Rinon Gal, Yoad Tewel, Gal Chechik, Yonatan Belinkov | DiffM | 54 2 0 | 12 Jan 2025
ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts | Dmitry Petrov, Pradyumn Goyal, Divyansh Shivashok, Yuanming Tao, Melinos Averkiou, E. Kalogerakis | - | 66 0 0 | 03 Dec 2024
Can CLIP Count Stars? An Empirical Study on Quantity Bias in CLIP | Zeliang Zhang, Zhuo Liu, Mingqian Feng, Chenliang Xu | - | 22 3 0 | 23 Sep 2024
PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation | Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno-Noguer, Grégory Rogez | - | 34 2 0 | 10 Sep 2024
An Analysis of Human Alignment of Latent Diffusion Models | Lorenz Linhardt, Marco Morik, Sidney Bender, Naima Elosegui Borras | DiffM | 31 3 0 | 13 Mar 2024
Improving Image Restoration through Removing Degradations in Textual Representations | Jingbo Lin, Zhilu Zhang, Yuxiang Wei, Dongwei Ren, Dongsheng Jiang, Wangmeng Zuo | - | 21 25 0 | 28 Dec 2023
Zero-Shot Text-to-Image Generation | Aditya A. Ramesh, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss, Alec Radford, Mark Chen, Ilya Sutskever | VLM | 253 4,774 0 | 24 Feb 2021