2305.12716
The CLIP Model is Secretly an Image-to-Prompt Converter
22 May 2023
Yuxuan Ding
Chunna Tian
Haoxuan Ding
Lingqiao Liu
DiffM
Papers citing "The CLIP Model is Secretly an Image-to-Prompt Converter"
8 / 8 papers shown
Replay-Based Continual Learning with Dual-Layered Distillation and a Streamlined U-Net for Efficient Text-to-Image Generation
Md. Naimur Asif Borno
Md Sakib Hossain Shovon
Asmaa Soliman Al-Moisheer
Mohammad Ali Moni
29
0
0
11 May 2025
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models
Michael Toker
Ido Galil
Hadas Orgad
Rinon Gal
Yoad Tewel
Gal Chechik
Yonatan Belinkov
DiffM
54
2
0
12 Jan 2025
ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts
Dmitry Petrov
Pradyumn Goyal
Divyansh Shivashok
Yuanming Tao
Melinos Averkiou
E. Kalogerakis
66
0
0
03 Dec 2024
Can CLIP Count Stars? An Empirical Study on Quantity Bias in CLIP
Zeliang Zhang
Zhuo Liu
Mingqian Feng
Chenliang Xu
22
3
0
23 Sep 2024
PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation
Ginger Delmas
Philippe Weinzaepfel
Francesc Moreno-Noguer
Grégory Rogez
34
2
0
10 Sep 2024
An Analysis of Human Alignment of Latent Diffusion Models
Lorenz Linhardt
Marco Morik
Sidney Bender
Naima Elosegui Borras
DiffM
31
3
0
13 Mar 2024
Improving Image Restoration through Removing Degradations in Textual Representations
Jingbo Lin
Zhilu Zhang
Yuxiang Wei
Dongwei Ren
Dongsheng Jiang
Wangmeng Zuo
21
25
0
28 Dec 2023
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,774
0
24 Feb 2021