Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2404.03539
Cited By
Is CLIP the main roadblock for fine-grained open-world perception?
International Conference on Content-Based Multimedia Indexing (CBMI), 2024
4 April 2024
Lorenzo Bianchi
F. Carrara
Nicola Messina
Fabrizio Falchi
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (27★)
Papers citing
"Is CLIP the main roadblock for fine-grained open-world perception?"
6 / 6 papers shown
ADIEE: Automatic Dataset Creation and Scorer for Instruction-Guided Image Editing Evaluation
Sherry X. Chen
Yi Wei
Luowei Zhou
Suren Kumar
336
5
0
09 Jul 2025
MoralCLIP: Contrastive Alignment of Vision-and-Language Representations with Moral Foundations Theory
Ana Carolina Condez
Diogo Tavares
João Magalhães
VLM
324
0
0
06 Jun 2025
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models
Tadeusz Dziarmaga
Marcin Kądziołka
Artur Kasymov
Marcin Mazur
EGVM
488
0
0
24 Mar 2025
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training
Computer Vision and Pattern Recognition (CVPR), 2024
Haicheng Wang
Chen Ju
Weixiong Lin
Shuai Xiao
Mengting Chen
...
Mingshuai Yao
Jinsong Lan
Ying Chen
Qingwen Liu
Yanfeng Wang
VLM
CLIP
410
14
0
30 Nov 2024
TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Georgia Gabriela Sampaio
Ruixiang Zhang
Shuangfei Zhai
Jiatao Gu
J. Susskind
Navdeep Jaitly
Yizhe Zhang
DiffM
CLIP
327
2
0
02 Nov 2024
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models
Pouyan Navard
Amin Karimi Monsefi
Mengxi Zhou
Wei-Lun Chao
Alper Yilmaz
R. Ramnath
DiffM
444
6
0
02 Oct 2024
1
Page 1 of 1