Is CLIP the main roadblock for fine-grained open-world perception?

International Conference on Content-Based Multimedia Indexing (CBMI), 2024

4 April 2024

ArXiv (abs)PDF HTML Github (27★)

Papers citing "Is CLIP the main roadblock for fine-grained open-world perception?"

6 / 6 papers shown

ADIEE: Automatic Dataset Creation and Scorer for Instruction-Guided Image Editing Evaluation

336

09 Jul 2025

MoralCLIP: Contrastive Alignment of Vision-and-Language Representations with Moral Foundations Theory

324

06 Jun 2025

PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models

488

24 Mar 2025

Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-trainingComputer Vision and Pattern Recognition (CVPR), 2024

...

410

30 Nov 2024

TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models

Georgia Gabriela Sampaio

327

02 Nov 2024

KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models

Wei-Lun Chao

444

02 Oct 2024