Is CLIP the main roadblock for fine-grained open-world perception?

International Conference on Content-Based Multimedia Indexing (CBMI), 2024
4 April 2024
Lorenzo Bianchi
F. Carrara
Nicola Messina
Fabrizio Falchi
    VLM
ArXiv (abs) | PDF | HTML | Github (27★)

Papers citing "Is CLIP the main roadblock for fine-grained open-world perception?"

6 / 6 papers shown
ADIEE: Automatic Dataset Creation and Scorer for Instruction-Guided Image Editing Evaluation
Sherry X. Chen
Yi Wei
Luowei Zhou
Suren Kumar
336
5
0
09 Jul 2025
MoralCLIP: Contrastive Alignment of Vision-and-Language Representations with Moral Foundations Theory
Ana Carolina Condez
Diogo Tavares
João Magalhães
VLM
324
0
0
06 Jun 2025
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models
Tadeusz Dziarmaga
Marcin Kądziołka
Artur Kasymov
Marcin Mazur
EGVM
488
0
0
24 Mar 2025
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training
Computer Vision and Pattern Recognition (CVPR), 2024
Haicheng Wang
Chen Ju
Weixiong Lin
Shuai Xiao
Mengting Chen
...
Mingshuai Yao
Jinsong Lan
Ying Chen
Qingwen Liu
Yanfeng Wang
VLM, CLIP
410
14
0
30 Nov 2024
TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Georgia Gabriela Sampaio
Ruixiang Zhang
Shuangfei Zhai
Jiatao Gu
J. Susskind
Navdeep Jaitly
Yizhe Zhang
DiffM, CLIP
327
2
0
02 Nov 2024
KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models
Pouyan Navard
Amin Karimi Monsefi
Mengxi Zhou
Wei-Lun Chao
Alper Yilmaz
R. Ramnath
DiffM
444
6
0
02 Oct 2024