Assessing Color Vision Test in Large Vision-language Models

15 July 2025

Hongfei Ye

Bin Chen

Wenxi Liu

Yu Zhang

Zhao Li

Dandan Ni

Hongyang Chen

VLM

ArXiv (abs)PDF HTML Github

Main:7 Pages

9 Figures

Bibliography:1 Pages

5 Tables

Abstract

With the widespread adoption of large vision-language models, the capacity for color vision in these models is crucial. However, the color vision abilities of large visual-language models have not yet been thoroughly explored. To address this gap, we define a color vision testing task for large vision-language models and construct a dataset \footnote{Anonymous Github Showing some of the datathis https URL} that covers multiple categories of test questions and tasks of varying difficulty levels. Furthermore, we analyze the types of errors made by large vision-language models and propose fine-tuning strategies to enhance their performance in color vision tests.

View on arXiv

Comments on this paper