PartImageNet++ Dataset: Enhancing Visual Models with High-Quality Part Annotations

4 January 2026

Xiao Li

Zilong Liu

Yining Liu

Zhuhong Li

Na Dong

Sitian Qin

Xiaolin Hu

VLM

ArXiv (abs)PDF HTML Github (358★)

Main:16 Pages

7 Figures

Bibliography:6 Pages

4 Tables

Appendix:5 Pages

Abstract

To address the scarcity of high-quality part annotations in existing datasets, we introduce PartImageNet++ (PIN++), a dataset that provides detailed part annotations for all categories in ImageNet-1K. With 100 annotated images per category, totaling 100K images, PIN++ represents the most comprehensive dataset covering a diverse range of object categories. Leveraging PIN++, we propose a Multi-scale Part-supervised recognition Model (MPM) for robust classification on ImageNet-1K. We first trained a part segmentation network using PIN++ and used it to generate pseudo part labels for the remaining unannotated images. MPM then integrated a conventional recognition architecture with auxiliary bypass layers, jointly supervised by both pseudo part labels and the original part annotations. Furthermore, we conducted extensive experiments on PIN++, including part segmentation, object segmentation, and few-shot learning, exploring various ways to leverage part annotations in downstream tasks. Experimental results demonstrated that our approach not only enhanced part-based models for robust object recognition but also established strong baselines for multiple downstream tasks, highlighting the potential of part annotations in improving model performance. The dataset and the code are available atthis https URL.

View on arXiv

Comments on this paper