
Hierarchical Vision-Language Learning for Medical Out-of-Distribution Detection

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Main:8 Pages
2 Figures
Bibliography:2 Pages
2 Tables
Abstract

In trustworthy medical diagnosis systems, integrating out-of-distribution (OOD) detection aims to identify unknown diseases in samples, thereby mitigating the risk of misdiagnosis. In this study, we propose a novel OOD detection framework based on vision-language models (VLMs), which integrates hierarchical visual information to cope with challenging unknown diseases that resemble known diseases. Specifically, a cross-scale visual fusion strategy is proposed to couple visual embeddings from multiple scales. This enriches the detailed representation of medical images and thus improves the discrimination of unknown diseases. Moreover, a cross-scale hard pseudo-OOD sample generation strategy is proposed to benefit OOD detection maximally. Experimental evaluations on three public medical datasets support that the proposed framework achieves superior OOD detection performance compared to existing methods. The source code is available at this https URL.
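
As a rough illustration of the general idea of VLM-based OOD scoring with multi-scale visual embeddings, the sketch below may help. It is not the authors' method: the fusion rule (averaging L2-normalized per-scale embeddings), the scoring rule (maximum softmax probability over cosine similarities to known-class text embeddings), the temperature value, the toy vectors, and the names `fuse_scales` and `ood_score` are all assumptions made for illustration only.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def fuse_scales(scale_embeddings):
    """Assumed fusion rule: average the unit-length embeddings from each scale."""
    normed = [l2_normalize(e) for e in scale_embeddings]
    dim = len(normed[0])
    mean = [sum(e[i] for e in normed) / len(normed) for i in range(dim)]
    return l2_normalize(mean)


def ood_score(image_emb, class_text_embs, temperature=0.1):
    """Assumed score: max softmax probability over cosine similarities.

    A low value means the image matches no known class well, which
    suggests it may be out-of-distribution.
    """
    sims = [
        sum(a * b for a, b in zip(image_emb, l2_normalize(t)))
        for t in class_text_embs
    ]
    top = max(sims)
    # Subtract the max before exponentiating for numerical stability.
    exps = [math.exp((s - top) / temperature) for s in sims]
    return max(exps) / sum(exps)


# Toy text embeddings for two known diseases (hypothetical values).
known_classes = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]

# Two-scale embeddings for a known-disease image and an unknown-disease image.
in_dist = fuse_scales([[0.9, 0.1, 0.0], [1.0, 0.0, 0.1]])
unknown = fuse_scales([[0.5, 0.5, 0.7], [0.4, 0.6, 0.8]])

score_in = ood_score(in_dist, known_classes)
score_unknown = ood_score(unknown, known_classes)
print(score_in > score_unknown)
```

In this toy setup the image whose fused embedding lies close to a known class receives a higher score than the one that sits between classes; a threshold on the score would then separate the two.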
