52

Epistemic Uncertainty Quantification for Pre-trained VLMs via Riemannian Flow Matching

Li Ju
Mayank Nautiyal
Andreas Hellander
Ekta Vats
Prashant Singh
Main:8 Pages
5 Figures
Bibliography:2 Pages
10 Tables
Appendix:7 Pages
Abstract

Vision-Language Models (VLMs) are typically deterministic in nature and lack intrinsic mechanisms to quantify epistemic uncertainty, which reflects the model's lack of knowledge or ignorance of its own representations. We theoretically motivate negative log-density of an embedding as a proxy for the epistemic uncertainty, where low-density regions signify model ignorance. The proposed method REPVLM computes the probability density on the hyperspherical manifold of the VLM embeddings using Riemannian Flow Matching. We empirically demonstrate that REPVLM achieves near-perfect correlation between uncertainty and prediction error, significantly outperforming existing baselines. Beyond classification, we also demonstrate that the model also provides a scalable metric for out-of-distribution detection and automated data curation.

View on arXiv
Comments on this paper