Epistemic Uncertainty Quantification for Pre-trained VLMs via Riemannian Flow Matching

29 January 2026

Li Ju

Mayank Nautiyal

Andreas Hellander

Ekta Vats

Prashant Singh

BDL

ArXiv (abs)PDF HTML Github

Main:8 Pages

5 Figures

Bibliography:2 Pages

10 Tables

Appendix:7 Pages

Abstract

Vision-Language Models (VLMs) are typically deterministic in nature and lack intrinsic mechanisms to quantify epistemic uncertainty, which reflects the model's lack of knowledge or ignorance of its own representations. We theoretically motivate negative log-density of an embedding as a proxy for the epistemic uncertainty, where low-density regions signify model ignorance. The proposed method REPVLM computes the probability density on the hyperspherical manifold of the VLM embeddings using Riemannian Flow Matching. We empirically demonstrate that REPVLM achieves near-perfect correlation between uncertainty and prediction error, significantly outperforming existing baselines. Beyond classification, we also demonstrate that the model also provides a scalable metric for out-of-distribution detection and automated data curation.

View on arXiv

Comments on this paper