Richer Bayesian Last Layers with Subsampled NTK Features

Sergio Calvo-Ordoñez
Jonathan Plenk
Richard Bergna
Álvaro Cartea
Yarin Gal
Jose Miguel Hernández-Lobato
Kamil Ciosek
Main: 8 pages · 2 figures · 6 tables · Bibliography: 3 pages · Appendix: 8 pages
Abstract

Bayesian Last Layers (BLLs) provide a convenient and computationally efficient way to estimate uncertainty in neural networks. However, they underestimate epistemic uncertainty because they apply a Bayesian treatment only to the final layer, ignoring the uncertainty induced by earlier layers. We propose a method that improves BLLs by leveraging a projection of Neural Tangent Kernel (NTK) features onto the space spanned by the last-layer features. This enables posterior inference that accounts for the variability of the full network while retaining the low inference cost of a standard BLL. We show that our method yields posterior variances that are provably greater than or equal to those of a standard BLL, correcting its tendency to underestimate epistemic uncertainty. To further reduce computational cost, we introduce a uniform subsampling scheme for estimating the projection matrix and for posterior inference, and we derive approximation bounds for both uses of subsampling. Empirical evaluations on UCI regression, contextual bandits, image classification, and out-of-distribution detection tasks on image and tabular datasets demonstrate improved calibration and uncertainty estimates compared to standard BLLs and competitive baselines, while reducing computational cost.
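To make the abstract's construction concrete, below is a minimal NumPy sketch of one plausible reading of the idea; it is our assumption for illustration, not the paper's code. Per-example NTK features (gradients of the network output with respect to all parameters) are projected onto the column space of the last-layer feature matrix, which yields an effective d×d prior covariance over the last-layer weights, and standard Bayesian linear regression then runs at the usual BLL cost. All names (`Phi`, `J`, `projected_ntk_prior`, ...) are hypothetical, and the uniform subsampling of NTK feature dimensions at the end is one reading of the scheme described in the abstract.

```python
# Illustrative sketch only (assumptions flagged below), not the paper's code.
import numpy as np

def projected_ntk_prior(Phi, J, jitter=1e-6):
    """Project NTK features onto span(Phi) and read off an effective prior.

    Phi: (n, d) last-layer features for n training inputs.
    J:   (n, P) per-example NTK features, i.e. gradients of the scalar
         network output w.r.t. all P parameters (from any autodiff library).

    With P_Phi = Phi (Phi^T Phi)^{-1} Phi^T the orthogonal projector onto
    the column space of Phi, the projected NTK Gram matrix factorizes as
    P_Phi J J^T P_Phi = Phi (B B^T) Phi^T with B = (Phi^T Phi)^{-1} Phi^T J,
    so B B^T acts as a (d, d) prior covariance over last-layer weights.
    This construction is our assumption for illustration.
    """
    G = Phi.T @ Phi + jitter * np.eye(Phi.shape[1])  # (d, d) Gram, regularized
    B = np.linalg.solve(G, Phi.T @ J)                # (d, P) projection coefficients
    return B @ B.T                                   # (d, d) effective prior covariance

def bll_posterior(Phi, y, prior_cov, noise_var=0.1, jitter=1e-6):
    """Bayesian linear regression on last-layer features, prior N(0, prior_cov)."""
    d = Phi.shape[1]
    prior_prec = np.linalg.inv(prior_cov + jitter * np.eye(d))
    precision = Phi.T @ Phi / noise_var + prior_prec   # posterior precision (d, d)
    Sigma = np.linalg.inv(precision)                   # posterior covariance
    mu = Sigma @ (Phi.T @ y) / noise_var               # posterior mean
    return mu, Sigma

# Toy usage with random stand-ins for network features.
rng = np.random.default_rng(0)
n, d, P = 200, 16, 5000
Phi = rng.normal(size=(n, d))    # last-layer features
J = rng.normal(size=(n, P))      # full NTK features
y = rng.normal(size=n)

# One reading of the uniform subsampling scheme: keep m of the P NTK
# feature dimensions (rescaled so the Gram estimate stays unbiased) when
# estimating the projection. What exactly is subsampled is an assumption.
m = 500
cols = rng.choice(P, size=m, replace=False)
J_sub = J[:, cols] * np.sqrt(P / m)

prior_cov = projected_ntk_prior(Phi, J_sub)
mu, Sigma = bll_posterior(Phi, y, prior_cov)

# A standard BLL uses an isotropic prior; the paper proves its construction
# yields variances at least as large as this baseline's. The toy comparison
# here just shows the two inference paths share the same machinery.
mu0, Sigma0 = bll_posterior(Phi, y, np.eye(d))
```

Note that in this sketch posterior inference itself costs the same as a standard BLL; the NTK features enter only through the one-off estimate of the d×d prior covariance, which the subsampling further cheapens.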
