PAC-Bayes Analysis for Recalibration in Classification

10 June 2024

Masahiro Fujisawa

Futoshi Futami

Main: 9 pages; Bibliography: 3 pages; Appendix: 26 pages; 8 figures; 10 tables

Abstract

Nonparametric estimation using uniform-width binning is a standard approach for evaluating the calibration performance of machine learning models. However, existing theoretical analyses of the bias induced by binning are limited to binary classification, creating a significant gap with practical applications such as multiclass classification. Additionally, many parametric recalibration algorithms lack theoretical guarantees for their generalization performance. To address these issues, we conduct a generalization analysis of calibration error using the probably approximately correct (PAC) Bayes framework. This approach enables us to derive the first optimizable upper bound for generalization error in the calibration context. Based on this theory, we propose a generalization-aware recalibration algorithm. Numerical experiments show that our algorithm enhances the performance of Gaussian process-based recalibration across various benchmark datasets and models.
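To make the uniform-width binning estimator mentioned in the abstract concrete, the following is a minimal sketch of the commonly used top-label expected calibration error (ECE) with equal-width bins on [0, 1]. It is an illustration under stated assumptions, not the authors' code: the function name `binned_ece`, its parameters, and the choice of the top-label variant are hypothetical, and the exact quantity analyzed in the paper may differ.

```python
import numpy as np


def binned_ece(confidences, correct, n_bins=10):
    """Top-label ECE with uniform-width bins (illustrative sketch only).

    confidences: predicted probability of the predicted class, in [0, 1].
    correct:     1 if the prediction was right, 0 otherwise.
    n_bins:      number of equal-width bins on [0, 1] (assumed default).
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    n = len(confidences)

    # Equal-width bin edges: 0, 1/n_bins, ..., 1.
    edges = np.linspace(0.0, 1.0, n_bins + 1)

    # Using only the interior edges maps each confidence to an index
    # in 0..n_bins-1; a confidence of exactly 1.0 lands in the last bin.
    bin_ids = np.digitize(confidences, edges[1:-1])

    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if mask.any():
            # Gap between empirical accuracy and mean confidence in the bin,
            # weighted by the fraction of samples falling in that bin.
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.sum() / n * gap
    return ece


if __name__ == "__main__":
    conf = [0.95, 0.95, 0.55, 0.55]
    hit = [1, 1, 1, 0]
    print(binned_ece(conf, hit, n_bins=10))
```

The estimate depends on `n_bins`: bins that are too coarse hide miscalibration inside a bin, while bins that are too fine leave few samples per bin. This is the binning-induced bias that the abstract refers to.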
