182
v1v2v3v4v5 (latest)

Statistical Significance of Feature Importance Rankings

Conference on Uncertainty in Artificial Intelligence (UAI), 2024
Main:8 Pages
8 Figures
Bibliography:4 Pages
4 Tables
Appendix:9 Pages
Abstract

Feature importance scores are ubiquitous tools for understanding the predictions of machine learning models. However, many popular attribution methods suffer from high instability due to random sampling. Leveraging novel ideas from hypothesis testing, we devise techniques that ensure the most important features are correct with high-probability guarantees. These assess the set of KK top-ranked features, as well as the order of its elements. Given a set of local or global importance scores, we demonstrate how to retrospectively verify the stability of the highest ranks. We then introduce two efficient sampling algorithms that identify the KK most important features, perhaps in order, with probability exceeding 1α1-\alpha. The theoretical justification for these procedures is validated empirically on SHAP and LIME.

View on arXiv
Comments on this paper