24
0

Stability Regularized Cross-Validation

Abstract

We revisit the problem of ensuring strong test-set performance via cross-validation. Motivated by the generalization theory literature, we propose a nested k-fold cross-validation scheme that selects hyperparameters by minimizing a weighted sum of the usual cross-validation metric and an empirical model-stability measure. The weight on the stability term is itself chosen via a nested cross-validation procedure. This reduces the risk of strong validation set performance and poor test set performance due to instability. We benchmark our procedure on a suite of 13 real-world UCI datasets, and find that, compared to k-fold cross-validation over the same hyperparameters, it improves the out-of-sample MSE for sparse ridge regression and CART by 4% on average, but has no impact on XGBoost. This suggests that for interpretable and unstable models, such as sparse regression and CART, our approach is a viable and computationally affordable method for improving test-set performance.

View on arXiv
@article{cory-wright2025_2505.06927,
  title={ Stability Regularized Cross-Validation },
  author={ Ryan Cory-Wright and Andrés Gómez },
  journal={arXiv preprint arXiv:2505.06927},
  year={ 2025 }
}
Comments on this paper