44

Uncertainty quantification for iterative algorithms in linear models with application to early stopping

Abstract

This paper investigates the iterates \hbb1,,\hbbT\hbb^1,\dots,\hbb^T obtained from iterative algorithms in high-dimensional linear regression problems, in the regime where the feature dimension pp is comparable with the sample size nn, i.e., pnp \asymp n. The analysis and proposed estimators are applicable to Gradient Descent (GD), proximal GD and their accelerated variants such as Fast Iterative Soft-Thresholding (FISTA). The paper proposes novel estimators for the generalization error of the iterate \hbbt\hbb^t for any fixed iteration tt along the trajectory. These estimators are proved to be n\sqrt n-consistent under Gaussian designs. Applications to early-stopping are provided: when the generalization error of the iterates is a U-shape function of the iteration tt, the estimates allow to select from the data an iteration t^\hat t that achieves the smallest generalization error along the trajectory. Additionally, we provide a technique for developing debiasing corrections and valid confidence intervals for the components of the true coefficient vector from the iterate \hbbt\hbb^t at any finite iteration tt. Extensive simulations on synthetic data illustrate the theoretical results.

View on arXiv
Comments on this paper