Uncertainty quantification for iterative algorithms in linear models
with application to early stopping
This paper investigates the iterates obtained from iterative algorithms in high-dimensional linear regression problems, in the regime where the feature dimension is comparable with the sample size , i.e., . The analysis and proposed estimators are applicable to Gradient Descent (GD), proximal GD and their accelerated variants such as Fast Iterative Soft-Thresholding (FISTA). The paper proposes novel estimators for the generalization error of the iterate for any fixed iteration along the trajectory. These estimators are proved to be -consistent under Gaussian designs. Applications to early-stopping are provided: when the generalization error of the iterates is a U-shape function of the iteration , the estimates allow to select from the data an iteration that achieves the smallest generalization error along the trajectory. Additionally, we provide a technique for developing debiasing corrections and valid confidence intervals for the components of the true coefficient vector from the iterate at any finite iteration . Extensive simulations on synthetic data illustrate the theoretical results.
View on arXiv