On the Optimal Weighted Regularization in Overparameterized Linear Regression

We consider the linear model with in the overparameterized regime . We estimate via generalized (weighted) ridge regression: , where is the weighting matrix. Under a random design setting with general data covariance and anisotropic prior on the true coefficients , we provide an exact characterization of the prediction risk in the proportional asymptotic limit . Our general setup leads to a number of interesting findings. We outline precise conditions that decide the sign of the optimal setting for the ridge parameter and confirm the implicit regularization effect of overparameterization, which theoretically justifies the surprising empirical observation that can be negative in the overparameterized regime. We also characterize the double descent phenomenon for principal component regression (PCR) when both and are anisotropic. Finally, we determine the optimal weighting matrix for both the ridgeless () and optimally regularized () case, and demonstrate the advantage of the weighted objective over standard ridge regression and PCR.
View on arXiv