96

Eigenvector overlaps in large sample covariance matrices and nonlinear shrinkage estimators

Abstract

Consider a data matrix Y=[y1,,yN]Y = [\mathbf{y}_1, \cdots, \mathbf{y}_N] of size M×NM \times N, where the columns are independent observations from a random vector y\mathbf{y} with zero mean and population covariance Σ\Sigma. Let ui\mathbf{u}_i and vj\mathbf{v}_j denote the left and right singular vectors of YY, respectively. This study investigates the eigenvector/singular vector overlaps ui,D1uj\langle {\mathbf{u}_i, D_1 \mathbf{u}_j} \rangle, vi,D2vj\langle {\mathbf{v}_i, D_2 \mathbf{v}_j} \rangle and ui,D3vj\langle {\mathbf{u}_i, D_3 \mathbf{v}_j} \rangle, where DkD_k are general deterministic matrices with bounded operator norms. We establish the convergence in probability of these eigenvector overlaps toward their deterministic counterparts with explicit convergence rates, when the dimension MM scales proportionally with the sample size NN. Building on these findings, we offer a more precise characterization of the loss for Ledoit and Wolf's nonlinear shrinkage estimators of the population covariance Σ\Sigma.

View on arXiv
Comments on this paper