36

Bootstraps Regularize Singular Correlation Matrices

Journal of Computational and Applied Mathematics (JCAM), 2020
Abstract

I show analytically that the average of kk bootstrapped correlation matrices rapidly becomes positive-definite as kk increases, which provides a simple approach to regularize singular Pearson correlation matrices. If nn is the number of objects and tt the number of features, the averaged correlation matrix is almost surely positive-definite if k>ee1nt1.58ntk> \frac{e}{e-1}\frac{n}{t}\simeq 1.58\frac{n}{t} in the limit of large tt and nn. The probability of obtaining a positive-definite correlation matrix with kk bootstraps is also derived for finite nn and tt. Finally, I demonstrate that the number of required bootstraps is always smaller than nn. This method is particularly relevant in fields where nn is orders of magnitude larger than the size of data points tt, e.g., in finance, genetics, social science, or image processing.

View on arXiv
Comments on this paper