91

Bulk Spectra of Truncated Sample Covariance Matrices

Main:23 Pages
2 Figures
Bibliography:1 Pages
Appendix:2 Pages
Abstract

Determinantal Point Processes (DPPs), which originate from quantum and statistical physics, are known for modelling diversity. Recent research [Ghosh and Rigollet (2020)] has demonstrated that certain matrix-valued UU-statistics (that are truncated versions of the usual sample covariance matrix) can effectively estimate parameters in the context of Gaussian DPPs and enhance dimension reduction techniques, outperforming standard methods like PCA in clustering applications. This paper explores the spectral properties of these matrix-valued UU-statistics in the null setting of an isotropic design. These matrices may be represented as XLXX L X^\top, where XX is a data matrix and LL is the Laplacian matrix of a random geometric graph associated to XX. The main mathematically interesting twist here is that the matrix LL is dependent on XX. We give complete descriptions of the bulk spectra of these matrix-valued UU-statistics in terms of the Stieltjes transforms of their empirical spectral measures. The results and the techniques are in fact able to address a broader class of kernelised random matrices, connecting their limiting spectra to generalised Mar\v{c}enko-Pastur laws and free probability.

View on arXiv
Comments on this paper