Bulk Spectra of Truncated Sample Covariance Matrices
Determinantal Point Processes (DPPs), which originate from quantum and statistical physics, are known for modelling diversity. Recent research [Ghosh and Rigollet (2020)] has demonstrated that certain matrix-valued -statistics (that are truncated versions of the usual sample covariance matrix) can effectively estimate parameters in the context of Gaussian DPPs and enhance dimension reduction techniques, outperforming standard methods like PCA in clustering applications. This paper explores the spectral properties of these matrix-valued -statistics in the null setting of an isotropic design. These matrices may be represented as , where is a data matrix and is the Laplacian matrix of a random geometric graph associated to . The main mathematically interesting twist here is that the matrix is dependent on . We give complete descriptions of the bulk spectra of these matrix-valued -statistics in terms of the Stieltjes transforms of their empirical spectral measures. The results and the techniques are in fact able to address a broader class of kernelised random matrices, connecting their limiting spectra to generalised Mar\v{c}enko-Pastur laws and free probability.
View on arXiv