35
0

The crititcal threshold level on Kendall's tau statistic concerning minimax estimation of sparse correlation matrices

Abstract

Let X1,...,XnRpX_1,...,X_n\in\mathbb{R}^{p} be a sample from an elliptical distribution with correlation matrix ρ\rho and Kendall's tau correlation matrix τ\tau. Besides the minimax rate cn,p(logpn)1q/2c_{n,p}(\frac{\log p}{n})^{1-q/2} of estimation for ρ\rho under the Frobenius norm over large classes of correlation matrices with sparse rows, where the parameters cn,pc_{n,p} and qq depend on the class of sparse correlation matrices, we establish a critical threshold level regarding the minimax rate for a natural threshold estimator based on Kendall's tau sample correlation matrix τ^\hat\tau. More precisely we identify a constant α>0\alpha^\ast>0 such that the proposed estimator attains the minimax rate for any entrywise threshold level α(logpn)1/2\alpha(\frac{\log p}{n})^{1/2} with α>α\alpha>\alpha^\ast. In general this is not anymore true for α<α\alpha<\alpha^\ast. This critical value α\alpha^\ast is given by 2π3\frac{\sqrt{2}\pi}{3} and therefore by choosing α\alpha slightly larger than α\alpha^\ast the corresponding estimator does not only achieve the minimax rate but provides a non-trivial estimate of the true correlation matrix even for moderate sample sizes nn. The main ingredient to provide the critical threshold level is a sharp large deviation result for Kendall's tau sample correlation if the underlying 22-dimensional elliptical distribution implies weak correlation between the components. This result is evolved from an asymptotic expansion of the number of permutations with a certain number of inversions. To the best of the authors knowledge this is the first work concerning critical threshold constants.

View on arXiv
Comments on this paper