48
11
v1v2 (latest)

A unified matrix model including both CCA and F matrices in multivariate analysis: the largest eigenvalue and its applications

Abstract

Let \bbZM1×N=\bbT12\bbX\bbZ_{M_1\times N}=\bbT^{\frac{1}{2}}\bbX where (\bbT12)2=\bbT(\bbT^{\frac{1}{2}})^2=\bbT is a positive definite matrix and \bbX\bbX consists of independent random variables with mean zero and variance one. This paper proposes a unified matrix model \bold{\bbom}=(\bbZ\bbU_2\bbU_2^T\bbZ^T)^{-1}\bbZ\bbU_1\bbU_1^T\bbZ^T, where \bbU1\bbU_1 and \bbU2\bbU_2 are isometric with dimensions N×N1N\times N_1 and N×(NN2)N\times (N-N_2) respectively such that \bbU1T\bbU1=\bbIN1\bbU_1^T\bbU_1=\bbI_{N_1}, \bbU2T\bbU2=\bbINN2\bbU_2^T\bbU_2=\bbI_{N-N_2} and \bbU1T\bbU2=0\bbU_1^T\bbU_2=0. Moreover, \bbU1\bbU_1 and \bbU2\bbU_2 (random or non-random) are independent of \bbZM1×N\bbZ_{M_1\times N} and with probability tending to one, rank(\bbU1)=N1rank(\bbU_1)=N_1 and rank(\bbU2)=NN2rank(\bbU_2)=N-N_2. We establish the asymptotic Tracy-Widom distribution for its largest eigenvalue under moment assumptions on \bbX\bbX when N1,N2N_1,N_2 and M1M_1 are comparable. By selecting appropriate matrices \bbU1\bbU_1 and \bbU2\bbU_2, the asymptotic distributions of the maximum eigenvalues of the matrices used in Canonical Correlation Analysis (CCA) and of F matrices (including centered and non-centered versions) can be both obtained from that of \bbom\bold{\bbom}. %In particular, \bbom\bbom can also cover nonzero mean by appropriate matrices \bbU1\bbU_1 and \bbU2\bbU_2. %relax the zero mean value restriction for F matrix in \cite{WY} to allow for any nonzero mean vetors. %thus a direct application of our proposed Tracy-Widom distribution is the independence testing via CCA. Moreover, via appropriate matrices \bbU1\bbU_1 and \bbU2\bbU_2, this matrix \bbom\bold{\bbom} can be applied to some multivariate testing problems that cannot be done by the traditional CCA matrix.

View on arXiv
Comments on this paper