8
0

Spectral Statistics of the Sample Covariance Matrix for High Dimensional Linear Gaussians

Abstract

Performance of ordinary least squares(OLS) method for the \emph{estimation of high dimensional stable state transition matrix} AA(i.e., spectral radius ρ(A)<1\rho(A)<1) from a single noisy observed trajectory of the linear time invariant(LTI)\footnote{Linear Gaussian (LG) in Markov chain literature} system X:(x0,x1,,xN1)X_{-}:(x_0,x_1, \ldots,x_{N-1}) satisfying \begin{equation} x_{t+1}=Ax_{t}+w_{t}, \hspace{10pt} \text{ where } w_{t} \thicksim N(0,I_{n}), \end{equation} heavily rely on negative moments of the sample covariance matrix: (XX)=i=0N1xixi(X_{-}X_{-}^{*})=\sum_{i=0}^{N-1}x_{i}x_{i}^{*} and singular values of EXEX_{-}^{*}, where EE is a rectangular Gaussian ensemble E=[w0,,wN1]E=[w_0, \ldots, w_{N-1}]. Negative moments requires sharp estimates on all the eigenvalues λ1(XX)λn(XX)0\lambda_{1}\big(X_{-}X_{-}^{*}\big) \geq \ldots \geq \lambda_{n}\big(X_{-}X_{-}^{*}\big) \geq 0. Leveraging upon recent results on spectral theorem for non-Hermitian operators in \cite{naeem2023spectral}, along with concentration of measure phenomenon and perturbation theory(Gershgorins' and Cauchys' interlacing theorem) we show that only when A=AA=A^{*}, typical order of λj(XX)[NnN,N+nN]\lambda_{j}\big(X_{-}X_{-}^{*}\big) \in \big[N-n\sqrt{N}, N+n\sqrt{N}\big] for all j[n]j \in [n]. However, in \emph{high dimensions} when AA has only one distinct eigenvalue λ\lambda with geometric multiplicity of one, then as soon as eigenvalue leaves \emph{complex half unit disc}, largest eigenvalue suffers from curse of dimensionality: λ1(XX)=Ω(Nneαλn)\lambda_{1}\big(X_{-}X_{-}^{*}\big)=\Omega\big( \lfloor\frac{N}{n}\rfloor e^{\alpha_{\lambda}n} \big), while smallest eigenvalue λn(XX)(0,N+N]\lambda_{n}\big(X_{-}X_{-}^{*}\big) \in (0, N+\sqrt{N}]. Consequently, OLS estimator incurs a \emph{phase transition} and becomes \emph{transient: increasing iteration only worsens estimation error}, all of this happening when the dynamics are generated from stable systems.

View on arXiv
Comments on this paper