On the computational and statistical complexity of over-parameterized matrix sensing

27 January 2021
Jiacheng Zhuo
Jeongyeol Kwon
Nhat Ho
C. Caramanis
Abstract

We consider solving the low-rank matrix sensing problem with the Factorized Gradient Descent (FGD) method when the true rank is unknown and over-specified, which we refer to as over-parameterized matrix sensing. If the ground truth signal $\mathbf{X}^* \in \mathbb{R}^{d \times d}$ is of rank $r$, but we try to recover it using $\mathbf{F} \mathbf{F}^\top$ where $\mathbf{F} \in \mathbb{R}^{d \times k}$ and $k > r$, the existing statistical analysis falls short, due to a flat local curvature of the loss function around the global minima. By decomposing the factorized matrix $\mathbf{F}$ into separate column spaces to capture the effect of extra ranks, we show that $\|\mathbf{F}_t \mathbf{F}_t^\top - \mathbf{X}^*\|_F^2$ converges to a statistical error of $\tilde{\mathcal{O}}(k d \sigma^2 / n)$ after $\tilde{\mathcal{O}}\left(\frac{\sigma_r}{\sigma} \sqrt{\frac{n}{d}}\right)$ iterations, where $\mathbf{F}_t$ is the output of FGD after $t$ iterations, $\sigma^2$ is the variance of the observation noise, $\sigma_r$ is the $r$-th largest eigenvalue of $\mathbf{X}^*$, and $n$ is the number of samples. Our results, therefore, offer a comprehensive picture of the statistical and computational complexity of FGD for the over-parameterized matrix sensing problem.
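To make the setting concrete, the following is a minimal NumPy sketch of factorized gradient descent with an over-specified rank. The abstract does not state the measurement model, loss scaling, initialization, or step size, so everything here is an assumption chosen for illustration: symmetric Gaussian sensing matrices `A_i`, observations `y_i = <A_i, X*> + noise`, the loss `(1/(4n)) * sum_i (<A_i, F F^T> - y_i)^2`, a small random initialization, and a constant step. The function names `make_problem` and `fgd` are hypothetical and are not taken from the paper.

```python
import numpy as np


def make_problem(d, r, n, sigma, rng):
    """Synthetic instance (assumed model): rank-r PSD X*, symmetric Gaussian A_i."""
    U = rng.standard_normal((d, r)) / np.sqrt(d)
    X_star = U @ U.T                                  # ground truth, rank r
    G = rng.standard_normal((n, d, d))
    A = (G + G.transpose(0, 2, 1)) / 2                # symmetrize each sensing matrix
    y = np.einsum("nij,ij->n", A, X_star) + sigma * rng.standard_normal(n)
    return X_star, A, y


def fgd(A, y, k, step, iters, rng, init_scale=0.1):
    """Gradient descent on F for the loss (1/(4n)) * sum_i (<A_i, F F^T> - y_i)^2.

    k may exceed the true rank r (the over-parameterized case).
    """
    n, d, _ = A.shape
    F = init_scale * rng.standard_normal((d, k))      # assumed small random init
    for _ in range(iters):
        residual = np.einsum("nij,ij->n", A, F @ F.T) - y
        # For symmetric A_i the gradient is (1/n) * sum_i residual_i * A_i @ F.
        grad = np.einsum("n,nij->ij", residual, A) @ F / n
        F = F - step * grad
    return F


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X_star, A, y = make_problem(d=10, r=2, n=600, sigma=0.05, rng=rng)
    F = fgd(A, y, k=4, step=0.05, iters=1000, rng=rng)  # k = 4 > r = 2
    err = np.linalg.norm(F @ F.T - X_star, "fro") ** 2
    print(f"squared Frobenius error: {err:.3e}")
```

Tracking `err` across iterations for different values of `k`, `n`, and `sigma` is one way to compare the observed error against the $\tilde{\mathcal{O}}(k d \sigma^2 / n)$ rate quoted in the abstract, though this sketch makes no claim to reproduce the paper's experiments.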
