ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.03905
21
2

Black-Box kkk-to-111-PCA Reductions: Theory and Applications

6 March 2024
A. Jambulapati
Syamantak Kumar
Jerry Li
Shourya Pandey
Ankit Pensia
Kevin Tian
ArXivPDFHTML
Abstract

The kkk-principal component analysis (kkk-PCA) problem is a fundamental algorithmic primitive that is widely-used in data analysis and dimensionality reduction applications. In statistical settings, the goal of kkk-PCA is to identify a top eigenspace of the covariance matrix of a distribution, which we only have black-box access to via samples. Motivated by these settings, we analyze black-box deflation methods as a framework for designing kkk-PCA algorithms, where we model access to the unknown target matrix via a black-box 111-PCA oracle which returns an approximate top eigenvector, under two popular notions of approximation. Despite being arguably the most natural reduction-based approach to kkk-PCA algorithm design, such black-box methods, which recursively call a 111-PCA oracle kkk times, were previously poorly-understood. Our main contribution is significantly sharper bounds on the approximation parameter degradation of deflation methods for kkk-PCA. For a quadratic form notion of approximation we term ePCA (energy PCA), we show deflation methods suffer no parameter loss. For an alternative well-studied approximation notion we term cPCA (correlation PCA), we tightly characterize the parameter regimes where deflation methods are feasible. Moreover, we show that in all feasible regimes, kkk-cPCA deflation algorithms suffer no asymptotic parameter loss for any constant kkk. We apply our framework to obtain state-of-the-art kkk-PCA algorithms robust to dataset contamination, improving prior work in sample complexity by a poly(k)\mathsf{poly}(k)poly(k) factor.

View on arXiv
Comments on this paper