ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1406.5986
108
97
v1v2 (latest)

A Statistical Perspective on Randomized Sketching for Ordinary Least-Squares

23 June 2014
Garvesh Raskutti
Michael W. Mahoney
ArXiv (abs)PDFHTML
Abstract

We consider statistical aspects of solving large-scale least-squares (LS) problems using randomized sketching algorithms. Prior work has typically adopted an \emph{algorithmic perspective}, in that it has made no statistical assumptions on the input XXX and YYY, and instead it has assumed that the data (X,Y)(X,Y)(X,Y) are fixed and worst-case. In this paper, we adopt a \emph{statistical perspective}, and we consider the mean-squared error performance of randomized sketching algorithms, when data (X,Y)(X, Y)(X,Y) are generated according to a statistical linear model Y=Xβ+ϵY = X \beta + \epsilonY=Xβ+ϵ, where ϵ\epsilonϵ is a noise process. To do this, we first develop a framework for assessing, in a unified manner, algorithmic and statistical aspects of randomized sketching methods. We then consider the statistical predicition efficiency (SPE) and the statistical residual efficiency (SRE) of the sketched LS estimator; and we use our framework to provide results for several types of random projection and random sampling sketching algorithms. Among other results, we show that the SRE can be bounded when p≲r≪np \lesssim r \ll np≲r≪n but that the SPE typically requires the sample size rrr to be substantially larger. Our theoretical results reveal that, depending on the specifics of the situation, leverage-based sampling methods can perform as well as or better than projection methods. Our empirical results reveal that when rrr is only slightly greater than ppp and much less than nnn, projection-based methods out-perform sampling-based methods, but as rrr grows, sampling methods start to out-perform projection methods.

View on arXiv
Comments on this paper