ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.09380
69
124

High-Dimensional Robust Mean Estimation in Nearly-Linear Time

23 November 2018
Yu Cheng
Ilias Diakonikolas
Rong Ge
ArXiv (abs)PDFHTML
Abstract

We study the fundamental problem of high-dimensional mean estimation in a robust model where a constant fraction of the samples are adversarially corrupted. Recent work gave the first polynomial time algorithms for this problem with dimension-independent error guarantees for several families of structured distributions. In this work, we give the first nearly-linear time algorithms for high-dimensional robust mean estimation. Specifically, we focus on distributions with (i) known covariance and sub-gaussian tails, and (ii) unknown bounded covariance. Given NNN samples on Rd\mathbb{R}^dRd, an ϵ\epsilonϵ-fraction of which may be arbitrarily corrupted, our algorithms run in time O~(Nd)/poly(ϵ)\tilde{O}(Nd) / \mathrm{poly}(\epsilon)O~(Nd)/poly(ϵ) and approximate the true mean within the information-theoretically optimal error, up to constant factors. Previous robust algorithms with comparable error guarantees have running times Ω~(Nd2)\tilde{\Omega}(N d^2)Ω~(Nd2), for ϵ=Ω(1)\epsilon = \Omega(1)ϵ=Ω(1). Our algorithms rely on a natural family of SDPs parameterized by our current guess ν\nuν for the unknown mean μ⋆\mu^\starμ⋆. We give a win-win analysis establishing the following: either a near-optimal solution to the primal SDP yields a good candidate for μ⋆\mu^\starμ⋆ -- independent of our current guess ν\nuν -- or the dual SDP yields a new guess ν′\nu'ν′ whose distance from μ⋆\mu^\starμ⋆ is smaller by a constant factor. We exploit the special structure of the corresponding SDPs to show that they are approximately solvable in nearly-linear time. Our approach is quite general, and we believe it can also be applied to obtain nearly-linear time algorithms for other high-dimensional robust learning problems.

View on arXiv
Comments on this paper