52
21

Statistical Query Lower Bounds for List-Decodable Linear Regression

Abstract

We study the problem of list-decodable linear regression, where an adversary can corrupt a majority of the examples. Specifically, we are given a set TT of labeled examples (x,y)Rd×R(x, y) \in \mathbb{R}^d \times \mathbb{R} and a parameter 0<α<1/20< \alpha <1/2 such that an α\alpha-fraction of the points in TT are i.i.d. samples from a linear regression model with Gaussian covariates, and the remaining (1α)(1-\alpha)-fraction of the points are drawn from an arbitrary noise distribution. The goal is to output a small list of hypothesis vectors such that at least one of them is close to the target regression vector. Our main result is a Statistical Query (SQ) lower bound of dpoly(1/α)d^{\mathrm{poly}(1/\alpha)} for this problem. Our SQ lower bound qualitatively matches the performance of previously developed algorithms, providing evidence that current upper bounds for this task are nearly best possible.

View on arXiv
Comments on this paper