16
4

Efficient List-Decodable Regression using Batches

Abstract

We begin the study of list-decodable linear regression using batches. In this setting only an α(0,1]\alpha \in (0,1] fraction of the batches are genuine. Each genuine batch contains n\ge n i.i.d. samples from a common unknown distribution and the remaining batches may contain arbitrary or even adversarial samples. We derive a polynomial time algorithm that for any nΩ~(1/α)n\ge \tilde \Omega(1/\alpha) returns a list of size O(1/α2)\mathcal O(1/\alpha^2) such that one of the items in the list is close to the true regression parameter. The algorithm requires only O~(d/α2)\tilde{\mathcal{O}}(d/\alpha^2) genuine batches and works under fairly general assumptions on the distribution. The results demonstrate the utility of batch structure, which allows for the first polynomial time algorithm for list-decodable regression, which may be impossible for the non-batch setting, as suggested by a recent SQ lower bound \cite{diakonikolas2021statistical} for the non-batch setting.

View on arXiv
Comments on this paper