Information-Computation Tradeoffs for Noiseless Linear Regression with Oblivious Contamination
- FedML
We study the task of noiseless linear regression under Gaussian covariates in the presence of additive oblivious contamination. Specifically, we are given i.i.d.\ samples from a distribution on with and , where is drawn independently of from an unknown distribution . Moreover, satisfies . The goal is to accurately recover the regressor to small -error. Ignoring computational considerations, this problem is known to be solvable using samples. On the other hand, the best known polynomial-time algorithms require samples. Here we provide formal evidence that the quadratic dependence in is inherent for efficient algorithms. Specifically, we show that any efficient Statistical Query algorithm for this task requires VSTAT complexity at least .
View on arXiv