$\ell_1$ -regression with Heavy-tailed Distributions

2 May 2018

Abstract

In this paper, we consider the problem of linear regression with heavy-tailed distributions. Different from previous studies that use the squared loss to measure the performance, we choose the absolute loss, which is more robust in the presence of large prediction errors. To address the challenge that both the input and output could be heavy-tailed, we propose a truncated minimization problem, and demonstrate that it enjoys an $\widetilde{O}(\sqrt{d/n})$ excess risk, where $d$ is the dimensionality and $n$ is the number of samples. Compared with traditional work on $\ell_1$ -regression, the main advantage of our result is that we achieve a high-probability risk bound without exponential moment conditions on the input and output. Our theoretical guarantee is derived from a novel combination of the PAC-Bayesian analysis and the covering number.

View on arXiv

Comments on this paper

ℓ1\ell_1ℓ1​-regression with Heavy-tailed Distributions

$\ell_1$ -regression with Heavy-tailed Distributions