33
3

1\ell_1-regression with Heavy-tailed Distributions

Abstract

In this paper, we consider the problem of linear regression with heavy-tailed distributions. Different from previous studies that use the squared loss to measure the performance, we choose the absolute loss, which is more robust in the presence of large prediction errors. To address the challenge that both the input and output could be heavy-tailed, we propose a truncated minimization problem, and demonstrate that it enjoys an O~(d/n)\widetilde{O}(\sqrt{d/n}) excess risk, where dd is the dimensionality and nn is the number of samples. Compared with traditional work on 1\ell_1-regression, the main advantage of our result is that we achieve a high-probability risk bound without exponential moment conditions on the input and output. Our theoretical guarantee is derived from a novel combination of the PAC-Bayesian analysis and the covering number.

View on arXiv
Comments on this paper