399

The Backbone Method for Ultra-High Dimensional Sparse Machine Learning

Machine-mediated learning (ML), 2020
Abstract

We present the backbone method, a generic framework that enables sparse and interpretable supervised machine learning methods to scale to ultra-high dimensional problems. We solve, in minutes, sparse regression problems with p107p\sim10^7 features and decision tree induction problems with p105p\sim10^5 features. The proposed method operates in two phases; we first determine the backbone set, that consists of potentially relevant features, by solving a number of tractable subproblems; then, we solve a reduced problem, considering only the backbone features. Numerical experiments demonstrate that our method competes with optimal solutions, when exact methods apply, and substantially outperforms baseline heuristics, when exact methods do not scale, both in terms of recovering the true relevant features and in its out-of-sample predictive performance.

View on arXiv
Comments on this paper