A Minimax Approach to Supervised Learning

Neural Information Processing Systems (NeurIPS), 2016

7 June 2016

Abstract

Given a task of predicting $Y$ from $X$, a loss function $L$, and a set of probability distributions $\Gamma$ on $(X,Y)$, what is the optimal decision rule minimizing the worst-case expected loss over $\Gamma$? In this paper, we address this question by introducing a generalization of the principle of maximum entropy. Applying this principle to sets of distributions whose marginal on $X$ is constrained to be the empirical marginal from the data, we develop a general minimax approach for supervised learning problems which reduces to the maximum likelihood problem over generalized linear models. Through this framework, we develop two classification algorithms called the minimax SVM and the minimax Brier classifier. The minimax SVM, which is a relaxed version of the standard SVM, minimizes the worst-case 0-1 loss over the structured set of distributions and, in our numerical experiments, can outperform the SVM. The minimax Brier classifier uses the Huber penalty function for robust classification. We also explore the application of the developed framework to robust feature selection.
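
In symbols, the question posed at the start of the abstract can be sketched as the following minimax problem. The symbol $\delta$ for a decision rule and the expectation notation $\mathbb{E}_{P}$ are introduced here for illustration and are not taken from the abstract; the supremum is used on the assumption that the worst case over $\Gamma$ need not be attained.

$$
\delta^{*} \in \arg\min_{\delta} \; \sup_{P \in \Gamma} \; \mathbb{E}_{P}\big[\, L\big(Y, \delta(X)\big) \,\big]
$$

Here the inner supremum is the worst-case expected loss of a fixed rule $\delta$ over all distributions in $\Gamma$, and the outer minimization selects a rule for which this worst case is smallest.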
