564

A Minimax Approach to Supervised Learning

Neural Information Processing Systems (NeurIPS), 2016
Abstract

Given a task of predicting YY from XX, a loss function LL, and a set of probability distributions Γ\Gamma on (X,Y)(X,Y), what is the optimal decision rule minimizing the worst-case expected loss over Γ\Gamma? In this paper, we address this question by introducing a generalization of the principle of maximum entropy. Applying this principle to sets of distributions with marginal on XX constrained to be the empirical marginal from the data, we develop a general minimax approach for supervised learning problems which reduces to the maximum likelihood problem over generalized linear models. Through this framework, we develop two classification algorithms called the minimax SVM and the minimax Brier classifier. The minimax SVM, which is a relaxed version of the standard SVM, minimizes the worst-case 0-1 loss over the structured set of distribution, and by our numerical experiments can outperform the SVM. The minimax Brier classifier utilizes the Huber penalty function for a robust classification. We also explore the application of the developed framework on robust feature selection.

View on arXiv
Comments on this paper