148
v1v2v3 (latest)

Predicting with Distributions

Annual Conference Computational Learning Theory (COLT), 2016
Abstract

We consider a new learning model in which a joint distribution over vector pairs (x,y)(x,y) is determined by an unknown function c(x)c(x) that maps input vectors xx not to individual outputs, but to entire {\em distributions\/} over output vectors yy. Our main results take the form of rather general reductions from our model to algorithms for PAC learning the function class and the distribution class separately, and show that virtually every such combination yields an efficient algorithm in our model. Our methods include a randomized reduction to classification noise and an application of Le Cam's method to obtain robust learning algorithms.

View on arXiv
Comments on this paper