290

Proper PAC learning is compressing

Information Theory and Applications Workshop (ITA), 2015
Abstract

We prove that proper PAC learnability implies compression. Namely, if a concept CΣXC \subseteq \Sigma^X is properly PAC learnable with dd samples, then CC has a sample compression scheme of size 2O(d)2^{O(d)}. In particular, every boolean concept class with constant VC dimension has a sample compression scheme of constant size. This answers a question of Littlestone and Warmuth (1986). The proof uses an approximate minimax phenomenon for boolean matrices of low VC dimension.

View on arXiv
Comments on this paper