Proper PAC learning is compressing

Abstract
We prove that proper PAC learnability implies compression. Namely, if a concept is properly PAC learnable with samples, then has a sample compression scheme of size . In particular, every boolean concept class with constant VC dimension has a sample compression scheme of constant size. This answers a question of Littlestone and Warmuth (1986). The proof uses an approximate minimax phenomenon for boolean matrices of low VC dimension.
View on arXivComments on this paper