What Can We Learn Privately?
Learning problems form an important category of computational tasks that generalizes many of the computations researchers apply to large real-life datasets. We ask: what concept classes can be learned privately, that is, by an algorithm whose output does not depend too heavily on any single input or specific training example? Our goal is a complexity-theoretic classification of learning problems that can be solved efficiently under this privacy restriction, namely by epsilon-differentially private algorithms. We consider four private learning models: local interactive (LI), local non-interactive (LNI), centralized interactive (CI), and centralized non-interactive (CNI). We characterize these models with respect to the standard learning models, PAC and SQ. We show that for learning problems, LNI is a strict subset of LI, and LI is a strict subset of CNI = CI. This characterization accounts for the number of samples required for learning, but not for computational efficiency. We also present a partial characterization of these models when efficiency is taken into account.
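As a concrete illustration of a local non-interactive private mechanism (not taken from the paper), the following minimal Python sketch uses randomized response, the canonical building block of local differential privacy, to estimate the fraction of 1s in a binary attribute while satisfying epsilon-differential privacy for each participant. The function names, parameters, and data are illustrative assumptions.

```python
import math
import random

def randomized_response(bit, epsilon):
    """Perturb a single 0/1 bit so the report is epsilon-differentially private.

    The true bit is kept with probability e^epsilon / (1 + e^epsilon) and
    flipped otherwise; each report depends only on that participant's own data,
    so the mechanism is local and non-interactive.
    """
    p_keep = math.exp(epsilon) / (1.0 + math.exp(epsilon))
    return bit if random.random() < p_keep else 1 - bit

def estimate_mean(bits, epsilon):
    """Debias the noisy reports to estimate the true fraction of 1s."""
    p_keep = math.exp(epsilon) / (1.0 + math.exp(epsilon))
    noisy_mean = sum(randomized_response(b, epsilon) for b in bits) / len(bits)
    # Invert the expected perturbation: E[noisy_mean] = p*mean + (1-p)*(1-mean).
    return (noisy_mean - (1.0 - p_keep)) / (2.0 * p_keep - 1.0)

if __name__ == "__main__":
    random.seed(0)
    data = [1] * 700 + [0] * 300  # synthetic data with true mean 0.7 (assumed example)
    print(estimate_mean(data, epsilon=1.0))
```

The estimate concentrates around the true mean as the number of participants grows, at the cost of variance that increases as epsilon shrinks; this sample-complexity overhead is the kind of cost the models above are compared on.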