23
10

Geometrizing rates of convergence under local differential privacy constraints

Abstract

We study the problem of estimating a functional θ(P)\theta(\mathbb P) of an unknown probability distribution PP\mathbb P \in\mathcal P in which the original iid sample X1,,XnX_1,\dots, X_n is kept private even from the statistician via an α\alpha-local differential privacy constraint. Let ωTV\omega_{TV} denote the modulus of continuity of the functional θ\theta over P\mathcal P, with respect to total variation distance. For a large class of loss functions ll and a fixed privacy level α\alpha, we prove that the privatized minimax risk is equivalent to l(ωTV(n1/2))l(\omega_{TV}(n^{-1/2})) to within constants, under regularity conditions that are satisfied, in particular, if θ\theta is linear and P\mathcal P is convex. Our results complement the theory developed by Donoho and Liu (1991) with the nowadays highly relevant case of privatized data. Somewhat surprisingly, the difficulty of the estimation problem in the private case is characterized by ωTV\omega_{TV}, whereas, it is characterized by the Hellinger modulus of continuity if the original data X1,,XnX_1,\dots, X_n are available. We also find that for locally private estimation of linear functionals over a convex model a simple sample mean estimator, based on independently and binary privatized observations, always achieves the minimax rate. We further provide a general recipe for choosing the functional parameter in the optimal binary privatization mechanisms and illustrate the general theory in numerous examples. Our theory allows to quantify the price to be paid for local differential privacy in a large class of estimation problems. This price appears to be highly problem specific.

View on arXiv
Comments on this paper