29
0

Distribution-Aware Mean Estimation under User-level Local Differential Privacy

Abstract

We consider the problem of mean estimation under user-level local differential privacy, where nn users are contributing through their local pool of data samples. Previous work assume that the number of data samples is the same across users. In contrast, we consider a more general and realistic scenario where each user u[n]u \in [n] owns mum_u data samples drawn from some generative distribution μ\mu; mum_u being unknown to the statistician but drawn from a known distribution MM over N\mathbb{N}^\star. Based on a distribution-aware mean estimation algorithm, we establish an MM-dependent upper bounds on the worst-case risk over μ\mu for the task of mean estimation. We then derive a lower bound. The two bounds are asymptotically matching up to logarithmic factors and reduce to known bounds when mu=mm_u = m for any user uu.

View on arXiv
Comments on this paper