548

Hiding Among the Clones: A Simple and Nearly Optimal Analysis of Privacy Amplification by Shuffling

IEEE Annual Symposium on Foundations of Computer Science (FOCS), 2020
Abstract

Recent work of Erlingsson, Feldman, Mironov, Raghunathan, Talwar, and Thakurta [EFMRTT19] demonstrates that random shuffling of input data amplifies differential privacy guarantees. Such amplification leads to substantially stronger privacy guarantees for systems in which data is contributed anonymously [BEMMRLRKTS17] and for the analysis of noisy stochastic gradient descent. We show that an ε0\varepsilon_0-locally differentially private algorithm, under shuffling with nn users, amplifies to a (Θ((1eε0)eε0log(1/δ)n),δ)(\Theta((1-e^{-\varepsilon_0})\sqrt{\frac{e^{\varepsilon_0}\log(1/\delta)}{n}}), \delta)-central differential privacy guarantee. This significantly improves over previous work and achieves the asymptotically optimal dependence on ε0\varepsilon_0. Our result is based on a new approach that is simpler than previous work and extends to approximate differential privacy with nearly the same guarantees. Our work also yields an empirical method to derive tighter bounds on the central ε\varepsilon and we show that it gets to within a small constant factor of the correct bound. As a direct corollary of our analysis we derive a simple and asymptotically optimal algorithm for discrete distribution estimation in the shuffle model of privacy [CSUZZ19].

View on arXiv
Comments on this paper