35
4

Differentially Private Top-k Selection via Canonical Lipschitz Mechanism

Abstract

Selecting the top-kk highest scoring items under differential privacy (DP) is a fundamental task with many applications. This work presents three new results. First, the exponential mechanism, permute-and-flip and report-noisy-max, as well as their oneshot variants, are unified into the Lipschitz mechanism, an additive noise mechanism with a single DP-proof via a mandated Lipschitz property for the noise distribution. Second, this new generalized mechanism is paired with a canonical loss function to obtain the canonical Lipschitz mechanism, which can directly select k-subsets out of dd items in O(dk+dlogd)O(dk+d \log d) time. The canonical loss function assesses subsets by how many users must change for the subset to become top-kk. Third, this composition-free approach to subset selection improves utility guarantees by an Ω(logk)\Omega(\log k) factor compared to one-by-one selection via sequential composition, and our experiments on synthetic and real-world data indicate substantial utility improvements.

View on arXiv
Comments on this paper