499

FriendlyCore: Practical Differentially Private Aggregation

Abstract

Differentially private algorithms for common metric aggregation tasks, such as clustering or averaging, often have limited practicality due to their complexity or a large number of data points that is required for accurate results. We propose a simple and practical tool, FriendlyCore\mathsf{FriendlyCore}, that takes a set of points D{\cal D} from an unrestricted (pseudo) metric space as input. When D{\cal D} has effective diameter rr, FriendlyCore\mathsf{FriendlyCore} returns a ``stable'' subset CD{\cal C} \subseteq {\cal D} that includes all points, except possibly few outliers, and is {\em certified} to have diameter rr. FriendlyCore\mathsf{FriendlyCore} can be used to preprocess the input before privately aggregating it, potentially simplifying the aggregation or boosting its accuracy. Surprisingly, FriendlyCore\mathsf{FriendlyCore} is light-weight with no dependence on the dimension. We empirically demonstrate its advantages in boosting the accuracy of mean estimation and clustering tasks such as kk-means and kk-GMM, outperforming tailored methods.

View on arXiv
Comments on this paper