FriendlyCore: Practical Differentially Private Aggregation
Differentially private algorithms for common metric aggregation tasks, such as clustering or averaging, often have limited practicality due to their complexity or a large number of data points that is required for accurate results. We propose a simple and practical tool, , that takes a set of points from an unrestricted (pseudo) metric space as input. When has effective diameter , returns a ``stable'' subset that includes all points, except possibly few outliers, and is {\em certified} to have diameter . can be used to preprocess the input before privately aggregating it, potentially simplifying the aggregation or boosting its accuracy. Surprisingly, is light-weight with no dependence on the dimension. We empirically demonstrate its advantages in boosting the accuracy of mean estimation and clustering tasks such as -means and -GMM, outperforming tailored methods.
View on arXiv