169

Connect the dots: Dataset Condensation, Differential Privacy, and Adversarial Uncertainty

Abstract

Our work focuses on understanding the underpinning mechanism of dataset condensation by drawing connections with (ϵ\epsilon, δ\delta)-differential privacy where the optimal noise, ϵ\epsilon, is chosen by adversarial uncertainty \cite{Grining2017}. We can answer the question about the inner workings of the dataset condensation procedure. Previous work \cite{dong2022} proved the link between dataset condensation (DC) and (ϵ\epsilon, δ\delta)-differential privacy. However, it is unclear from existing works on ablating DC to obtain a lower-bound estimate of ϵ\epsilon that will suffice for creating high-fidelity synthetic data. We suggest that adversarial uncertainty is the most appropriate method to achieve an optimal noise level, ϵ\epsilon. As part of the internal dynamics of dataset condensation, we adopt a satisfactory scheme for noise estimation that guarantees high-fidelity data while providing privacy.

View on arXiv
Comments on this paper