Information-theoretic Estimation of the Risk of Privacy Leaks

Main: 4 pages
4 figures
Bibliography: 1 page
Appendix: 1 page
Abstract

Recent work~\cite{Liu2016} has shown that dependencies between items in a dataset can lead to privacy leaks. We extend this concept to privacy-preserving transformations, considering a broader set of dependencies captured by correlation metrics. Specifically, we measure the correlation between the original data and their noisy responses from a randomizer as an indicator of potential privacy breaches. This paper aims to leverage information-theoretic measures, such as the Maximal Information Coefficient (MIC), to estimate privacy leaks and derive novel, computationally efficient privacy leak estimators. We extend the $\rho_1$-to-$\rho_2$ formulation~\cite{Evfimievski2003} to incorporate entropy, mutual information, and the degree of anonymity for a more comprehensive measure of privacy risk. Our proposed hybrid metric can identify correlation dependencies between attributes in the dataset, serving as a proxy for privacy leak vulnerabilities. This metric provides a computationally efficient worst-case measure of privacy loss, utilizing the inherent characteristics of the data to prevent privacy breaches.
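
The idea of using the dependence between original data and a randomizer's noisy responses as a leak indicator can be illustrated with a minimal sketch. This is not the paper's estimator: it assumes binary data, a simple randomized-response mechanism, and a plug-in estimate of plain mutual information standing in for MIC. The names `randomized_response`, `mutual_information`, `p_truth`, and `leak_by_p` are hypothetical and chosen only for this illustration.

```python
import math
import random
from collections import Counter


def randomized_response(bits, p_truth, rng):
    """Report each bit truthfully with probability p_truth, otherwise flip it.

    Assumed toy randomizer; the paper's randomizer may differ.
    """
    return [b if rng.random() < p_truth else 1 - b for b in bits]


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in bits from paired discrete samples."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    mi = 0.0
    for (x, y), c in joint.items():
        # Each term is p(x,y) * log2( p(x,y) / (p(x) * p(y)) ),
        # written with raw counts: p(x,y) = c/n, p(x) = px[x]/n, p(y) = py[y]/n.
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


# Synthetic binary attribute; the seed is fixed so runs are repeatable.
rng = random.Random(0)
original = [rng.randint(0, 1) for _ in range(20000)]

# Estimated dependence between the original bits and their noisy responses
# for three truth probabilities: pure noise, moderate noise, no noise.
leak_by_p = {
    p: mutual_information(original, randomized_response(original, p, rng))
    for p in (0.5, 0.75, 1.0)
}

for p, mi in leak_by_p.items():
    print(f"p_truth={p:.2f}  estimated I(X;Y)={mi:.4f} bits")
```

Under these assumptions, the estimate is close to zero when responses are pure coin flips (`p_truth = 0.5`) and rises toward the entropy of the original attribute as the randomizer adds less noise, which is the sense in which dependence acts as a proxy for leak risk.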

@article{odoh2025_2506.12328,
  title={Information-theoretic Estimation of the Risk of Privacy Leaks},
  author={Kenneth Odoh},
  journal={arXiv preprint arXiv:2506.12328},
  year={2025}
}