
Fisher Information and Mutual Information Constraints

International Symposium on Information Theory (ISIT), 2021
Abstract

We consider the processing of statistical samples $X \sim P_\theta$ by a channel $p(y|x)$, and characterize how the statistical information from the samples for estimating the parameter $\theta \in \mathbb{R}^d$ can scale with the mutual information or capacity of the channel. We show that if the statistical model has a sub-Gaussian score function, then the trace of the Fisher information matrix for estimating $\theta$ from $Y$ can scale at most linearly with the mutual information between $X$ and $Y$. We apply this result to obtain minimax lower bounds in distributed statistical estimation problems, and obtain a tight preconstant for Gaussian mean estimation. We then show how our Fisher information bound can also imply mutual information or Jensen-Shannon divergence-based distributed strong data processing inequalities.
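For orientation, the central quantities can be written out as follows. The display below is a schematic reading of the abstract's main claim, not the paper's exact statement: the output density $q_\theta(y)$, the sub-Gaussian parameter $\sigma^2$ of the score function, and the constant $C$ are placeholder notation introduced here.

$$
I_Y(\theta) \;=\; \mathbb{E}\!\left[\nabla_\theta \log q_\theta(Y)\,\nabla_\theta \log q_\theta(Y)^{\top}\right],
\qquad
\operatorname{Tr}\!\big(I_Y(\theta)\big) \;\le\; C\,\sigma^2\, I(X;Y).
$$

Read this way, increasing the channel's mutual information buys at most a proportional increase in the Fisher information extractable about $\theta$, which is the mechanism behind the distributed minimax lower bounds described above.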
