433
v1v2v3v4 (latest)

A Few Interactions Improve Distributed Nonparametric Estimation, Optimally

IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2021
Abstract

Consider the problem of nonparametric estimation of an unknown β\beta-H\"older smooth density pXYp_{XY} at a given point, where XX and YY are both dd dimensional. An infinite sequence of i.i.d.\ samples (Xi,Yi)(X_i,Y_i) are generated according to this distribution, and two terminals observe (Xi)(X_i) and (Yi)(Y_i), respectively. They are allowed to exchange kk bits either in oneway or interactively in order for Bob to estimate the unknown density. We show that the minimax mean square risk is order (klogk)2βd+2β\left(\frac{k}{\log k} \right)^{-\frac{2\beta}{d+2\beta}} for one-way protocols and k2βd+2βk^{-\frac{2\beta}{d+2\beta}} for interactive protocols. The logarithmic improvement is nonexistent in the parametric counterparts, and therefore can be regarded as a consequence of nonparametric nature of the problem. Moreover, a few rounds of interactions achieve the interactive minimax rate: the number of rounds can grow as slowly as the super-logarithm (i.e., inverse tetration) of kk. The proof of the upper bound is based on a novel multi-round scheme for estimating the joint distribution of a pair of biased Bernoulli variables, and the lower bound is built on a sharp estimate of a symmetric strong data processing constant for biased Bernoulli variables.

View on arXiv
Comments on this paper