Ensemble estimators for multivariate entropy estimation

Abstract

The problem of estimating density functionals such as entropy and mutual information has received much attention in the statistics and information theory communities. A large class of estimators of functionals of the probability density suffers from the curse of dimensionality: the mean squared error (MSE) decays increasingly slowly as a function of the sample size $T$ as the dimension $d$ of the samples increases. In particular, the rate is often glacially slow, of order $O(T^{-\gamma/d})$, where $\gamma > 0$ is a rate parameter. Examples of such estimators include kernel density estimators, $k$-nearest neighbor ($k$-NN) density estimators, $k$-NN entropy estimators, and intrinsic dimension estimators, among others. In this paper, we propose a weighted affine combination of an ensemble of such estimators, where optimal weights can be chosen such that the weighted estimator converges at the much faster, dimension-invariant rate of $O(T^{-1})$. Furthermore, we show that these optimal weights can be determined by solving a convex optimization problem, which can be performed offline and does not require training data. We illustrate the superior performance of our weighted estimator in two important applications: (i) estimating the Panter-Dite distortion-rate factor and (ii) estimating the Shannon entropy for testing the probability distribution of a random sample.
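To make the construction concrete, here is a minimal sketch of the ensemble idea, assuming Kozachenko-Leonenko $k$-NN entropy estimators as the base ensemble, indexed by the neighborhood parameter $k$, and a simplified bias model in which the leading bias terms scale as $k^{i/d}$. The helper names (`knn_entropy`, `ensemble_weights`) and the particular constraint set are illustrative, not the paper's exact construction.

```python
import math

import numpy as np
from scipy.optimize import minimize
from scipy.spatial import cKDTree
from scipy.special import digamma


def knn_entropy(X, k):
    """Kozachenko-Leonenko k-NN entropy estimate (standard form, in nats)."""
    T, d = X.shape
    tree = cKDTree(X)
    # Distance to the k-th nearest neighbor; the query returns each point
    # itself at distance 0, so ask for k+1 neighbors and take the last.
    r = tree.query(X, k=k + 1)[0][:, -1]
    unit_ball_vol = math.pi ** (d / 2) / math.gamma(d / 2 + 1)
    return digamma(T) - digamma(k) + math.log(unit_ball_vol) + d * np.mean(np.log(r))


def ensemble_weights(ks, d):
    """Illustrative convex program for the ensemble weights: minimize the
    squared l2 norm of the weights (a proxy for the ensemble variance)
    subject to the weights summing to one and annihilating bias terms
    assumed to scale as k**(i/d), i = 1, ..., d-1."""
    ks = np.asarray(ks, dtype=float)
    constraints = [{"type": "eq", "fun": lambda w: w.sum() - 1.0}]
    for i in range(1, d):
        constraints.append(
            {"type": "eq", "fun": lambda w, i=i: w @ ks ** (i / d)}
        )
    w0 = np.full(len(ks), 1.0 / len(ks))
    result = minimize(lambda w: w @ w, w0, method="SLSQP", constraints=constraints)
    return result.x


rng = np.random.default_rng(0)
X = rng.standard_normal((2000, 3))   # T = 2000 samples in d = 3 dimensions
ks = [4, 8, 16, 32, 64]              # ensemble of base estimators, one per k
w = ensemble_weights(ks, d=X.shape[1])
H_weighted = sum(wl * knn_entropy(X, k) for wl, k in zip(w, ks))
# True entropy of a 3-d standard normal: 1.5 * log(2 * pi * e) ~ 4.257 nats
print(f"weighted ensemble estimate: {H_weighted:.3f}")
```

Note that, consistent with the abstract's claim, the weight optimization above depends only on the ensemble parameters and the dimension, not on the data, so the weights can be computed offline once $T$, $d$, and the ensemble are fixed.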
