Minimax Estimation of the Distance
Abstract
We consider the problem of estimating the distance between two discrete probability measures and from empirical data in a nonasymptotic and large alphabet setting. We construct minimax rate-optimal estimators for when is either known or unknown, and show that the performance of the optimal estimators with samples is essentially that of the Maximum Likelihood Estimators (MLE) with samples. Hence, the \emph{effective sample size enlargement} phenomenon, identified in Jiao \emph{et al.} (2015), holds for this problem as well. However, the construction of optimal estimators for requires new techniques and insights beyond the \emph{Approximation} methodology of functional estimation in Jiao \emph{et al.} (2015).
View on arXivComments on this paper
