188

A Statistical Taylor Theorem and Extrapolation of Truncated Densities

Annual Conference Computational Learning Theory (COLT), 2021
Abstract

We show a statistical version of Taylor's theorem and apply this result to non-parametric density estimation from truncated samples, which is a classical challenge in Statistics \cite{woodroofe1985estimating, stute1993almost}. The single-dimensional version of our theorem has the following implication: "For any distribution PP on [0,1][0, 1] with a smooth log-density function, given samples from the conditional distribution of PP on [a,a+ε][0,1][a, a + \varepsilon] \subset [0, 1], we can efficiently identify an approximation to PP over the \emph{whole} interval [0,1][0, 1], with quality of approximation that improves with the smoothness of PP." To the best of knowledge, our result is the first in the area of non-parametric density estimation from truncated samples, which works under the hard truncation model, where the samples outside some survival set SS are never observed, and applies to multiple dimensions. In contrast, previous works assume single dimensional data where each sample has a different survival set SS so that samples from the whole support will ultimately be collected.

View on arXiv
Comments on this paper