59
25
v1v2v3v4 (latest)

Optimality of Maximum Likelihood for Log-Concave Density Estimation and Bounded Convex Regression

Abstract

In this paper, we study two problems: (1) estimation of a dd-dimensional log-concave distribution and (2) bounded multivariate convex regression with random design with an underlying log-concave density or a compactly supported distribution with a continuous density. First, we show that for all d4d \ge 4 the maximum likelihood estimators of both problems achieve an optimal risk of Θd(n2/(d+1))\Theta_d(n^{-2/(d+1)}) (up to a logarithmic factor) in terms of squared Hellinger distance and L2L_2 squared distance, respectively. Previously, the optimality of both these estimators was known only for d3d\le 3. We also prove that the ϵ\epsilon-entropy numbers of the two aforementioned families are equal up to logarithmic factors. We complement these results by proving a sharp bound Θd(n2/(d+4))\Theta_d(n^{-2/(d+4)}) on the minimax rate (up to logarithmic factors) with respect to the total variation distance. Finally, we prove that estimating a log-concave density - even a uniform distribution on a convex set - up to a fixed accuracy requires the number of samples \emph{at least} exponential in the dimension. We do that by improving the dimensional constant in the best known lower bound for the minimax rate from 2dn2/(d+1)2^{-d}\cdot n^{-2/(d+1)} to cn2/(d+1)c\cdot n^{-2/(d+1)} (when d2d\geq 2).

View on arXiv
Comments on this paper