Optimal estimation of variance in nonparametric regression with random design

Abstract

Consider the heteroscedastic nonparametric regression model with random design \begin{align*} Y_i = f(X_i) + V^{1/2}(X_i)\varepsilon_i, \quad i=1,2,\ldots,n, \end{align*} with $f(\cdot)$ and $V(\cdot)$ $\alpha$- and $\beta$-Hölder smooth, respectively. We show that the minimax rate of estimating $V(\cdot)$ under both local and global squared risks is of the order \begin{align*} \max\Big\{n^{-\frac{8\alpha\beta}{4\alpha\beta + 2\alpha + \beta}}, n^{-\frac{2\beta}{2\beta+1}}\Big\}. \end{align*} This result extends the fixed design rate $\max\{n^{-4\alpha}, n^{-2\beta/(2\beta+1)}\}$ derived in Wang et al. [2008] in a non-trivial manner, as indicated by the entanglement of $\alpha$ and $\beta$. In the special case of constant variance, we show that the minimax rate is $n^{-8\alpha/(4\alpha+1)}\vee n^{-1}$ for variance estimation, which further implies the same rate for quadratic functional estimation and thus unifies the minimax rate under the nonparametric regression model with those under the density model and the white noise model. To achieve the minimax rate, we develop a U-statistic-based local polynomial estimator and a lower bound that is constructed over a specified distribution family of randomness designed for both $\varepsilon_i$ and $X_i$.
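
To make the stated rate concrete, the sketch below evaluates the two exponents in the random design rate and reports which term dominates. The function names are hypothetical and chosen only for illustration. The crossover value $\alpha = \beta/(4\beta+2)$ is not stated in the abstract; it follows from setting the two exponents equal and solving for $\alpha$, so treat it as a derived convenience rather than a result quoted from the paper.

```python
from fractions import Fraction


def variance_rate_exponents(alpha, beta):
    """Return (a, b) such that the stated rate is max{n^-a, n^-b}.

    a = 8*alpha*beta / (4*alpha*beta + 2*alpha + beta)   (term driven by f)
    b = 2*beta / (2*beta + 1)                            (classical rate for V)
    """
    a = 8 * alpha * beta / (4 * alpha * beta + 2 * alpha + beta)
    b = 2 * beta / (2 * beta + 1)
    return a, b


def minimax_exponent(alpha, beta):
    """Exponent r with rate n^-r; for n > 1, max{n^-a, n^-b} = n^-min(a, b)."""
    return min(variance_rate_exponents(alpha, beta))


def crossover_alpha(beta):
    """Smoothness of f at which the two exponents coincide.

    Derived (assumption: plain algebra on the stated rate) from a = b,
    which simplifies to alpha = beta / (4*beta + 2).
    """
    return beta / (4 * beta + 2)


if __name__ == "__main__":
    # Exact arithmetic with Fraction avoids floating-point ties at the crossover.
    beta = Fraction(1)
    for alpha in (Fraction(1, 10), crossover_alpha(beta), Fraction(1)):
        a, b = variance_rate_exponents(alpha, beta)
        print(f"alpha={alpha}, beta={beta}: a={a}, b={b}, rate exponent={min(a, b)}")
```

As $\beta \to \infty$ the crossover tends to $\alpha = 1/4$ and the first exponent tends to $8\alpha/(4\alpha+1)$, which is consistent with the constant variance rate $n^{-8\alpha/(4\alpha+1)}\vee n^{-1}$ quoted above.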
