247

Optimal partition recovery in general graphs

International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Abstract

We consider a novel graph-structured change point problem. We observe a random vector with piecewise constant mean and whose independent, sub-Gaussian coordinates correspond to the nn nodes of a fixed graph. We are interested in recovering the partition of the nodes associated to the constancy regions of the mean vector. Although graph-valued signals of this type have been previously studied in the literature for the different tasks of testing for the presence of an anomalous cluster and of estimating the mean vector, no localisation results are known outside the classical case of chain graphs. When the partition S\mathcal{S} consists of only two elements, we characterise the difficulty of the localisation problem in terms of: the maximal noise variance σ2\sigma^2, the size Δ\Delta of the smaller element of the partition, the magnitude κ\kappa of the difference in the signal values and the sum of the effective resistance edge weights r(S)|\partial_r(\mathcal{S})| of the corresponding cut. We demonstrate an information theoretical lower bound implying that, in the low signal-to-noise ratio regime κ2Δσ2r(S)11\kappa^2 \Delta \sigma^{-2} |\partial_r(\mathcal{S})|^{-1} \lesssim 1, no consistent estimator of the true partition exists. On the other hand, when κ2Δσ2r(S)1ζnlog{r(E)}\kappa^2 \Delta \sigma^{-2} |\partial_r(\mathcal{S})|^{-1} \gtrsim \zeta_n \log\{r(|E|)\}, with r(E)r(|E|) being the sum of effective resistance weighted edges and ζn\zeta_n being any diverging sequence in nn, we show that a polynomial-time, approximate 0\ell_0-penalised least squared estimator delivers a localisation error of order $ \kappa^{-2} \sigma^2 |\partial_r(\mathcal{S})| \log\{r(|E|)\}$. Aside from the log{r(E)}\log\{r(|E|)\} term, this rate is minimax optimal. Finally, we provide upper bounds on the localisation error for more general partitions of unknown sizes.

View on arXiv
Comments on this paper