11
2

Uncertainty quantification in the stochastic block model with an unknown number of classes

Abstract

We study the frequentist properties of Bayesian statistical inference for the stochastic block model, with an unknown number of classes of varying sizes. We equip the space of vertex labellings with a prior on the number of classes and, conditionally, a prior on the labels. The number of classes may grow to infinity as a function of the number of vertices, depending on the sparsity of the graph. We derive non-asymptotic posterior contraction rates of the form Pθ0,nΠn(BnXn)ϵnP_{\theta_{0,n}}\Pi_n(B_n\mid X^n)\le \epsilon_n, where XnX^n is the observed graph, generated according to Pθ0,nP_{\theta_{0,n}}, BnB_n is either {θ0,n}\{\theta_{0, n}\} or, in the very sparse case, a ball around θ0,n\theta_{0,n} of known extent, and ϵn\epsilon_n is an explicit rate of convergence. These results enable conversion of credible sets to confidence sets. In the sparse case, credible tests are shown to be confidence sets. In the very sparse case, credible sets are enlarged to form confidence sets. Confidence levels are explicit, for each nn, as a function of the credible level and the rate of convergence. Hypothesis testing between the number of classes is considered with the help of posterior odds, and is shown to be consistent. Explicit upper bounds on errors of the first and second type and an explicit lower bound on the power of the tests are given.

View on arXiv
Comments on this paper