31
16

Consistency of modularity clustering on random geometric graphs

Abstract

We consider a large class of random geometric graphs constructed from samples Xn={X1,X2,,Xn}\mathcal{X}_n = \{X_1,X_2,\ldots,X_n\} of independent, identically distributed observations of an underlying probability measure ν\nu on a bounded domain DRdD\subset \mathbb{R}^d. The popular `modularity' clustering method specifies a partition Un\mathcal{U}_n of the set Xn\mathcal{X}_n as the solution of an optimization problem. In this paper, under conditions on ν\nu and DD, we derive scaling limits of the modularity clustering on random geometric graphs. Among other results, we show a geometric form of consistency: When the number of clusters is a priori bounded above, the discrete optimal partitions Un\mathcal{U}_n converge in a certain sense to a continuum partition U\mathcal{U} of the underlying domain DD, characterized as the solution of a type of Kelvin's shape optimization problem.

View on arXiv
Comments on this paper