52
4

Multi-class Graph Clustering via Approximated Effective pp-Resistance

Abstract

This paper develops an approximation to the (effective) pp-resistance and applies it to multi-class clustering. Spectral methods based on the graph Laplacian and its generalization to the graph pp-Laplacian have been a backbone of non-euclidean clustering techniques. The advantage of the pp-Laplacian is that the parameter pp induces a controllable bias on cluster structure. The drawback of pp-Laplacian eigenvector based methods is that the third and higher eigenvectors are difficult to compute. Thus, instead, we are motivated to use the pp-resistance induced by the pp-Laplacian for clustering. For pp-resistance, small pp biases towards clusters with high internal connectivity while large pp biases towards clusters of small ``extent,'' that is a preference for smaller shortest-path distances between vertices in the cluster. However, the pp-resistance is expensive to compute. We overcome this by developing an approximation to the pp-resistance. We prove upper and lower bounds on this approximation and observe that it is exact when the graph is a tree. We also provide theoretical justification for the use of pp-resistance for clustering. Finally, we provide experiments comparing our approximated pp-resistance clustering to other pp-Laplacian based methods.

View on arXiv
Comments on this paper