
K-Means and Gaussian Mixture Modeling with a Separation Constraint

Abstract

We consider the problem of clustering real-valued data with K-means and Gaussian mixture models under a constraint on the separation between the centers. We first propose a dynamic programming approach to solving the K-means problem with a separation constraint on the centers, building on Wang and Song (2011). For fitting a Gaussian mixture model, we then propose an EM algorithm that incorporates such a constraint. A separation constraint can help regularize the output of a clustering algorithm, and we provide both simulated and real data examples to illustrate this point.
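To make the dynamic programming idea concrete, the following is a minimal sketch of exact K-means on real-valued data in the spirit of the Wang and Song (2011) approach that the abstract builds on. It is not the constrained algorithm proposed in the paper: the dynamic program below is unconstrained, and the separation is only checked afterwards. The function names `kmeans_1d_dp` and `min_gap` and the threshold `delta` are hypothetical, introduced here for illustration.

```python
def kmeans_1d_dp(xs, k):
    """Exact, unconstrained K-means for real-valued data via dynamic programming.

    Illustrative sketch only; it does NOT enforce a separation constraint.
    Returns the sorted cluster centers and the total within-cluster sum of squares.
    """
    x = sorted(xs)
    n = len(x)
    if not 1 <= k <= n:
        raise ValueError("need 1 <= k <= len(xs)")

    # Prefix sums give the cost of any contiguous block in O(1).
    s1 = [0.0] * (n + 1)
    s2 = [0.0] * (n + 1)
    for i, v in enumerate(x):
        s1[i + 1] = s1[i] + v
        s2[i + 1] = s2[i] + v * v

    def sse(i, j):
        # Sum of squared deviations from the mean for x[i:j], with j > i.
        s = s1[j] - s1[i]
        return (s2[j] - s2[i]) - s * s / (j - i)

    inf = float("inf")
    # cost[m][j]: best cost of splitting the first j sorted points into m clusters.
    cost = [[inf] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    cost[0][0] = 0.0
    for m in range(1, k + 1):
        for j in range(m, n + 1):
            for i in range(m - 1, j):
                c = cost[m - 1][i] + sse(i, j)
                if c < cost[m][j]:
                    cost[m][j] = c
                    back[m][j] = i

    # Walk the back-pointers to recover the cluster means, right to left.
    centers = []
    j = n
    for m in range(k, 0, -1):
        i = back[m][j]
        centers.append((s1[j] - s1[i]) / (j - i))
        j = i
    centers.reverse()
    return centers, cost[k][n]


def min_gap(centers):
    """Smallest distance between consecutive sorted centers (inf if fewer than two)."""
    return min((b - a for a, b in zip(centers, centers[1:])), default=float("inf"))


if __name__ == "__main__":
    data = [1.0, 1.2, 0.8, 5.0, 5.2, 4.8]
    centers, total = kmeans_1d_dp(data, 2)
    delta = 1.0  # hypothetical minimum separation, chosen only for this demo
    print(centers, total, min_gap(centers) >= delta)
```

On the toy data in the demo, the two centers come out close to 1.0 and 5.0, so a hypothetical separation threshold of 1.0 is satisfied. This sketch only detects when an unconstrained solution violates a separation requirement; it does not search for the best solution that satisfies one.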
