161

Breathing kk-Means

Abstract

We propose a new algorithm for the kk-means problem which repeatedly increases and decreases the number of centroids by mm in order to find an approximate solution. New centroids are inserted in areas where they will likely reduce the error. The subsequent removal of centroids is done such that the resulting raise in error is small. After each increase or decrease step standard kk-means is performed. Termination is guaranteed by decrementing mm after each increase/decrease cycle unless the overall error was lowered. In experiments with Gaussian mixture distributions the new algorithm produced on average solutions several percent better than kk-means++.

View on arXiv
Comments on this paper