91
0

Provable Imbalanced Point Clustering

Abstract

We suggest efficient and provable methods to compute an approximation for imbalanced point clustering, that is, fitting kk-centers to a set of points in Rd\mathbb{R}^d, for any d,k1d,k\geq 1. To this end, we utilize \emph{coresets}, which, in the context of the paper, are essentially weighted sets of points in Rd\mathbb{R}^d that approximate the fitting loss for every model in a given set, up to a multiplicative factor of 1±ε1\pm\varepsilon. We provide [Section 3 and Section E in the appendix] experiments that show the empirical contribution of our suggested methods for real images (novel and reference), synthetic data, and real-world data. We also propose choice clustering, which by combining clustering algorithms yields better performance than each one separately.

View on arXiv
@article{denisov2025_2408.14225,
  title={ Provable Imbalanced Point Clustering },
  author={ David Denisov and Dan Feldman and Shlomi Dolev and Michael Segal },
  journal={arXiv preprint arXiv:2408.14225},
  year={ 2025 }
}
Comments on this paper