ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.01926
14
0

Are Easy Data Easy (for K-Means)

2 August 2023
M. Klopotek
ArXivPDFHTML
Abstract

This paper investigates the capability of correctly recovering well-separated clusters by various brands of the kkk-means algorithm. The concept of well-separatedness used here is derived directly from the common definition of clusters, which imposes an interplay between the requirements of within-cluster-homogenicity and between-clusters-diversity. Conditions are derived for a special case of well-separated clusters such that the global minimum of kkk-means cost function coincides with the well-separatedness. An experimental investigation is performed to find out whether or no various brands of kkk-means are actually capable of discovering well separated clusters. It turns out that they are not. A new algorithm is proposed that is a variation of kkk-means++ via repeated {sub}sampling when choosing a seed. The new algorithm outperforms four other algorithms from kkk-means family on the task.

View on arXiv
Comments on this paper