39
0

Flexible Bivariate Beta Mixture Model: A Probabilistic Approach for Clustering Complex Data Structures

Abstract

Clustering is essential in data analysis and machine learning, but traditional algorithms like kk-means and Gaussian Mixture Models (GMM) often fail with nonconvex clusters. To address the challenge, we introduce the Flexible Bivariate Beta Mixture Model (FBBMM), which utilizes the flexibility of the bivariate beta distribution to handle diverse and irregular cluster shapes. Using the Expectation Maximization (EM) algorithm and Sequential Least Squares Programming (SLSQP) optimizer for parameter estimation, we validate FBBMM on synthetic and real-world datasets, demonstrating its superior performance in clustering complex data structures, offering a robust solution for big data analytics across various domains. We release the experimental code atthis https URL.

View on arXiv
@article{hsu2025_2502.19938,
  title={ Flexible Bivariate Beta Mixture Model: A Probabilistic Approach for Clustering Complex Data Structures },
  author={ Yung-Peng Hsu and Hung-Hsuan Chen },
  journal={arXiv preprint arXiv:2502.19938},
  year={ 2025 }
}
Comments on this paper