Flexible Bivariate Beta Mixture Model: A Probabilistic Approach for Clustering Complex Data Structures

Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2025
Main: 8 Pages
3 Figures
Bibliography:2 Pages
8 Tables
Abstract

Clustering is essential in data analysis and machine learning, but traditional algorithms such as k-means and Gaussian Mixture Models (GMM) often fail on nonconvex clusters. To address this challenge, we introduce the Flexible Bivariate Beta Mixture Model (FBBMM), which uses the flexibility of the bivariate beta distribution to handle diverse and irregular cluster shapes. We estimate parameters with the Expectation Maximization (EM) algorithm and the Sequential Least Squares Programming (SLSQP) optimizer. We validate FBBMM on synthetic and real-world datasets, where it demonstrates superior performance in clustering complex data structures, offering a robust solution for big data analytics across various domains. We release the experimental code at this https URL.
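
The EM-plus-SLSQP estimation loop mentioned in the abstract can be illustrated with a minimal sketch. It is not the FBBMM density from the paper: as a loudly simplified stand-in, each component here is a product of two independent univariate beta distributions, so it cannot capture the dependence structure a true bivariate beta provides. The function names (`e_step`, `m_step`, `fit`), the parameter bounds, and the random initialization are all assumptions made for illustration.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import beta


def e_step(X, weights, params):
    """Responsibilities r[n, k] for data X in (0, 1)^2.

    params[k] = (a1, b1, a2, b2): an ASSUMED stand-in component made of two
    independent beta marginals, not the paper's bivariate beta density.
    """
    log_r = np.stack(
        [
            np.log(weights[k])
            + beta.logpdf(X[:, 0], params[k, 0], params[k, 1])
            + beta.logpdf(X[:, 1], params[k, 2], params[k, 3])
            for k in range(len(weights))
        ],
        axis=1,
    )
    log_r -= log_r.max(axis=1, keepdims=True)  # stabilize before exponentiating
    r = np.exp(log_r)
    return r / r.sum(axis=1, keepdims=True)


def m_step(X, r, params):
    """Update mixing weights in closed form and shape parameters with SLSQP."""
    weights = np.clip(r.mean(axis=0), 1e-12, None)
    weights /= weights.sum()
    new_params = params.copy()
    for k in range(params.shape[0]):

        def neg_ll(p, k=k):
            # Negative responsibility-weighted log-likelihood of component k.
            ll = beta.logpdf(X[:, 0], p[0], p[1]) + beta.logpdf(X[:, 1], p[2], p[3])
            return -np.sum(r[:, k] * ll)

        res = minimize(neg_ll, params[k], method="SLSQP", bounds=[(1e-3, 1e3)] * 4)
        # Keep the previous parameters if the optimizer returns something unusable.
        if np.all(np.isfinite(res.x)) and np.all(res.x > 0):
            new_params[k] = res.x
    return weights, new_params


def fit(X, K=2, n_iter=30, seed=0):
    """Run a fixed number of EM iterations; returns weights, params, responsibilities."""
    rng = np.random.default_rng(seed)
    weights = np.full(K, 1.0 / K)
    params = rng.uniform(1.0, 5.0, size=(K, 4))  # assumed random initialization
    for _ in range(n_iter):
        r = e_step(X, weights, params)
        weights, params = m_step(X, r, params)
    return weights, params, e_step(X, weights, params)
```

Calling `fit(X, K=2)` on an `(n, 2)` array with entries strictly inside (0, 1) returns the mixing weights, a `(K, 4)` parameter array, and the `(n, K)` responsibilities; hard cluster labels follow from `responsibilities.argmax(axis=1)`. Data outside the unit square would first need rescaling, for example min-max normalization per feature.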