
Kernel K-means clustering of distributional data

Main:14 Pages
3 Figures
Bibliography:2 Pages
15 Tables
Appendix:8 Pages
Abstract

We consider the problem of clustering a sample of probability distributions from a random distribution on $\mathbb{R}^p$. Our proposed partitioning method makes use of a symmetric, positive-definite kernel $k$ and its associated reproducing kernel Hilbert space (RKHS) $\mathcal{H}$. By mapping each distribution to its corresponding kernel mean embedding in $\mathcal{H}$, we obtain a sample in this RKHS where we carry out the $K$-means clustering procedure, which provides an unsupervised classification of the original sample. The procedure is simple and computationally feasible even for dimension $p>1$. The simulation studies provide insight into the choice of the kernel and its tuning parameter. The performance of the proposed clustering procedure is illustrated on a collection of Synthetic Aperture Radar (SAR) images.
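
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: each distribution is observed only through a finite sample of points, the kernel is Gaussian with bandwidth `sigma` (the abstract does not name a kernel), the inner product of two embeddings is estimated by averaging the kernel over all cross pairs of points, and the $K$-means step is a Lloyd-type iteration started from a deterministic farthest-point seeding. The function names `gaussian_gram`, `embedding_gram`, and `kernel_kmeans` are hypothetical.

```python
import numpy as np


def gaussian_gram(X, Y, sigma):
    """Pairwise Gaussian kernel values k(x, y) = exp(-||x - y||^2 / (2 sigma^2))."""
    sq = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-sq / (2.0 * sigma ** 2))


def embedding_gram(samples, sigma):
    """Gram matrix of empirical kernel mean embeddings.

    samples is a list of arrays of shape (m_i, p), one array per distribution.
    G[i, j] estimates <mu_i, mu_j> in the RKHS by averaging k over all cross pairs.
    """
    n = len(samples)
    G = np.empty((n, n))
    for i in range(n):
        for j in range(i, n):
            G[i, j] = G[j, i] = gaussian_gram(samples[i], samples[j], sigma).mean()
    return G


def kernel_kmeans(G, n_clusters, n_iter=100):
    """Lloyd-type K-means in the RKHS, using only the Gram matrix G."""
    n = G.shape[0]
    diag = np.diag(G)
    # Squared RKHS distances between embeddings: ||mu_i - mu_j||^2.
    D = diag[:, None] - 2.0 * G + diag[None, :]

    # Assumed initialisation: farthest-point seeding, starting from index 0.
    centers = [0]
    for _ in range(1, n_clusters):
        centers.append(int(D[:, centers].min(axis=1).argmax()))
    labels = D[:, centers].argmin(axis=1)

    for _ in range(n_iter):
        dist = np.full((n, n_clusters), np.inf)
        for c in range(n_clusters):
            idx = np.flatnonzero(labels == c)
            if idx.size == 0:
                continue  # an empty cluster stays at infinite distance
            # ||mu_i - m_c||^2 = G_ii - 2 mean_j G_ij + mean_{j,l} G_jl,
            # where m_c is the average of the embeddings in cluster c.
            dist[:, c] = (
                diag
                - 2.0 * G[:, idx].mean(axis=1)
                + G[np.ix_(idx, idx)].mean()
            )
        new_labels = dist.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels
```

A typical call would be `labels = kernel_kmeans(embedding_gram(samples, sigma=1.0), n_clusters=2)`. Because the iteration only needs inner products between embeddings, the dimension $p$ enters solely through the kernel evaluations, which is one way to see why the approach remains feasible for $p>1$.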
