163
v1v2 (latest)

Learning Mixtures of Gaussians with Censored Data

International Conference on Machine Learning (ICML), 2023
Abstract

We study the problem of learning mixtures of Gaussians with censored data. Statistical learning with censored data is a classical problem, with numerous practical applications, however, finite-sample guarantees for even simple latent variable models such as Gaussian mixtures are missing. Formally, we are given censored data from a mixture of univariate Gaussians i=1kwiN(μi,σ2), \sum_{i=1}^k w_i \mathcal{N}(\mu_i,\sigma^2), i.e. the sample is observed only if it lies inside a set SS. The goal is to learn the weights wiw_i and the means μi\mu_i. We propose an algorithm that takes only 1εO(k)\frac{1}{\varepsilon^{O(k)}} samples to estimate the weights wiw_i and the means μi\mu_i within ε\varepsilon error.

View on arXiv
Comments on this paper