ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1902.00194
341
38
v1v2v3v4 (latest)

Challenges with EM in application to weakly identifiable mixture models

International Conference on Artificial Intelligence and Statistics (AISTATS), 2019
1 February 2019
Raaz Dwivedi
Nhat Ho
K. Khamaru
Martin J. Wainwright
Sai Li
Bin Yu
ArXiv (abs)PDFHTML
Abstract

We study a class of weakly identifiable location-scale mixture models for which the maximum likelihood estimates based on nnn i.i.d. samples are known to have lower accuracy than the classical n−12n^{- \frac{1}{2}}n−21​ error. We investigate whether the Expectation-Maximization (EM) algorithm also converges slowly for these models. We first demonstrate via simulation studies a broad range of over-specified mixture models for which the EM algorithm converges very slowly, both in one and higher dimensions. We provide a complete analytical characterization of this behavior for fitting data generated from a multivariate standard normal distribution using two-component Gaussian mixture with varying location and scale parameters. Our results reveal distinct regimes in the convergence behavior of EM as a function of the dimension ddd. In the multivariate setting (d≥2d \geq 2d≥2), when the covariance matrix is constrained to a multiple of the identity matrix, the EM algorithm converges in order (n/d)12(n/d)^{\frac{1}{2}}(n/d)21​ steps and returns estimates that are at a Euclidean distance of order (n/d)−14{(n/d)^{-\frac{1}{4}}}(n/d)−41​ and (nd)−12{ (n d)^{- \frac{1}{2}}}(nd)−21​ from the true location and scale parameter respectively. On the other hand, in the univariate setting (d=1d = 1d=1), the EM algorithm converges in order n34n^{\frac{3}{4} }n43​ steps and returns estimates that are at a Euclidean distance of order n−18{ n^{- \frac{1}{8}}}n−81​ and n−14{ n^{-\frac{1} {4}}}n−41​ from the true location and scale parameter respectively. Establishing the slow rates in the univariate setting requires a novel localization argument with two stages, with each stage involving an epoch-based argument applied to a different surrogate EM operator at the population level. We also show multivariate (d≥2d \geq 2d≥2) examples, involving more general covariance matrices, that exhibit the same slow rates as the univariate case.

View on arXiv
Comments on this paper