ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.01324
46
0
v1v2 (latest)

Near-Optimal Clustering in Mixture of Markov Chains

2 June 2025
Junghyun Lee
Yassir Jedra
Alexandre Proutière
Se-Young Yun
ArXiv (abs)PDFHTML
Main:8 Pages
Bibliography:7 Pages
Appendix:21 Pages
Abstract

We study the problem of clustering TTT trajectories of length HHH, each generated by one of KKK unknown ergodic Markov chains over a finite state space of size SSS. The goal is to accurately group trajectories according to their underlying generative model. We begin by deriving an instance-dependent, high-probability lower bound on the clustering error rate, governed by the weighted KL divergence between the transition kernels of the chains. We then present a novel two-stage clustering algorithm. In Stage~I, we apply spectral clustering using a new injective Euclidean embedding for ergodic Markov chains -- a contribution of independent interest that enables sharp concentration results. Stage~II refines the initial clusters via a single step of likelihood-based reassignment. Our method achieves a near-optimal clustering error with high probability, under the conditions H=Ω~(γps−1(S2∨πmin⁡−1))H = \tilde{\Omega}(\gamma_{\mathrm{ps}}^{-1} (S^2 \vee \pi_{\min}^{-1}))H=Ω~(γps−1​(S2∨πmin−1​)) and TH=Ω~(γps−1S2)TH = \tilde{\Omega}(\gamma_{\mathrm{ps}}^{-1} S^2 )TH=Ω~(γps−1​S2), where πmin⁡\pi_{\min}πmin​ is the minimum stationary probability of a state across the KKK chains and γps\gamma_{\mathrm{ps}}γps​ is the minimum pseudo-spectral gap. These requirements provide significant improvements, if not at least comparable, to the state-of-the-art guarantee (Kausik et al., 2023), and moreover, our algorithm offers a key practical advantage: unlike existing approach, it requires no prior knowledge of model-specific quantities (e.g., separation between kernels or visitation probabilities). We conclude by discussing the inherent gap between our upper and lower bounds, providing insights into the unique structure of this clustering problem.

View on arXiv
@article{lee2025_2506.01324,
  title={ Near-Optimal Clustering in Mixture of Markov Chains },
  author={ Junghyun Lee and Yassir Jedra and Alexandre Proutière and Se-Young Yun },
  journal={arXiv preprint arXiv:2506.01324},
  year={ 2025 }
}
Comments on this paper