ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 0706.3434
375
8
v1v2 (latest)

Separating populations with wide data: A spectral analysis

25 June 2007
Avrim Blum
A. Coja-Oghlan
A. Frieze
Shuheng Zhou
ArXiv (abs)PDFHTML
Abstract

In this paper, we consider the problem of partitioning a small data sample drawn from a mixture of kkk product distributions. We are interested in the case that individual features are of low average quality γ\gammaγ, and we want to use as few of them as possible to correctly partition the sample. We analyze a spectral technique that is able to approximately optimize the total data size--the product of number of data points nnn and the number of features KKK--needed to correctly perform this partitioning as a function of 1/γ1/\gamma1/γ for K>nK>nK>n. Our goal is motivated by an application in clustering individuals according to their population of origin using markers, when the divergence between any two of the populations is small.

View on arXiv
Comments on this paper