9
0

Learning Partitions with Optimal Query and Round Complexities

Main:26 Pages
5 Figures
Bibliography:3 Pages
Abstract

We consider the basic problem of learning an unknown partition of nn elements into at most kk sets using simple queries that reveal information about a small subset of elements. Our starting point is the well-studied pairwise same-set queries which ask if a pair of elements belong to the same class. It is known that non-adaptive algorithms require Θ(n2)\Theta(n^2) queries, while adaptive algorithms require Θ(nk)\Theta(nk) queries, and the best known algorithm uses k1k-1 rounds. This problem has been studied extensively over the last two decades in multiple communities due to its fundamental nature and relevance to clustering, active learning, and crowd sourcing. In many applications, it is of high interest to reduce adaptivity while minimizing query complexity. We give a complete characterization of the deterministic query complexity of this problem as a function of the number of rounds, rr, interpolating between the non-adaptive and adaptive settings: for any constant rr, the query complexity is Θ(n1+12r1k112r1)\Theta(n^{1+\frac{1}{2^r-1}}k^{1-\frac{1}{2^r-1}}). Our algorithm only needs O(loglogn)O(\log \log n) rounds to attain the optimal O(nk)O(nk) query complexity.Next, we consider two generalizations of pairwise queries to subsets SS of size at most ss: (1) weak subset queries which return the number of classes intersected by SS, and (2) strong subset queries which return the entire partition restricted on SS. Once again in crowd sourcing applications, queries on large sets may be prohibitive. For non-adaptive algorithms, we show Ω(n2/s2)\Omega(n^2/s^2) strong queries are needed. Perhaps surprisingly, we show that there is a non-adaptive algorithm using weak queries that matches this bound up to log-factors for all sns \leq \sqrt{n}. More generally, we obtain nearly matching upper and lower bounds for algorithms using subset queries in terms of both the number of rounds, rr, and the query size bound, ss.

View on arXiv
@article{black2025_2505.05009,
  title={ Learning Partitions with Optimal Query and Round Complexities },
  author={ Hadley Black and Arya Mazumdar and Barna Saha },
  journal={arXiv preprint arXiv:2505.05009},
  year={ 2025 }
}
Comments on this paper