51
2

On the Sample Complexity of Rank Regression from Pairwise Comparisons

Abstract

We consider a rank regression setting, in which a dataset of NN samples with features in Rd\mathbb{R}^d is ranked by an oracle via MM pairwise comparisons. Specifically, there exists a latent total ordering of the samples; when presented with a pair of samples, a noisy oracle identifies the one ranked higher with respect to the underlying total ordering. A learner observes a dataset of such comparisons and wishes to regress sample ranks from their features. We show that to learn the model parameters with ϵ>0\epsilon > 0 accuracy, it suffices to conduct MΩ(dNlog3N/ϵ2)M \in \Omega(dN\log^3 N/\epsilon^2) comparisons uniformly at random when NN is Ω(d/ϵ2)\Omega(d/\epsilon^2).

View on arXiv
Comments on this paper