Quality control in sublinear time: a case study via random graphs
Many algorithms are designed to work well on average over inputs. When running such an algorithm on an arbitrary input, we must ask: Can we trust the algorithm on this input? We identify a new class of algorithmic problems addressing this, which we call "Quality Control Problems." These problems are specified by a (positive, real-valued) "quality function" and a distribution such that, with high probability, a sample drawn from is "high quality," meaning its -value is near . The goal is to accept inputs and reject potentially adversarially generated inputs with far from . The objective of quality control is thus weaker than either component problem: testing for "" or testing if , and offers the possibility of more efficient algorithms.In this work, we consider the sublinear version of the quality control problem, where and the goal is to solve the -quality problem with queries and time. As a case study, we consider random graphs, i.e., (and ), and the -clique count function , where is the number of -cliques in . Testing if with one sample, let alone with sublinear query access to the sample, is of course impossible. Testing if requires samples. In contrast, we show that the quality control problem for (with for some constant ) with respect to can be tested with queries and time, showing quality control is provably superpolynomially more efficient in this setting. More generally, for a motif of maximum degree , the respective quality control problem can be solved with queries and running time.
View on arXiv