390

Random Subgraph Detection Using Queries

Abstract

The planted densest subgraph detection problem refers to the task of testing whether in a given (random) graph there is a subgraph that is unusually dense. Specifically, we observe an undirected and unweighted graph on nn nodes. Under the null hypothesis, the graph is a realization of an Erd\H{o}s-R\'{e}nyi graph with edge probability (or, density) qq. Under the alternative, there is a subgraph on kk vertices with edge probability p>qp>q. The statistical as well as the computational barriers of this problem are well-understood for a wide range of the edge parameters pp and qq. In this paper, we consider a natural variant of the above problem, where one can only observe a small part of the graph using adaptive edge queries. For this model, we determine the number of queries necessary and sufficient for detecting the presence of the planted subgraph. Specifically, we show that any (possibly randomized) algorithm must make Q=Ω(n2k2χ4(pq)log2n)\mathsf{Q} = \Omega(\frac{n^2}{k^2\chi^4(p||q)}\log^2n) adaptive queries (on expectation) to the adjacency matrix of the graph to detect the planted subgraph with probability more than 1/21/2, where χ2(pq)\chi^2(p||q) is the Chi-Square distance. On the other hand, we devise a quasi-polynomial-time algorithm that detects the planted subgraph with high probability by making Q=O(n2k2χ4(pq)log2n)\mathsf{Q} = O(\frac{n^2}{k^2\chi^4(p||q)}\log^2n) non-adaptive queries. We then propose a polynomial-time algorithm which is able to detect the planted subgraph using Q=O(n3k3χ2(pq)log3n)\mathsf{Q} = O(\frac{n^3}{k^3\chi^2(p||q)}\log^3 n) queries. We conjecture that in the leftover regime, where n2k2Qn3k3\frac{n^2}{k^2}\ll\mathsf{Q}\ll \frac{n^3}{k^3}, no polynomial-time algorithms exist. Our results resolve two questions posed in \cite{racz2020finding}, where the special case of adaptive detection and recovery of a planted clique was considered.

View on arXiv
Comments on this paper