Group testing with nested pools
In order to identify the infected individuals of a population, their samples are divided in equally sized groups called pools and a single laboratory test is applied to each pool. Individuals whose samples belong to pools that test negative are declared healthy, while each pool that tests positive is divided into smaller, equally sized pools which are tested in the next stage. This scheme is called adaptive, because the composition of the pools at each stage depends on results from previous stages, and nested because each pool is a subset of a pool of the previous stage. Is the infection probability is not smaller than it is best to test each sample (no pooling). If , we compute the mean and the variance of the number of tests per individual as a function of the pool sizes in the first stages; in the -th stage all remaining samples are tested. The case was proposed by Dorfman in his seminal paper in 1943. The goal is to minimize , which is called the cost associated to~. We show that for the optimal choice is one of four possible schemes, which are explicitly described. For we show overwhelming numerical evidence that the best choice is , with a precise description of the range of 's where each holds. We then focus on schemes of the type , and estimate that the cost of the best scheme of this type for , determined by the choice of , is of order . This is the same order as that of the cost of the optimal scheme, and the difference of these costs is explicitly bounded. As an example, for the optimal choice is , , with cost ; that is, the mean number of tests required to screen 100 individuals is 20.
View on arXiv