13
0

Robustness against data loss with Algebraic Statistics

Abstract

The paper describes an algorithm that, given an initial design Fn\mathcal{F}_n of size nn and a linear model with pp parameters, provides a sequence FnFnkFp\mathcal{F}_n \supset \ldots \supset \mathcal{F}_{n-k} \supset \ldots \supset \mathcal{F}_p of nested \emph{robust} designs. The sequence is obtained by the removal, one by one, of the runs of Fn\mathcal{F}_n till a pp-run \emph{saturated} design Fp\mathcal{F}_p is obtained. The potential impact of the algorithm on real applications is high. The initial fraction Fn\mathcal{F}_n can be of any type and the output sequence can be used to organize the experimental activity. The experiments can start with the runs corresponding to Fp\mathcal{F}_p and continue adding one run after the other (from Fnk\mathcal{F}_{n-k} to Fnk+1\mathcal{F}_{n-k+1}) till the initial design Fn\mathcal{F}_n is obtained. In this way, if for some unexpected reasons the experimental activity must be stopped before the end when only nkn-k runs are completed, the corresponding Fnk\mathcal{F}_{n-k} has a high value of robustness for k{1,,np}k \in \{1, \ldots, n-p\}. The algorithm uses the circuit basis, a special representation of the kernel of a matrix with integer entries. The effectiveness of the algorithm is demonstrated through the use of simulations.

View on arXiv
Comments on this paper