Statistically efficient thinning of a Markov chain sampler
It is common to subsample Markov chain samples to reduce the storage burden of the output. It is also well known that discarding out of every observations will not improve statistical efficiency. It is less frequently remarked that subsampling a Markov chain allows one to omit some of the computation beyond that needed to simply advance the chain. When this reduced computation is accounted for, thinning the Markov chain by subsampling it can improve statistical efficiency. Given an autocorrelation parameter and a cost ratio , this paper shows how to compute the most efficient subsampling frequency . The optimal grows rapidly as increases towards . The resulting efficiency gain depends primarily on , not . Taking (no thinning) is optimal when . For it is optimal if and only if .
View on arXiv