Efficient Parallelization of a Ubiquitous Sequential Computation
Abstract
We find a succinct expression for computing the sequence $x_t = a_t x_{t-1} + b_t$ in parallel with two prefix sums, given $t = (1, 2, \dots, n)$, $a_t \in \mathbb{R}^n$, $b_t \in \mathbb{R}^n$, and initial value $x_0 \in \mathbb{R}$. On $n$ parallel processors, the computation of $n$ elements incurs $\mathcal{O}(\log n)$ time and $\mathcal{O}(n)$ space. Sequences of this form are ubiquitous in science and engineering, making efficient parallelization useful for a vast number of applications. We implement our expression in software, test it on parallel hardware, and verify that it executes faster than sequential computation by a factor of $\frac{n}{\log n}$.
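To illustrate how two prefix sums can replace the sequential loop, here is a minimal sketch, not the paper's implementation. It assumes the sequence is the first-order linear recurrence $x_t = a_t x_{t-1} + b_t$ and that every $a_t$ is strictly positive, so real logarithms suffice. It relies on the identity $x_t = A_t \left( x_0 + \sum_{k \le t} b_k / A_k \right)$ with $A_t = \prod_{k \le t} a_k$. The function names are hypothetical, and `itertools.accumulate` runs sequentially; on parallel hardware each call would be replaced by a parallel scan.

```python
import math
from itertools import accumulate


def sequential(a, b, x0):
    """Reference: evaluate x_t = a_t * x_{t-1} + b_t one step at a time."""
    xs, x = [], x0
    for a_t, b_t in zip(a, b):
        x = a_t * x + b_t
        xs.append(x)
    return xs


def via_two_prefix_sums(a, b, x0):
    """Same sequence, expressed with two prefix sums (assumes every a_t > 0)."""
    # Prefix sum 1: cumulative log-coefficients, so exp(log_A[t]) = a_1 * ... * a_t.
    log_A = list(accumulate(math.log(a_t) for a_t in a))
    # Prefix sum 2: running total of each b_t divided by a_1 * ... * a_t.
    scaled_b = list(accumulate(b_t * math.exp(-s) for b_t, s in zip(b, log_A)))
    # Elementwise combination: x_t = A_t * (x_0 + sum_{k<=t} b_k / A_k).
    return [math.exp(s) * (x0 + c) for s, c in zip(log_A, scaled_b)]


if __name__ == "__main__":
    a = [0.5, 2.0, 1.5, 0.25]
    b = [1.0, -3.0, 0.5, 2.0]
    print(sequential(a, b, 1.0))
    print(via_two_prefix_sums(a, b, 1.0))
```

The two functions should agree up to floating-point rounding. Dividing by the cumulative product can overflow or underflow for long sequences, which is one reason a practical version would keep more of the computation in log space.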
