v1v2 (latest)
Parallel Distributed Breadth First Search on the Kepler Architecture
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2014
Abstract
We present the results obtained by using an evolution of our CUDA-based solution for the exploration, via a Breadth First Search, of large graphs. This latest version exploits at its best the features of the Kepler architecture and relies on a combination of techniques to reduce both the number of communications among the GPUs and the amount of exchanged data. The final result is a code that can visit more than 800 billion edges in a second by using a cluster equipped with 4096 Tesla K20X GPUs.
View on arXivComments on this paper
