Parallel Distributed Breadth First Search on the Kepler Architecture
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2014
Abstract
We present results obtained with an evolution of our CUDA-based solution for exploring large graphs via Breadth First Search. This latest version takes full advantage of the features of the Kepler architecture and relies on a 2D decomposition of the adjacency matrix to reduce the amount of communication among the GPUs. The final result is a code that can visit billions of edges per second on a cluster equipped with 4096 Tesla K20X GPUs.
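To illustrate the idea behind a 2D decomposition, the following is a minimal serial sketch in Python, not the authors' CUDA implementation. It assumes the adjacency matrix is split into an R x C grid of blocks, with one block per processor, and simulates a level-synchronous BFS in which each block scans only its own edges. The names `block_owner`, `bfs_2d`, and `peers_2d`, as well as the 64 x 64 grid used in the usage note, are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def block_owner(u, v, n, R, C):
    """Return the grid coordinates (i, j) of the block that stores edge (u, v).

    Assumption: rows and columns of the n x n adjacency matrix are split
    into contiguous ranges of ceil(n / R) rows and ceil(n / C) columns.
    """
    rows_per_block = -(-n // R)  # ceiling division
    cols_per_block = -(-n // C)
    return u // rows_per_block, v // cols_per_block


def bfs_2d(edges, n, R, C, source):
    """Level-synchronous BFS over edges partitioned into an R x C block grid.

    This is a serial simulation: the loop over blocks stands in for the
    work that each processor would do on its local block in parallel.
    """
    blocks = defaultdict(list)
    for u, v in edges:
        blocks[block_owner(u, v, n, R, C)].append((u, v))

    level = [-1] * n  # -1 marks an unvisited vertex
    level[source] = 0
    frontier = {source}
    depth = 0
    while frontier:
        depth += 1
        found = set()
        for block in blocks.values():
            # A block needs only the frontier vertices in its row range
            # and produces candidates only in its column range.
            for u, v in block:
                if u in frontier and level[v] == -1:
                    found.add(v)
        for v in found:
            level[v] = depth
        frontier = found
    return level


def peers_2d(R, C):
    """Number of other processors a block exchanges data with in a 2D layout:
    those sharing its grid row plus those sharing its grid column."""
    return (R - 1) + (C - 1)
```

As a usage example, `bfs_2d` applied to the undirected path 0-1-2-3 plus an isolated vertex 4, on a 2 x 2 grid, assigns levels 0, 1, 2, 3 to the path and leaves vertex 4 unvisited. For a hypothetical 64 x 64 arrangement of 4096 processors, `peers_2d(64, 64)` gives 126 communication partners per processor, compared with 4095 if every processor had to exchange data with every other one, which is the intuition for why a 2D layout reduces communication.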
