v1v2v3 (latest)

Exploiting Multiple Levels of Parallelism in Sparse Matrix-Matrix Multiplication

3 October 2015

Papers citing "Exploiting Multiple Levels of Parallelism in Sparse Matrix-Matrix Multiplication"

38 / 38 papers shown

Slicing Is All You Need: Towards A Universal One-Sided Algorithm for Distributed Matrix Multiplication

Benjamin Brock

Renato Golin

10 Oct 2025

Sparsity-Aware Communication for Distributed Graph Neural Network TrainingInternational Conference on Parallel Processing (ICPP), 2024

Ujjaini Mukhodopadhyay

382

07 Apr 2025

A sparsity-aware distributed-memory algorithm for sparse-sparse matrix multiplicationInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024

Yuxi Hong

A. Buluç

169

26 Aug 2024

Distributed-Memory Parallel Algorithms for Sparse Matrix and Sparse Tall-and-Skinny Matrix MultiplicationInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024

Isuru Ranawaka

Md Taufique Hussain

Charles Block

Gerasimos Gerogiannis

Josep Torrellas

Ariful Azad

188

21 Aug 2024

SpComm3D: A Framework for Enabling Sparse Communication in 3D Sparse Kernels

Nabil Abubaker

Torsten Hoefler

271

30 Apr 2024

NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator

Kaustubh Shivdikar

Nicolas Bohm Agostini

343

23 Apr 2024

RDMA-Based Algorithms for Sparse Matrix Multiplication on GPUsInternational Conference on Supercomputing (ICS), 2023

Benjamin Brock

A. Buluç

Katherine Yelick

221

29 Nov 2023

Optimization of SpGEMM with Risc-V vector instructions

Valentin Le Fèvre

Marc Casas

143

04 Mar 2023

A Distributed Block Chebyshev-Davidson Algorithm for Parallel Spectral ClusteringJournal of Scientific Computing (J. Sci. Comput.), 2022

Qiyuan Pang

Haizhao Yang

191

08 Dec 2022

Parallel, Portable Algorithms for Distance-2 Maximal Independent Set and Graph CoarseningIEEE International Parallel and Distributed Processing Symposium (IPDPS), 2022

Brian Kelley

S. Rajamanickam

141

06 Apr 2022

pylspack: Parallel algorithms and data structures for sketching, column subset selection, regression and leverage scoresACM Transactions on Mathematical Software (TOMS), 2022

Aleksandros Sobczyk

Efstratios Gallopoulos

222

05 Mar 2022

Fast Dynamic Updates and Dynamic SpGEMM on MPI-Distributed GraphsIEEE International Conference on Cluster Computing (Cluster), 2022

Alexander van der Grinten

G. Custers

Duy Le Thanh

Henning Meyerhenke

133

17 Feb 2022

Parallel Algorithms for Adding a Collection of Sparse MatricesIEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPS), 2021

173

19 Dec 2021

Parallel Algorithms for Masked Sparse Matrix-Matrix Products

129

18 Nov 2021

Combinatorial BLAS 2.0: Scaling combinatorial algorithms on distributed-memory systemsIEEE Transactions on Parallel and Distributed Systems (TPDS), 2021

118

28 Jun 2021

The Chunks and Tasks Matrix Library 2.0

104

23 Nov 2020

Communication-Avoiding and Memory-Constrained Sparse Matrix-Matrix Multiplication at Extreme ScaleIEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

157

16 Oct 2020

Reducing Communication in Graph Neural Network Training

Alok Tripathy

Katherine Yelick

A. Buluç

GNN

280

116

07 May 2020

Bandwidth-Optimized Parallel Algorithms for Sparse Matrix-Matrix Multiplication using Propagation BlockingACM Symposium on Parallelism in Algorithms and Architectures (SPAA), 2020

124

26 Feb 2020

A Systematic Survey of General Sparse Matrix-Matrix MultiplicationACM Computing Surveys (ACM CSUR), 2020

226

26 Feb 2020

Optimizing High Performance Markov Clustering for Pre-Exascale ArchitecturesIEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

114

24 Feb 2020

SpArch: Efficient Architecture for Sparse Matrix MultiplicationInternational Symposium on High-Performance Computer Architecture (HPCA), 2020

Zhekai Zhang

Hanrui Wang

Song Han

W. Dally

187

271

20 Feb 2020

The Parallelism Motifs of Genomic Data Analysis

Katherine Yelick

...

179

20 Jan 2020

Communication-Efficient Jaccard Similarity for High-Performance Distributed Genome Comparisons

Maciej Besta

Raghavendra Kanakagiri

393

11 Nov 2019

Efficient computation of the density matrix with error control on distributed computer systems

Anastasia Kruchinina

Elias Rudberg

Emanuel H. Rubensson

104

27 Sep 2019

Prior-preconditioned conjugate gradient method for accelerated Gibbs sampling in "large

n

& large

p

" Bayesian sparse regression

A. Nishimura

M. Suchard

372

29 Oct 2018

Implementing Push-Pull Efficiently in GraphBLAS

Carl Yang

A. Buluç

John Douglas Owens

217

10 Apr 2018

High-performance sparse matrix-matrix products on Intel KNL and multicore architectures

129

05 Apr 2018

Sparse Matrix Multiplication and Triangle Listing in the Congested Clique Model

K. Censor-Hillel

Dean Leitersdorf

Elia Turner

176

13 Feb 2018

Multi-threaded Sparse Matrix-Matrix Multiplication for Many-Core and GPU Architectures

Mehmet Deveci

C. Trott

S. Rajamanickam

144

09 Jan 2018

Communication-Avoiding Optimization Methods for Distributed Massive-Scale Sparse Inverse Covariance EstimationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2017

Katherine Yelick

123

30 Oct 2017

Distributed Triangle Counting in the Graphulo Matrix Math Library

D. Hutchison

120

20 Aug 2017

Increasing the Efficiency of Sparse Matrix-Matrix Multiplication with a 2.5D Algorithm and One-Sided MPI

124

29 May 2017

Scaling betweenness centrality using communication-efficient sparse matrix multiplication

227

22 Sep 2016

Novel Graph Processor Architecture, Prototype System, and Results

22 Jul 2016

Mathematical Foundations of the GraphBLAS

...

226

241

18 Jun 2016

Hypergraph Partitioning for Sparse Matrix-Matrix Multiplication

117

17 Mar 2016

Locality-aware parallel block-sparse matrix-matrix multiplication using the Chunks and Tasks programming model

Emanuel H. Rubensson

Elias Rudberg

193

30 Jan 2015