FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks

7 November 2020

Papers citing "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks"

26 / 26 papers shown

AutoSAGE: Input-Aware CUDA Scheduling for Sparse GNN Aggregation (SpMM/SDDMM) and CSR Attention

Aleksandar Stankovic

173

17 Nov 2025

FuseFlow: A Fusion-Centric Compilation Framework for Sparse Deep Learning on Streaming Dataflow

198

06 Nov 2025

Fused3S: Fast Sparse Attention on Tensor Cores

International Conference on Supercomputing (ICS), 2025

Zitong Li

Aparna Chandramowlishwaran

GNN

231

12 May 2025

Ember: A Compiler for Efficient Embedding Operations on Decoupled Access-Execute Architectures

...

347

14 Apr 2025

Edge Graph Intelligence: Reciprocally Empowering Edge Networks with Graph Intelligence

554

08 Jan 2025

FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving

...

690

177

02 Jan 2025

DF-GNN: Dynamic Fusion Framework for Attention Graph Neural Networks on GPUs

Learning on Graphs Conference (LoG), 2024

307

25 Nov 2024

Distributed-Memory Parallel Algorithms for Sparse Matrix and Sparse Tall-and-Skinny Matrix Multiplication

International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024

Isuru Ranawaka

Md Taufique Hussain

Charles Block

Gerasimos Gerogiannis

Josep Torrellas

Ariful Azad

251

21 Aug 2024

GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU

444

03 Apr 2024

iSpLib: A Library for Accelerating Graph Neural Networks using Auto-tuned Sparse Operations

243

21 Mar 2024

JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication

IEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2023

Qiang Fu

Thomas B. Rolinger

H. H. Huang

308

09 Dec 2023

Performance Optimization of Deep Learning Sparse Matrix Kernels on Intel Max Series GPU

Mohammad Zubair

Christoph Bauinger

291

01 Nov 2023

SENSEi: Input-Sensitive Compilation for Accelerating GNNs

Damitha Sandeepa Lenadora

Vimarsh Sathia

Gerasimos Gerogiannis

218

27 Jun 2023

A Survey on Graph Neural Network Acceleration: Algorithms, Systems, and Customized Hardware

315

24 Jun 2023

BitGNN: Unleashing the Performance Potential of Binary Graph Neural Networks on GPUs

International Conference on Supercomputing (ICS), 2023

379

04 May 2023

PhysGraph: Physics-Based Integration Using Graph Neural Networks

247

27 Jan 2023

Hector: An Efficient Programming and Compilation Framework for Implementing Relational Graph Neural Networks in GPU Architectures

International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2023

310

16 Jan 2023

Scalable Graph Convolutional Network Training on Distributed-Memory Systems

Proceedings of the VLDB Endowment (PVLDB), 2022

G. Demirci

Aparajita Haldar

Hakan Ferhatosmanoglu

GNN

393

09 Dec 2022

Architectural Implications of Embedding Dimension during GCN on CPU and GPU

M. Adiletta

David Brooks

Gu-Yeon Wei

GNN

110

01 Dec 2022

Distributed Graph Neural Network Training: A Survey

ACM Computing Surveys (ACM CSUR), 2022

Lei Chen

469

101

01 Nov 2022

RSC: Accelerating Graph Neural Networks Training via Randomized Sparse Computations

International Conference on Machine Learning (ICML), 2022

Daochen Zha

353

19 Oct 2022

SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning

International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022

548

128

11 Jul 2022

Parallel and Distributed Graph Neural Networks: An In-Depth Concurrency Analysis

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Maciej Besta

Torsten Hoefler

GNN

590

19 May 2022

Distributed-Memory Sparse Kernels for Machine Learning

IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2022

203

15 Mar 2022

A Comprehensive Analytical Survey on Unsupervised and Semi-Supervised Graph Representation Learning Methods

Md. Khaledur Rahman

A. Azad

AI4TS

210

20 Dec 2021

Parallel Minimum Spanning Forest Computation using Sparse Matrix Kernels

Tim Baer

Raghavendra Kanakagiri

Edgar Solomonik

268

10 Oct 2021