Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.08947
Cited By
SpArch: Efficient Architecture for Sparse Matrix Multiplication
International Symposium on High-Performance Computer Architecture (HPCA), 2020
20 February 2020
Zhekai Zhang
Hanrui Wang
Song Han
W. Dally
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SpArch: Efficient Architecture for Sparse Matrix Multiplication"
50 / 60 papers shown
From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs
Tianhao Zhu
Dahu Feng
Erhu Feng
Yubin Xia
174
1
0
07 Oct 2025
SparseMap: A Sparse Tensor Accelerator Framework Based on Evolution Strategy
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2025
Boran Zhao
Haiming Zhai
Zihang Yuan
Hetian Liu
Tian Xia
Wenzhe zhao
Pengju Ren
97
1
0
18 Aug 2025
The Ubiquitous Sparse Matrix-Matrix Products
Aydın Buluç
202
1
0
06 Aug 2025
Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
International Symposium on Computer Architecture (ISCA), 2025
Minsu Kim
Seongmin Hong
RyeoWook Ko
S. Choi
Hunjong Lee
Junsoo Kim
Joo-Young Kim
Jongse Park
362
13
0
24 Mar 2025
An Efficient Sparse Fine-Tuning with Low Quantization Error via Neural Network Pruning
Cen-Jhih Li
Aditya Bhaskara
478
0
0
17 Feb 2025
EXION: Exploiting Inter- and Intra-Iteration Output Sparsity for Diffusion Models
International Symposium on High-Performance Computer Architecture (HPCA), 2025
Jaehoon Heo
Adiwena Putra
Jieon Yoon
Sungwoong Yune
Hangyeol Lee
Ji-Hoon Kim
Joo-Young Kim
DiffM
380
8
0
10 Jan 2025
HC-SpMM: Accelerating Sparse Matrix-Matrix Multiplication for Graphs with Hybrid GPU Cores
IEEE International Conference on Data Engineering (ICDE), 2024
Zhonggen Li
Xiangyu Ke
Yifan Zhu
Yunjun Gao
Yaofeng Tu
386
2
0
12 Dec 2024
SHyPar: A Spectral Coarsening Approach to Hypergraph Partitioning
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2024
Hamed Sajadinia
Ali Aghdaei
Zhuo Feng
395
1
0
09 Oct 2024
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
Zhijian Liu
Zhuoyang Zhang
Samir Khaki
Shang Yang
Haotian Tang
Chenfeng Xu
Kurt Keutzer
Song Han
SSeg
376
4
0
26 Jul 2024
SCATTER: Algorithm-Circuit Co-Sparse Photonic Accelerator with Thermal-Tolerant, Power-Efficient In-situ Light Redistribution
Ziang Yin
Nicholas Gangi
Meng Zhang
Jeff Zhang
Rena Huang
Jiaqi Gu
362
9
0
07 Jul 2024
Misam: Using ML in Dataflow Selection of Sparse-Sparse Matrix Multiplication
Sanjali Yadav
Bahar Asgari
121
0
0
14 Jun 2024
Secure and Efficient General Matrix Multiplication On Cloud Using Homomorphic Encryption
Journal of Supercomputing (J. Supercomput.), 2024
Yang Gao
Gang Quan
Soamar Homsi
Wujie Wen
Liqiang Wang
388
13
0
03 May 2024
Privacy-aware Berrut Approximated Coded Computing for Federated Learning
Xavier Martínez Luana
Rebeca P. Díaz Redondo
Manuel Fernández-Veiga
FedML
525
2
0
02 May 2024
FLAASH: Flexible Accelerator Architecture for Sparse High-Order Tensor Contraction
Gabriel Kulp
Andrew Ensinger
Lizhong Chen
222
3
0
25 Apr 2024
NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator
Kaustubh Shivdikar
Nicolas Bohm Agostini
Malith Jayaweera
Gilbert Jonatan
José L. Abellán
Ajay Joshi
John Kim
David Kaeli
GNN
423
7
0
23 Apr 2024
Random Search as a Baseline for Sparse Neural Network Architecture Search
Rezsa Farahani
337
0
0
13 Mar 2024
No Free Prune: Information-Theoretic Barriers to Pruning at Initialization
Tanishq Kumar
Kevin Luo
Mark Sellke
352
9
0
02 Feb 2024
Transformer-QEC: Quantum Error Correction Code Decoding with Transferable Transformers
Hanrui Wang
Pengyu Liu
Kevin Shao
Dantong Li
Jiaqi Gu
David Z. Pan
Yongshan Ding
Song Han
306
27
0
27 Nov 2023
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs
Micro (MICRO), 2023
Haotian Tang
Shang Yang
Zhijian Liu
Ke Hong
Zhongming Yu
Xiuyu Li
Guohao Dai
Yu Wang
Song Han
320
52
0
25 Oct 2023
SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World
Xing-Hua Yao
Qinghao Hu
Tielong Liu
Zitao Mo
Zeyu Zhu
Zhengyang Zhuge
Jia Cheng
408
6
0
20 Sep 2023
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication Kernels
Vikas Natesh
Andrew Sabot
H. T. Kung
Mark Ting
212
2
0
08 Jul 2023
Reparo: Loss-Resilient Generative Codec for Video Conferencing
Tianhong Li
Vibhaalakshmi Sivaraman
Pantea Karimi
Lijie Fan
M. Alizadeh
Dina Katabi
312
18
0
23 May 2023
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity
Yannan Nellie Wu
Po-An Tsai
Saurav Muralidharan
A. Parashar
Vivienne Sze
J. Emer
312
46
0
22 May 2023
SPADE: Sparse Pillar-based 3D Object Detection Accelerator for Autonomous Driving
International Symposium on High-Performance Computer Architecture (HPCA), 2023
Minjae Lee
Seongmin Park
Hyung-Se Kim
Minyong Yoon
Jangwhan Lee
Junwon Choi
Nam Sung Kim
Mingu Kang
Jungwook Choi
3DPC
314
12
0
12 May 2023
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
International Symposium on High-Performance Computer Architecture (HPCA), 2023
Geonhwa Jeong
S. Damani
Abhimanyu Bambhaniya
Eric Qin
C. Hughes
S. Subramoney
Hyesoon Kim
T. Krishna
MoE
421
37
0
17 Feb 2023
SGCN: Exploiting Compressed-Sparse Features in Deep Graph Convolutional Network Accelerators
International Symposium on High-Performance Computer Architecture (HPCA), 2023
Mingi Yoo
Jaeyong Song
Jounghoo Lee
Namhyung Kim
Youngsok Kim
Jinho Lee
GNN
319
32
0
25 Jan 2023
Slice-and-Forge: Making Better Use of Caches for Graph Convolutional Network Accelerators
International Conference on Parallel Architectures and Compilation Techniques (PACT), 2022
Min-hee Yoo
Jaeyong Song
Hyeyoon Lee
Jounghoo Lee
Namhyung Kim
Youngsok Kim
Jinho Lee
GNN
318
5
0
24 Jan 2023
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning
International Conference on Field-Programmable Technology (ICFPT), 2022
Jenny Yang
Jaeuk Kim
Joo-Young Kim
226
2
0
29 Oct 2022
ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
International Symposium on High-Performance Computer Architecture (HPCA), 2022
Haoran You
Zhanyi Sun
Huihong Shi
Zhongzhi Yu
Yang Zhao
Yongan Zhang
Chaojian Li
Baopu Li
Yingyan Lin
ViT
412
135
0
18 Oct 2022
Chiplets and the Codelet Model
D. Fox
J. M. Diaz
Xiaoming Li
105
0
0
13 Sep 2022
DiVa: An Accelerator for Differentially Private Machine Learning
Micro (MICRO), 2022
Beom-Joo Park
Ranggi Hwang
Dongho Yoon
Yoonhyuk Choi
Minsoo Rhu
311
13
0
26 Aug 2022
OpSparse: a Highly Optimized Framework for Sparse General Matrix Multiplication on GPUs
IEEE Access (IEEE Access), 2022
Zhaoyang Du
Yijin Guan
Tianchan Guan
Dimin Niu
Linyong Huang
Hongzhong Zheng
Yuan Xie
304
10
0
15 Jun 2022
Accelerating CPU-Based Sparse General Matrix Multiplication With Binary Row Merging
IEEE Access (IEEE Access), 2022
Zhaoyang Du
Yijin Guan
Tianchan Guan
Dimin Niu
Hongzhong Zheng
Yuan Xie
320
4
0
14 Jun 2022
Sparseloop: An Analytical Approach To Sparse Tensor Accelerator Modeling
Micro (MICRO), 2022
Yannan Nellie Wu
Po-An Tsai
A. Parashar
Vivienne Sze
J. Emer
287
87
0
12 May 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
286
136
0
25 Apr 2022
Boosting Pruned Networks with Linear Over-parameterization
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yundi Qian
Siyuan Pan
Xiaoshuang Li
Jie Zhang
Liang Hou
Xiaobing Tu
256
4
0
25 Apr 2022
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks
International Symposium on High-Performance Computer Architecture (HPCA), 2022
Ranggi Hwang
M. Kang
Jiwon Lee
D. Kam
Youngjoo Lee
Minsoo Rhu
GNN
628
38
0
01 Mar 2022
QOC: Quantum On-Chip Training with Parameter Shift and Gradient Pruning
Design Automation Conference (DAC), 2022
Hanrui Wang
Zi-Chen Li
Jiaqi Gu
Yongshan Ding
David Z. Pan
Song Han
585
61
0
26 Feb 2022
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation
International Conference on Learning Representations (ICLR), 2022
Cong Guo
Yuxian Qiu
Jingwen Leng
Xiaotian Gao
Chen Zhang
Yunxin Liu
Fan Yang
Yuhao Zhu
Minyi Guo
MQ
289
89
0
14 Feb 2022
Blocking Techniques for Sparse Matrix Multiplication on Tensor Accelerators
P. S. Labini
M. Bernaschi
Francesco Silvestri
Flavio Vella
147
3
0
11 Feb 2022
SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems
Christina Giannoula
Ivan Fernandez
Juan Gómez Luna
N. Koziris
G. Goumas
O. Mutlu
MoE
437
27
0
13 Jan 2022
Phantom: A High-Performance Computational Core for Sparse Convolutional Neural Networks
Mahmood Azhar Qureshi
Arslan Munir
269
0
0
09 Nov 2021
Sextans: A Streaming Accelerator for General-Purpose Sparse-Matrix Dense-Matrix Multiplication
Symposium on Field Programmable Gate Arrays (FPGA), 2021
Linghao Song
Yuze Chi
Atefeh Sohrabizadeh
Young-kyu Choi
Jason Lau
Jason Cong
GNN
361
86
0
22 Sep 2021
Towards Memory-Efficient Neural Networks via Multi-Level in situ Generation
IEEE International Conference on Computer Vision (ICCV), 2021
Jiaqi Gu
Hanqing Zhu
Chenghao Feng
Mingjie Liu
Zixuan Jiang
Ray T. Chen
David Z. Pan
242
4
0
25 Aug 2021
QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits
International Symposium on High-Performance Computer Architecture (HPCA), 2021
Hanrui Wang
Yongshan Ding
Jiaqi Gu
Zirui Li
Chengyue Wu
David Z. Pan
Frederic T. Chong
Song Han
683
270
0
22 Jul 2021
S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN Acceleration
International Symposium on High-Performance Computer Architecture (HPCA), 2021
Zhi-Gang Liu
P. Whatmough
Yuhao Zhu
Matthew Mattina
MQ
284
110
0
16 Jul 2021
GPTPU: Accelerating Applications using Edge Tensor Processing Units
International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2021
Kuan-Chieh Hsu
Hung-Wei Tseng
260
34
0
22 Jun 2021
SMASH: Sparse Matrix Atomic Scratchpad Hashing
Kaustubh Shivdikar
272
7
0
29 May 2021
Dual-side Sparse Tensor Core
International Symposium on Computer Architecture (ISCA), 2021
Yang-Feng Wang
Chen Zhang
Zhiqiang Xie
Cong Guo
Yunxin Liu
Jingwen Leng
279
94
0
20 May 2021
GPU Semiring Primitives for Sparse Neighborhood Methods
Conference on Machine Learning and Systems (MLSys), 2021
Corey J. Nolet
Divye Gala
Edward Raff
Joe Eaton
Brad Rees
John Zedlewski
Tim Oates
241
7
0
13 Apr 2021
1
2
Next
Page 1 of 2