ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.08947
  4. Cited By
SpArch: Efficient Architecture for Sparse Matrix Multiplication

SpArch: Efficient Architecture for Sparse Matrix Multiplication

International Symposium on High-Performance Computer Architecture (HPCA), 2020
20 February 2020
Zhekai Zhang
Hanrui Wang
Song Han
W. Dally
ArXiv (abs)PDFHTML

Papers citing "SpArch: Efficient Architecture for Sparse Matrix Multiplication"

50 / 60 papers shown
From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs
From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs
Tianhao Zhu
Dahu Feng
Erhu Feng
Yubin Xia
174
1
0
07 Oct 2025
SparseMap: A Sparse Tensor Accelerator Framework Based on Evolution Strategy
SparseMap: A Sparse Tensor Accelerator Framework Based on Evolution StrategyIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2025
Boran Zhao
Haiming Zhai
Zihang Yuan
Hetian Liu
Tian Xia
Wenzhe zhao
Pengju Ren
97
1
0
18 Aug 2025
The Ubiquitous Sparse Matrix-Matrix Products
The Ubiquitous Sparse Matrix-Matrix Products
Aydın Buluç
202
1
0
06 Aug 2025
Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache QuantizationInternational Symposium on Computer Architecture (ISCA), 2025
Minsu Kim
Seongmin Hong
RyeoWook Ko
S. Choi
Hunjong Lee
Junsoo Kim
Joo-Young Kim
Jongse Park
362
13
0
24 Mar 2025
An Efficient Sparse Fine-Tuning with Low Quantization Error via Neural Network Pruning
An Efficient Sparse Fine-Tuning with Low Quantization Error via Neural Network Pruning
Cen-Jhih Li
Aditya Bhaskara
478
0
0
17 Feb 2025
EXION: Exploiting Inter- and Intra-Iteration Output Sparsity for Diffusion Models
EXION: Exploiting Inter- and Intra-Iteration Output Sparsity for Diffusion ModelsInternational Symposium on High-Performance Computer Architecture (HPCA), 2025
Jaehoon Heo
Adiwena Putra
Jieon Yoon
Sungwoong Yune
Hangyeol Lee
Ji-Hoon Kim
Joo-Young Kim
DiffM
380
8
0
10 Jan 2025
HC-SpMM: Accelerating Sparse Matrix-Matrix Multiplication for Graphs
  with Hybrid GPU Cores
HC-SpMM: Accelerating Sparse Matrix-Matrix Multiplication for Graphs with Hybrid GPU CoresIEEE International Conference on Data Engineering (ICDE), 2024
Zhonggen Li
Xiangyu Ke
Yifan Zhu
Yunjun Gao
Yaofeng Tu
386
2
0
12 Dec 2024
SHyPar: A Spectral Coarsening Approach to Hypergraph Partitioning
SHyPar: A Spectral Coarsening Approach to Hypergraph PartitioningIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2024
Hamed Sajadinia
Ali Aghdaei
Zhuo Feng
395
1
0
09 Oct 2024
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
Sparse Refinement for Efficient High-Resolution Semantic Segmentation
Zhijian Liu
Zhuoyang Zhang
Samir Khaki
Shang Yang
Haotian Tang
Chenfeng Xu
Kurt Keutzer
Song Han
SSeg
376
4
0
26 Jul 2024
SCATTER: Algorithm-Circuit Co-Sparse Photonic Accelerator with
  Thermal-Tolerant, Power-Efficient In-situ Light Redistribution
SCATTER: Algorithm-Circuit Co-Sparse Photonic Accelerator with Thermal-Tolerant, Power-Efficient In-situ Light Redistribution
Ziang Yin
Nicholas Gangi
Meng Zhang
Jeff Zhang
Rena Huang
Jiaqi Gu
362
9
0
07 Jul 2024
Misam: Using ML in Dataflow Selection of Sparse-Sparse Matrix
  Multiplication
Misam: Using ML in Dataflow Selection of Sparse-Sparse Matrix Multiplication
Sanjali Yadav
Bahar Asgari
121
0
0
14 Jun 2024
Secure and Efficient General Matrix Multiplication On Cloud Using
  Homomorphic Encryption
Secure and Efficient General Matrix Multiplication On Cloud Using Homomorphic EncryptionJournal of Supercomputing (J. Supercomput.), 2024
Yang Gao
Gang Quan
Soamar Homsi
Wujie Wen
Liqiang Wang
388
13
0
03 May 2024
Privacy-aware Berrut Approximated Coded Computing for Federated Learning
Privacy-aware Berrut Approximated Coded Computing for Federated Learning
Xavier Martínez Luana
Rebeca P. Díaz Redondo
Manuel Fernández-Veiga
FedML
525
2
0
02 May 2024
FLAASH: Flexible Accelerator Architecture for Sparse High-Order Tensor
  Contraction
FLAASH: Flexible Accelerator Architecture for Sparse High-Order Tensor Contraction
Gabriel Kulp
Andrew Ensinger
Lizhong Chen
222
3
0
25 Apr 2024
NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled
  Spatial Accelerator
NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator
Kaustubh Shivdikar
Nicolas Bohm Agostini
Malith Jayaweera
Gilbert Jonatan
José L. Abellán
Ajay Joshi
John Kim
David Kaeli
GNN
423
7
0
23 Apr 2024
Random Search as a Baseline for Sparse Neural Network Architecture
  Search
Random Search as a Baseline for Sparse Neural Network Architecture Search
Rezsa Farahani
337
0
0
13 Mar 2024
No Free Prune: Information-Theoretic Barriers to Pruning at
  Initialization
No Free Prune: Information-Theoretic Barriers to Pruning at Initialization
Tanishq Kumar
Kevin Luo
Mark Sellke
352
9
0
02 Feb 2024
Transformer-QEC: Quantum Error Correction Code Decoding with
  Transferable Transformers
Transformer-QEC: Quantum Error Correction Code Decoding with Transferable Transformers
Hanrui Wang
Pengyu Liu
Kevin Shao
Dantong Li
Jiaqi Gu
David Z. Pan
Yongshan Ding
Song Han
306
27
0
27 Nov 2023
TorchSparse++: Efficient Training and Inference Framework for Sparse
  Convolution on GPUs
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUsMicro (MICRO), 2023
Haotian Tang
Shang Yang
Zhijian Liu
Ke Hong
Zhongming Yu
Xiuyu Li
Guohao Dai
Yu Wang
Song Han
320
52
0
25 Oct 2023
SpikingNeRF: Making Bio-inspired Neural Networks See through the Real
  World
SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World
Xing-Hua Yao
Qinghao Hu
Tielong Liu
Zitao Mo
Zeyu Zhu
Zhengyang Zhuge
Jia Cheng
408
6
0
20 Sep 2023
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication
  Kernels
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication Kernels
Vikas Natesh
Andrew Sabot
H. T. Kung
Mark Ting
212
2
0
08 Jul 2023
Reparo: Loss-Resilient Generative Codec for Video Conferencing
Reparo: Loss-Resilient Generative Codec for Video Conferencing
Tianhong Li
Vibhaalakshmi Sivaraman
Pantea Karimi
Lijie Fan
M. Alizadeh
Dina Katabi
312
18
0
23 May 2023
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical
  Structured Sparsity
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity
Yannan Nellie Wu
Po-An Tsai
Saurav Muralidharan
A. Parashar
Vivienne Sze
J. Emer
312
46
0
22 May 2023
SPADE: Sparse Pillar-based 3D Object Detection Accelerator for
  Autonomous Driving
SPADE: Sparse Pillar-based 3D Object Detection Accelerator for Autonomous DrivingInternational Symposium on High-Performance Computer Architecture (HPCA), 2023
Minjae Lee
Seongmin Park
Hyung-Se Kim
Minyong Yoon
Jangwhan Lee
Junwon Choi
Nam Sung Kim
Mingu Kang
Jungwook Choi
3DPC
314
12
0
12 May 2023
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile
  Acceleration on CPUs
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUsInternational Symposium on High-Performance Computer Architecture (HPCA), 2023
Geonhwa Jeong
S. Damani
Abhimanyu Bambhaniya
Eric Qin
C. Hughes
S. Subramoney
Hyesoon Kim
T. Krishna
MoE
421
37
0
17 Feb 2023
SGCN: Exploiting Compressed-Sparse Features in Deep Graph Convolutional
  Network Accelerators
SGCN: Exploiting Compressed-Sparse Features in Deep Graph Convolutional Network AcceleratorsInternational Symposium on High-Performance Computer Architecture (HPCA), 2023
Mingi Yoo
Jaeyong Song
Jounghoo Lee
Namhyung Kim
Youngsok Kim
Jinho Lee
GNN
319
32
0
25 Jan 2023
Slice-and-Forge: Making Better Use of Caches for Graph Convolutional
  Network Accelerators
Slice-and-Forge: Making Better Use of Caches for Graph Convolutional Network AcceleratorsInternational Conference on Parallel Architectures and Compilation Techniques (PACT), 2022
Min-hee Yoo
Jaeyong Song
Hyeyoon Lee
Jounghoo Lee
Namhyung Kim
Youngsok Kim
Jinho Lee
GNN
318
5
0
24 Jan 2023
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight
  Grouping for Multi-Agent Reinforcement Learning
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement LearningInternational Conference on Field-Programmable Technology (ICFPT), 2022
Jenny Yang
Jaeuk Kim
Joo-Young Kim
226
2
0
29 Oct 2022
ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-DesignInternational Symposium on High-Performance Computer Architecture (HPCA), 2022
Haoran You
Zhanyi Sun
Huihong Shi
Zhongzhi Yu
Yang Zhao
Yongan Zhang
Chaojian Li
Baopu Li
Yingyan Lin
ViT
412
135
0
18 Oct 2022
Chiplets and the Codelet Model
Chiplets and the Codelet Model
D. Fox
J. M. Diaz
Xiaoming Li
105
0
0
13 Sep 2022
DiVa: An Accelerator for Differentially Private Machine Learning
DiVa: An Accelerator for Differentially Private Machine LearningMicro (MICRO), 2022
Beom-Joo Park
Ranggi Hwang
Dongho Yoon
Yoonhyuk Choi
Minsoo Rhu
311
13
0
26 Aug 2022
OpSparse: a Highly Optimized Framework for Sparse General Matrix
  Multiplication on GPUs
OpSparse: a Highly Optimized Framework for Sparse General Matrix Multiplication on GPUsIEEE Access (IEEE Access), 2022
Zhaoyang Du
Yijin Guan
Tianchan Guan
Dimin Niu
Linyong Huang
Hongzhong Zheng
Yuan Xie
304
10
0
15 Jun 2022
Accelerating CPU-Based Sparse General Matrix Multiplication With Binary
  Row Merging
Accelerating CPU-Based Sparse General Matrix Multiplication With Binary Row MergingIEEE Access (IEEE Access), 2022
Zhaoyang Du
Yijin Guan
Tianchan Guan
Dimin Niu
Hongzhong Zheng
Yuan Xie
320
4
0
14 Jun 2022
Sparseloop: An Analytical Approach To Sparse Tensor Accelerator Modeling
Sparseloop: An Analytical Approach To Sparse Tensor Accelerator ModelingMicro (MICRO), 2022
Yannan Nellie Wu
Po-An Tsai
A. Parashar
Vivienne Sze
J. Emer
287
87
0
12 May 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and
  Applications
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
286
136
0
25 Apr 2022
Boosting Pruned Networks with Linear Over-parameterization
Boosting Pruned Networks with Linear Over-parameterizationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Yundi Qian
Siyuan Pan
Xiaoshuang Li
Jie Zhang
Liang Hou
Xiaobing Tu
256
4
0
25 Apr 2022
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for
  Memory-Efficient Graph Convolutional Neural Networks
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural NetworksInternational Symposium on High-Performance Computer Architecture (HPCA), 2022
Ranggi Hwang
M. Kang
Jiwon Lee
D. Kam
Youngjoo Lee
Minsoo Rhu
GNN
628
38
0
01 Mar 2022
QOC: Quantum On-Chip Training with Parameter Shift and Gradient Pruning
QOC: Quantum On-Chip Training with Parameter Shift and Gradient PruningDesign Automation Conference (DAC), 2022
Hanrui Wang
Zi-Chen Li
Jiaqi Gu
Yongshan Ding
David Z. Pan
Song Han
585
61
0
26 Feb 2022
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian
  Approximation
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian ApproximationInternational Conference on Learning Representations (ICLR), 2022
Cong Guo
Yuxian Qiu
Jingwen Leng
Xiaotian Gao
Chen Zhang
Yunxin Liu
Fan Yang
Yuhao Zhu
Minyi Guo
MQ
289
89
0
14 Feb 2022
Blocking Techniques for Sparse Matrix Multiplication on Tensor
  Accelerators
Blocking Techniques for Sparse Matrix Multiplication on Tensor Accelerators
P. S. Labini
M. Bernaschi
Francesco Silvestri
Flavio Vella
147
3
0
11 Feb 2022
SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real
  Processing-In-Memory Systems
SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Systems
Christina Giannoula
Ivan Fernandez
Juan Gómez Luna
N. Koziris
G. Goumas
O. Mutlu
MoE
437
27
0
13 Jan 2022
Phantom: A High-Performance Computational Core for Sparse Convolutional
  Neural Networks
Phantom: A High-Performance Computational Core for Sparse Convolutional Neural Networks
Mahmood Azhar Qureshi
Arslan Munir
269
0
0
09 Nov 2021
Sextans: A Streaming Accelerator for General-Purpose Sparse-Matrix
  Dense-Matrix Multiplication
Sextans: A Streaming Accelerator for General-Purpose Sparse-Matrix Dense-Matrix MultiplicationSymposium on Field Programmable Gate Arrays (FPGA), 2021
Linghao Song
Yuze Chi
Atefeh Sohrabizadeh
Young-kyu Choi
Jason Lau
Jason Cong
GNN
361
86
0
22 Sep 2021
Towards Memory-Efficient Neural Networks via Multi-Level in situ
  Generation
Towards Memory-Efficient Neural Networks via Multi-Level in situ GenerationIEEE International Conference on Computer Vision (ICCV), 2021
Jiaqi Gu
Hanqing Zhu
Chenghao Feng
Mingjie Liu
Zixuan Jiang
Ray T. Chen
David Z. Pan
242
4
0
25 Aug 2021
QuantumNAS: Noise-Adaptive Search for Robust Quantum Circuits
QuantumNAS: Noise-Adaptive Search for Robust Quantum CircuitsInternational Symposium on High-Performance Computer Architecture (HPCA), 2021
Hanrui Wang
Yongshan Ding
Jiaqi Gu
Zirui Li
Chengyue Wu
David Z. Pan
Frederic T. Chong
Song Han
683
270
0
22 Jul 2021
S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN
  Acceleration
S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN AccelerationInternational Symposium on High-Performance Computer Architecture (HPCA), 2021
Zhi-Gang Liu
P. Whatmough
Yuhao Zhu
Matthew Mattina
MQ
284
110
0
16 Jul 2021
GPTPU: Accelerating Applications using Edge Tensor Processing Units
GPTPU: Accelerating Applications using Edge Tensor Processing UnitsInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2021
Kuan-Chieh Hsu
Hung-Wei Tseng
260
34
0
22 Jun 2021
SMASH: Sparse Matrix Atomic Scratchpad Hashing
SMASH: Sparse Matrix Atomic Scratchpad Hashing
Kaustubh Shivdikar
272
7
0
29 May 2021
Dual-side Sparse Tensor Core
Dual-side Sparse Tensor CoreInternational Symposium on Computer Architecture (ISCA), 2021
Yang-Feng Wang
Chen Zhang
Zhiqiang Xie
Cong Guo
Yunxin Liu
Jingwen Leng
279
94
0
20 May 2021
GPU Semiring Primitives for Sparse Neighborhood Methods
GPU Semiring Primitives for Sparse Neighborhood MethodsConference on Machine Learning and Systems (MLSys), 2021
Corey J. Nolet
Divye Gala
Edward Raff
Joe Eaton
Brad Rees
John Zedlewski
Tim Oates
241
7
0
13 Apr 2021
12
Next
Page 1 of 2