ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.07948
  4. Cited By
SparseDNN: Fast Sparse Deep Learning Inference on CPUs
v1v2v3v4 (latest)

SparseDNN: Fast Sparse Deep Learning Inference on CPUs

20 January 2021
Ziheng Wang
    MQ
ArXiv (abs)PDFHTMLGithub (53★)

Papers citing "SparseDNN: Fast Sparse Deep Learning Inference on CPUs"

12 / 12 papers shown
Title
Signal Collapse in One-Shot Pruning: When Sparse Models Fail to Distinguish Neural Representations
Signal Collapse in One-Shot Pruning: When Sparse Models Fail to Distinguish Neural Representations
Dhananjay Saikumar
Blesson Varghese
177
0
0
18 Feb 2025
Fast Tree-Field Integrators: From Low Displacement Rank to Topological
  Transformers
Fast Tree-Field Integrators: From Low Displacement Rank to Topological Transformers
Krzysztof Choromanski
Arijit Sehanobish
Somnath Basu Roy Chowdhury
Han Lin
Avinava Dubey
Tamás Sarlós
Snigdha Chaturvedi
AI4CE
149
2
0
22 Jun 2024
SNP: Structured Neuron-level Pruning to Preserve Attention Scores
SNP: Structured Neuron-level Pruning to Preserve Attention Scores
Kyunghwan Shim
Jaewoong Yun
Shinkook Choi
138
2
0
18 Apr 2024
DRIVE: Dual Gradient-Based Rapid Iterative Pruning
DRIVE: Dual Gradient-Based Rapid Iterative Pruning
Dhananjay Saikumar
Blesson Varghese
144
3
0
01 Apr 2024
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Vithursan Thangarasa
Mahmoud Salem
Shreyas Saxena
Kevin Leong
Joel Hestness
Sean Lie
MedIm
199
2
0
01 Mar 2024
Sculpting Efficiency: Pruning Medical Imaging Models for On-Device
  Inference
Sculpting Efficiency: Pruning Medical Imaging Models for On-Device Inference
Sudarshan Sreeram
Bernhard Kainz
176
0
0
10 Sep 2023
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication
  Kernels
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication Kernels
Vikas Natesh
Andrew Sabot
H. T. Kung
Mark Ting
150
2
0
08 Jul 2023
An Efficient Sparse Inference Software Accelerator for Transformer-based
  Language Models on CPUs
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs
Haihao Shen
Hengyu Meng
Bo Dong
Zhe Wang
Ofir Zafrir
...
Hanwen Chang
Qun Gao
Zi. Wang
Guy Boudoukh
Moshe Wasserblat
MoE
144
4
0
28 Jun 2023
SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language
  Models
SPDF: Sparse Pre-training and Dense Fine-tuning for Large Language ModelsConference on Uncertainty in Artificial Intelligence (UAI), 2023
Vithursan Thangarasa
Abhay Gupta
William Marshall
Tianda Li
Kevin Leong
D. DeCoste
Sean Lie
Shreyas Saxena
MoEAI4CE
269
27
0
18 Mar 2023
ZeroFL: Efficient On-Device Training for Federated Learning with Local
  Sparsity
ZeroFL: Efficient On-Device Training for Federated Learning with Local SparsityInternational Conference on Learning Representations (ICLR), 2022
Xinchi Qiu
Javier Fernandez-Marques
Pedro Gusmão
Yan Gao
Titouan Parcollet
Nicholas D. Lane
FedML
182
85
0
04 Aug 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and
  Applications
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
226
132
0
25 Apr 2022
Two Sparsities Are Better Than One: Unlocking the Performance Benefits
  of Sparse-Sparse Networks
Two Sparsities Are Better Than One: Unlocking the Performance Benefits of Sparse-Sparse Networks
Kevin Lee Hunter
Lawrence Spracklen
Subutai Ahmad
179
22
0
27 Dec 2021
1