Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1708.04485
Cited By
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
23 May 2017
A. Parashar
Minsoo Rhu
Anurag Mukkara
A. Puglielli
Rangharajan Venkatesan
Brucek Khailany
J. Emer
S. Keckler
W. Dally
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks"
50 / 296 papers shown
KAN-SAs: Efficient Acceleration of Kolmogorov-Arnold Networks on Systolic Arrays
Sohaib Errabii
Olivier Sentieys
Marcello Traiola
103
2
0
20 Nov 2025
NeuroFlex: Column-Exact ANN-SNN Co-Execution Accelerator with Cost-Guided Scheduling
Varun Manjunath
Pranav Ramesh
Gopalakrishnan Srinivasan
148
0
0
07 Nov 2025
TsetlinKWS: A 65nm 16.58uW, 0.63mm2 State-Driven Convolutional Tsetlin Machine-Based Accelerator For Keyword Spotting
Baizhou Lin
Yuetong Fang
Renjing Xu
Rishad Shafik
Jagmohan Chauhan
135
0
0
28 Oct 2025
From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs
Tianhao Zhu
Dahu Feng
Erhu Feng
Yubin Xia
174
1
0
07 Oct 2025
SparseMap: A Sparse Tensor Accelerator Framework Based on Evolution Strategy
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2025
Boran Zhao
Haiming Zhai
Zihang Yuan
Hetian Liu
Tian Xia
Wenzhe zhao
Pengju Ren
97
1
0
18 Aug 2025
FlexiSAGA: A Flexible Systolic Array GEMM Accelerator for Sparse and Dense Processing
Mika Markus Müller
Konstantin Lubeck
Alexander Louis-Ferdinand Jung
Jannik Steinmetz
Oliver Bringmann
GNN
256
0
0
02 Jun 2025
VUSA: Virtually Upscaled Systolic Array Architecture to Exploit Unstructured Sparsity in AI Acceleration
International Conference on Modern Circuits and Systems Technologies (ICMCST), 2025
Shereef Helal
Alberto García-Ortiz
Lennart Bamberg
205
0
0
01 Jun 2025
Accelerating LLM Inference with Flexible N:M Sparsity via A Fully Digital Compute-in-Memory Accelerator
Akshat Ramachandran
Souvik Kundu
Arnab Raha
Shamik Kundu
Deepak K. Mathaikutty
Tushar Krishna
417
6
0
19 Apr 2025
An Efficient Training Algorithm for Models with Block-wise Sparsity
Ding Zhu
Zhiqun Zuo
Mohammad Mahdi Khalili
306
0
0
27 Mar 2025
Pruning-Based TinyML Optimization of Machine Learning Models for Anomaly Detection in Electric Vehicle Charging Infrastructure
Fatemeh Dehrouyeh
I. Shaer
Soodeh Nikan
F. Badrkhani Ajaei
Abdallah Shami
365
2
0
19 Mar 2025
REDACTOR: eFPGA Redaction for DNN Accelerator Security
IEEE International Symposium on Quality Electronic Design (ISQED), 2025
Yazan Baddour
A. Hedayatipour
Amin Rezaei
220
0
0
30 Jan 2025
Ditto: Accelerating Diffusion Model via Temporal Value Similarity
International Symposium on High-Performance Computer Architecture (HPCA), 2025
Sungbin Kim
Hyunwuk Lee
Wonho Cho
Mincheol Park
Won Woo Ro
536
16
0
20 Jan 2025
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator
International Symposium on High-Performance Computer Architecture (HPCA), 2025
Guoyu Li
Shengyu Ye
Chong Chen
Yang Wang
Fan Yang
Ting Cao
Cheng Liu
Mohamed M. Sabry
Mao Yang
MQ
801
7
0
18 Jan 2025
Energy Backdoor Attack to Deep Neural Networks
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
H. B. Meftah
W. Hamidouche
Sid Ahmed Fezza
Olivier Déforges
Kassem Kallas
AAML
SILM
207
2
0
14 Jan 2025
SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity
Kaidi Wang
Shuo Yang
Shuo Yang
Wenchao Ding
Quan Chen
Chen Chen
Minyi Guo
226
0
0
28 Oct 2024
PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy
Rachmad Vidya Wicaksana Putra
Muhammad Abdullah Hanif
Mohamed Bennai
223
0
0
05 Aug 2024
Vision-based Wearable Steering Assistance for People with Impaired Vision in Jogging
IEEE International Conference on Robotics and Automation (ICRA), 2024
Xiaotong Liu
Sunandan Adhikary
Zhijun Li
308
3
0
01 Aug 2024
Event-based Optical Flow on Neuromorphic Processor: ANN vs. SNN Comparison based on Activation Sparsification
Neural Networks (NN), 2024
Yingfu Xu
Guangzhi Tang
Amirreza Yousefzadeh
Guido de Croon
Manolis Sifalakis
295
11
0
29 Jul 2024
The Magnificent Seven Challenges and Opportunities in Domain-Specific Accelerator Design for Autonomous Systems
Sabrina M. Neuman
Brian Plancher
Vijay Janapa Reddi
179
1
0
24 Jul 2024
SCOPE: Stochastic Cartographic Occupancy Prediction Engine for Uncertainty-Aware Dynamic Navigation
Zhanteng Xie
P. Dames
687
5
0
28 Jun 2024
Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization
Jungi Lee
Wonbeom Lee
Jaewoong Sim
MQ
342
41
0
16 Jun 2024
HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator
Zhewen Yu
Sudarshan Sreeram
Krish Agrawal
Junyi Wu
Alexander Montgomerie-Corcoran
Cheng Zhang
Jianyi Cheng
C. Bouganis
Yiren Zhao
336
2
0
05 Jun 2024
Dual sparse training framework: inducing activation map sparsity via Transformed
ℓ
1
\ell1
ℓ
1
regularization
Xiaolong Yu
Cong Tian
251
3
0
30 May 2024
Neural Network Compression for Reinforcement Learning Tasks
Scientific Reports (Sci Rep), 2024
Dmitry A. Ivanov
D. Larionov
Oleg V. Maslennikov
V. Voevodin
OffRL
AI4CE
320
10
0
13 May 2024
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Xue Geng
Zhe Wang
Chunyun Chen
Qing Xu
Kaixin Xu
...
Zhenghua Chen
M. Aly
Jie Lin
Ruibing Jin
Xiaoli Li
348
9
0
09 May 2024
Sparse Laneformer
Ji Liu
Zifeng Zhang
Mingjie Lu
Hongyang Wei
Dong Li
Yile Xie
Jinzhang Peng
Lu Tian
Ashish Sirasao
E. Barsoum
265
4
0
11 Apr 2024
Lightweight Deep Learning for Resource-Constrained Environments: A Survey
Hou-I Liu
Marco Galindo
Hongxia Xie
Lai-Kuan Wong
Hong-Han Shuai
Yung-Hui Li
Wen-Huang Cheng
428
201
0
08 Apr 2024
The Impact of Uniform Inputs on Activation Sparsity and Energy-Latency Attacks in Computer Vision
Andreas Müller
Erwin Quiring
AAML
308
7
0
27 Mar 2024
FlexNN: A Dataflow-aware Flexible Deep Learning Accelerator for Energy-Efficient Edge Devices
Arnab Raha
Deepak A. Mathaikutty
Soumendu Kumar Ghosh
Shamik Kundu
226
14
0
14 Mar 2024
The SkipSponge Attack: Sponge Weight Poisoning of Deep Neural Networks
Jona te Lintelo
Stefanos Koffas
S. Picek
AAML
423
3
0
09 Feb 2024
Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Abhimanyu Bambhaniya
Amir Yazdanbakhsh
Suvinay Subramanian
Sheng-Chun Kao
Shivani Agrawal
Utku Evci
Tushar Krishna
368
25
0
07 Feb 2024
Activity Sparsity Complements Weight Sparsity for Efficient RNN Inference
Rishav Mukherji
Mark Schöne
Khaleelulla Khan Nazeer
Christian Mayr
Anand Subramoney
337
2
0
13 Nov 2023
SparseLock: Securing Neural Network Models in Deep Learning Accelerators
Nivedita Shrivastava
S. Sarangi
AAML
289
3
0
05 Nov 2023
Efficient Model-Based Deep Learning via Network Pruning and Fine-Tuning
Journal of Mathematical Imaging and Vision (JMIV), 2023
Chicago Y. Park
Weijie Gan
Zihao Zou
Yuyang Hu
Zhixin Sun
Ulugbek S. Kamilov
354
0
0
03 Nov 2023
YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs
International Conference on Compiler Construction (CC), 2023
Cyrus Zhou
Zack Hassman
Ruize Xu
Dhirpal Shah
Vaughn Richard
Yanjing Li
605
5
0
01 Oct 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
359
38
0
27 Aug 2023
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation
Stylianos I. Venieris
Javier Fernandez-Marques
Nicholas D. Lane
MQ
192
4
0
25 Jul 2023
TwinLiteNet: An Efficient and Lightweight Model for Driveable Area and Lane Segmentation in Self-Driving Cars
International Conference on Multimedia Analysis and Pattern Recognition (ICMAPR), 2023
Huy Che Quang
Dinh Phuc Nguyen
Minh Pham
D. Lam
SSeg
539
30
0
20 Jul 2023
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications
ACM Computing Surveys (ACM Comput. Surv.), 2023
Vasileios Leon
Muhammad Abdullah Hanif
Giorgos Armeniakos
Xun Jiao
Mohamed Bennai
K. Pekmestzi
Dimitrios Soudris
353
20
0
20 Jul 2023
Minimizing Energy Consumption of Deep Learning Models by Energy-Aware Training
International Conference on Image Analysis and Processing (ICIAP), 2023
Dario Lazzaro
Antonio Emanuele Cinà
Maura Pintor
Ambra Demontis
Battista Biggio
Fabio Roli
Marcello Pelillo
314
12
0
01 Jul 2023
RAMAN: A Re-configurable and Sparse tinyML Accelerator for Inference on Edge
IEEE Internet of Things Journal (IEEE IoT J.), 2023
Adithya Krishna
Srikanth Rohit Nudurupati
Chandana D G
Pritesh Dwivedi
André van Schaik
M. Mehendale
Chetan Singh Thakur
164
26
0
10 Jun 2023
KAPLA: Pragmatic Representation and Fast Solving of Scalable NN Accelerator Dataflow
Zhiyao Li
Mingyu Gao
164
1
0
09 Jun 2023
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity
Yannan Nellie Wu
Po-An Tsai
Saurav Muralidharan
A. Parashar
Vivienne Sze
J. Emer
312
46
0
22 May 2023
SPADE: Sparse Pillar-based 3D Object Detection Accelerator for Autonomous Driving
International Symposium on High-Performance Computer Architecture (HPCA), 2023
Minjae Lee
Seongmin Park
Hyung-Se Kim
Minyong Yoon
Jangwhan Lee
Junwon Choi
Nam Sung Kim
Mingu Kang
Jungwook Choi
3DPC
317
12
0
12 May 2023
Energy-Latency Attacks to On-Device Neural Networks via Sponge Poisoning
Zijian Wang
Shuo Huang
Yu-Jen Huang
Helei Cui
SILM
246
15
0
06 May 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
337
162
0
27 Feb 2023
Fixflow: A Framework to Evaluate Fixed-point Arithmetic in Light-Weight CNN Inference
Farhad Taheri
Siavash Bayat Sarmadi
H. Mosanaei-Boorani
Reza Taheri
MQ
138
1
0
19 Feb 2023
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
International Symposium on High-Performance Computer Architecture (HPCA), 2023
Geonhwa Jeong
S. Damani
Abhimanyu Bambhaniya
Eric Qin
C. Hughes
S. Subramoney
Hyesoon Kim
T. Krishna
MoE
421
37
0
17 Feb 2023
Workload-Balanced Pruning for Sparse Spiking Neural Networks
IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI), 2023
Ruokai Yin
Youngeun Kim
Yuhang Li
Abhishek Moitra
Nitin Satpute
Anna Hambitzer
Priyadarshini Panda
279
32
0
13 Feb 2023
Bit-balance: Model-Hardware Co-design for Accelerating NNs by Exploiting Bit-level Sparsity
IEEE transactions on computers (IEEE Trans. Comput.), 2023
Wenhao Sun
Zhiwei Zou
Deng Liu
Wendi Sun
Song Chen
Yi Kang
MQ
88
15
0
01 Feb 2023
1
2
3
4
5
6
Next
Page 1 of 6