SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks

23 May 2017

Rangharajan Venkatesan

Papers citing "SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks"

50 / 296 papers shown

KAN-SAs: Efficient Acceleration of Kolmogorov-Arnold Networks on Systolic Arrays

Sohaib Errabii

Olivier Sentieys

Marcello Traiola

103

20 Nov 2025

NeuroFlex: Column-Exact ANN-SNN Co-Execution Accelerator with Cost-Guided Scheduling

Varun Manjunath

Pranav Ramesh

Gopalakrishnan Srinivasan

151

07 Nov 2025

TsetlinKWS: A 65nm 16.58uW, 0.63mm2 State-Driven Convolutional Tsetlin Machine-Based Accelerator For Keyword Spotting

138

28 Oct 2025

From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs

174

07 Oct 2025

SparseMap: A Sparse Tensor Accelerator Framework Based on Evolution StrategyIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2025

18 Aug 2025

FlexiSAGA: A Flexible Systolic Array GEMM Accelerator for Sparse and Dense Processing

Mika Markus Müller

Konstantin Lubeck

Alexander Louis-Ferdinand Jung

Jannik Steinmetz

Oliver Bringmann

GNN

256

02 Jun 2025

VUSA: Virtually Upscaled Systolic Array Architecture to Exploit Unstructured Sparsity in AI AccelerationInternational Conference on Modern Circuits and Systems Technologies (ICMCST), 2025

Shereef Helal

Alberto García-Ortiz

Lennart Bamberg

205

01 Jun 2025

Accelerating LLM Inference with Flexible N:M Sparsity via A Fully Digital Compute-in-Memory Accelerator

Deepak K. Mathaikutty

Tushar Krishna

418

19 Apr 2025

An Efficient Training Algorithm for Models with Block-wise Sparsity

Ding Zhu

Zhiqun Zuo

Mohammad Mahdi Khalili

308

27 Mar 2025

Pruning-Based TinyML Optimization of Machine Learning Models for Anomaly Detection in Electric Vehicle Charging Infrastructure

365

19 Mar 2025

REDACTOR: eFPGA Redaction for DNN Accelerator SecurityIEEE International Symposium on Quality Electronic Design (ISQED), 2025

Yazan Baddour

A. Hedayatipour

Amin Rezaei

220

30 Jan 2025

Ditto: Accelerating Diffusion Model via Temporal Value SimilarityInternational Symposium on High-Performance Computer Architecture (HPCA), 2025

538

20 Jan 2025

LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning AcceleratorInternational Symposium on High-Performance Computer Architecture (HPCA), 2025

807

18 Jan 2025

Energy Backdoor Attack to Deep Neural NetworksIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

211

14 Jan 2025

SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity

228

28 Oct 2024

PENDRAM: Enabling High-Performance and Energy-Efficient Processing of Deep Neural Networks through a Generalized DRAM Data Mapping Policy

Rachmad Vidya Wicaksana Putra

Muhammad Abdullah Hanif

Mohamed Bennai

223

05 Aug 2024

Vision-based Wearable Steering Assistance for People with Impaired Vision in JoggingIEEE International Conference on Robotics and Automation (ICRA), 2024

Xiaotong Liu

Sunandan Adhikary

Zhijun Li

308

01 Aug 2024

Event-based Optical Flow on Neuromorphic Processor: ANN vs. SNN Comparison based on Activation SparsificationNeural Networks (NN), 2024

295

29 Jul 2024

The Magnificent Seven Challenges and Opportunities in Domain-Specific Accelerator Design for Autonomous Systems

Sabrina M. Neuman

Brian Plancher

Vijay Janapa Reddi

179

24 Jul 2024

SCOPE: Stochastic Cartographic Occupancy Prediction Engine for Uncertainty-Aware Dynamic Navigation

Zhanteng Xie

P. Dames

697

28 Jun 2024

Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization

Jungi Lee

Wonbeom Lee

Jaewoong Sim

342

16 Jun 2024

HASS: Hardware-Aware Sparsity Search for Dataflow DNN Accelerator

Alexander Montgomerie-Corcoran

336

05 Jun 2024

$Dual sparse training framework: inducing activation map sparsity via Transformed $\ell1$ regularization$

Dual sparse training framework: inducing activation map sparsity via Transformed

\ell1

regularization

Xiaolong Yu

Cong Tian

259

30 May 2024

Neural Network Compression for Reinforcement Learning TasksScientific Reports (Sci Rep), 2024

320

13 May 2024

From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural NetworksIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024

Zhe Wang

...

348

09 May 2024

265

11 Apr 2024

Lightweight Deep Learning for Resource-Constrained Environments: A Survey

429

206

08 Apr 2024

The Impact of Uniform Inputs on Activation Sparsity and Energy-Latency Attacks in Computer Vision

Andreas Müller

Erwin Quiring

AAML

311

27 Mar 2024

FlexNN: A Dataflow-aware Flexible Deep Learning Accelerator for Energy-Efficient Edge Devices

Arnab Raha

Deepak A. Mathaikutty

Soumendu Kumar Ghosh

Shamik Kundu

226

14 Mar 2024

The SkipSponge Attack: Sponge Weight Poisoning of Deep Neural Networks

428

09 Feb 2024

Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

369

07 Feb 2024

Activity Sparsity Complements Weight Sparsity for Efficient RNN Inference

Rishav Mukherji

Mark Schöne

Khaleelulla Khan Nazeer

Christian Mayr

Anand Subramoney

345

13 Nov 2023

SparseLock: Securing Neural Network Models in Deep Learning Accelerators

Nivedita Shrivastava

S. Sarangi

AAML

290

05 Nov 2023

Efficient Model-Based Deep Learning via Network Pruning and Fine-TuningJournal of Mathematical Imaging and Vision (JMIV), 2023

Ulugbek S. Kamilov

355

03 Nov 2023

YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUsInternational Conference on Compiler Construction (CC), 2023

606

01 Oct 2023

Computation-efficient Deep Learning for Computer Vision: A Survey

Yulin Wang

Gao Huang

359

27 Aug 2023

Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation

Stylianos I. Venieris

Javier Fernandez-Marques

Nicholas D. Lane

194

25 Jul 2023

TwinLiteNet: An Efficient and Lightweight Model for Driveable Area and Lane Segmentation in Self-Driving CarsInternational Conference on Multimedia Analysis and Pattern Recognition (ICMAPR), 2023

541

20 Jul 2023

Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and ApplicationsACM Computing Surveys (ACM Comput. Surv.), 2023

Vasileios Leon

Muhammad Abdullah Hanif

353

20 Jul 2023

Minimizing Energy Consumption of Deep Learning Models by Energy-Aware TrainingInternational Conference on Image Analysis and Processing (ICIAP), 2023

Dario Lazzaro

Antonio Emanuele Cinà

Maura Pintor

Ambra Demontis

Battista Biggio

Fabio Roli

Marcello Pelillo

314

01 Jul 2023

RAMAN: A Re-configurable and Sparse tinyML Accelerator for Inference on EdgeIEEE Internet of Things Journal (IEEE IoT J.), 2023

Adithya Krishna

Srikanth Rohit Nudurupati

Chandana D G

Pritesh Dwivedi

André van Schaik

M. Mehendale

Chetan Singh Thakur

164

10 Jun 2023

KAPLA: Pragmatic Representation and Fast Solving of Scalable NN Accelerator Dataflow

Zhiyao Li

Mingyu Gao

164

09 Jun 2023

HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity

312

22 May 2023

SPADE: Sparse Pillar-based 3D Object Detection Accelerator for Autonomous DrivingInternational Symposium on High-Performance Computer Architecture (HPCA), 2023

Mingu Kang

318

12 May 2023

Energy-Latency Attacks to On-Device Neural Networks via Sponge Poisoning

247

06 May 2023

Full Stack Optimization of Transformer Inference: a Survey

Sehoon Kim

Coleman Hooper

...

338

166

27 Feb 2023

Fixflow: A Framework to Evaluate Fixed-point Arithmetic in Light-Weight CNN Inference

Farhad Taheri

Siavash Bayat Sarmadi

H. Mosanaei-Boorani

Reza Taheri

138

19 Feb 2023

VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUsInternational Symposium on High-Performance Computer Architecture (HPCA), 2023

424

17 Feb 2023

Workload-Balanced Pruning for Sparse Spiking Neural NetworksIEEE Transactions on Emerging Topics in Computational Intelligence (TETCI), 2023

Ruokai Yin

Youngeun Kim

Yuhang Li

279

13 Feb 2023

Bit-balance: Model-Hardware Co-design for Accelerating NNs by Exploiting Bit-level SparsityIEEE transactions on computers (IEEE Trans. Comput.), 2023

01 Feb 2023