1602.01528
Cited By
EIE: Efficient Inference Engine on Compressed Deep Neural Network
4 February 2016
Song Han
Xingyu Liu
Huizi Mao
Jing Pu
A. Pedram
M. Horowitz
W. Dally
Papers citing
"EIE: Efficient Inference Engine on Compressed Deep Neural Network"
50 / 202 papers shown
Title
Event-Based Eye Tracking. 2025 Event-based Vision Workshop
Qinyu Chen
Chang Gao
Min Liu
Daniele Perrone
Yan Ru Pei
...
Hoang M. Truong
Vinh-Thuan Ly
Huy G. Tran
Thuan-Phat Nguyen
Tram T. Doan
43
0
0
25 Apr 2025
A 71.2-μW Speech Recognition Accelerator with Recurrent Spiking Neural Network
Chih-Chyau Yang
Tian-Sheuan Chang
60
1
0
27 Mar 2025
Reservoir Network with Structural Plasticity for Human Activity Recognition
Abdullah M. Zyarah
Alaa M. Abdul-Hadi
Dhireesha Kudithipudi
29
2
0
01 Mar 2025
Advancing Weight and Channel Sparsification with Enhanced Saliency
Xinglong Sun
Maying Shen
Hongxu Yin
Lei Mao
Pavlo Molchanov
Jose M. Alvarez
46
1
0
05 Feb 2025
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator
Guoyu Li
Shengyu Ye
C. L. P. Chen
Yang Wang
Fan Yang
Ting Cao
Cheng Liu
Mohamed M. Sabry
Mao Yang
MQ
125
0
0
18 Jan 2025
DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm² Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-Distortion
Ang Li
Haolin Wu
Yizhuo Wu
Qinyu Chen
Leo C. N. de Vreede
Chang Gao
19
0
0
15 Oct 2024
Structured Pruning for Efficient Visual Place Recognition
Oliver Grainge
Michael Milford
I. Bodala
Sarvapali D. Ramchurn
Shoaib Ehsan
41
1
0
12 Sep 2024
Quality Scalable Quantization Methodology for Deep Learning on Edge
S. Khaliq
Rehan Hafiz
MQ
35
1
0
15 Jul 2024
Learning-Based Heavy Hitters and Flow Frequency Estimation in Streams
Rana Shahout
Michael Mitzenmacher
20
2
0
24 Jun 2024
A Generic Layer Pruning Method for Signal Modulation Recognition Deep Learning Models
Yao Lu
Yutao Zhu
Yuqi Li
Dongwei Xu
Yun Lin
Qi Xuan
Xiaoniu Yang
28
5
0
12 Jun 2024
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
Fang Chen
Gourav Datta
Mujahid Al Rafi
Hyeran Jeon
Meng Tang
91
1
0
06 Jun 2024
Dual sparse training framework: inducing activation map sparsity via Transformed ℓ1 regularization
Xiaolong Yu
Cong Tian
44
0
0
30 May 2024
Neural Network Compression for Reinforcement Learning Tasks
Dmitry A. Ivanov
D. Larionov
Oleg V. Maslennikov
V. Voevodin
OffRL
AI4CE
43
0
0
13 May 2024
Rapid Deployment of DNNs for Edge Computing via Structured Pruning at Initialization
Bailey J. Eccles
Leon Wong
Blesson Varghese
33
2
0
22 Apr 2024
A 1.6-mW Sparse Deep Learning Accelerator for Speech Separation
Chih-Chyau Yang
Tian-Sheuan Chang
26
0
0
15 Dec 2023
Accelerating Convolutional Neural Network Pruning via Spatial Aura Entropy
Bogdan Musat
Razvan Andonie
13
0
0
08 Dec 2023
The Road to On-board Change Detection: A Lightweight Patch-Level Change Detection Network via Exploring the Potential of Pruning and Pooling
Lihui Xue
Zhihao Wang
Xueqian Wang
Gang Li
33
1
0
16 Oct 2023
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications
Vasileios Leon
Muhammad Abdullah Hanif
Giorgos Armeniakos
Xun Jiao
Muhammad Shafique
K. Pekmestzi
Dimitrios Soudris
29
3
0
20 Jul 2023
Minimizing Energy Consumption of Deep Learning Models by Energy-Aware Training
Dario Lazzaro
Antonio Emanuele Cinà
Maura Pintor
Ambra Demontis
Battista Biggio
Fabio Roli
Marcello Pelillo
24
6
0
01 Jul 2023
Group channel pruning and spatial attention distilling for object detection
Yun Chu
Pu Li
Yong Bai
Zhuhua Hu
Yongqing Chen
Jiafeng Lu
VLM
24
13
0
02 Jun 2023
SPADE: Sparse Pillar-based 3D Object Detection Accelerator for Autonomous Driving
Minjae Lee
Seongmin Park
Hyung-Se Kim
Minyong Yoon
Jangwhan Lee
Junwon Choi
Nam Sung Kim
Mingu Kang
Jungwook Choi
3DPC
26
4
0
12 May 2023
Towards Carbon-Neutral Edge Computing: Greening Edge AI by Harnessing Spot and Future Carbon Markets
Huirong Ma
Zhi Zhou
Xiaoxi Zhang
Xu Chen
13
11
0
22 Apr 2023
Tensor Slicing and Optimization for Multicore NPUs
R. Sousa
M. Pereira
Yongin Kwon
Taeho Kim
Namsoon Jung
Chang Soo Kim
Michael Frank
Guido Araujo
10
5
0
06 Apr 2023
Competitive plasticity to reduce the energetic costs of learning
Mark C. W. van Rossum
13
2
0
04 Apr 2023
Physics-aware Roughness Optimization for Diffractive Optical Neural Networks
Shangli Zhou
Yingjie Li
Minhan Lou
Weilu Gao
Zhijie Shi
Cunxi Yu
Caiwen Ding
25
2
0
04 Apr 2023
SR-init: An interpretable layer pruning method
Hui Tang
Yao Lu
Qi Xuan
15
8
0
14 Mar 2023
Workload-Balanced Pruning for Sparse Spiking Neural Networks
Ruokai Yin
Youngeun Kim
Yuhang Li
Abhishek Moitra
Nitin Satpute
Anna Hambitzer
Priyadarshini Panda
23
18
0
13 Feb 2023
A²Q: Aggregation-Aware Quantization for Graph Neural Networks
Zeyu Zhu
Fanrong Li
Zitao Mo
Qinghao Hu
Gang Li
Zejian Liu
Xiaoyao Liang
Jian Cheng
GNN
MQ
18
4
0
01 Feb 2023
Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning
Athul Shibu
Abhishek Kumar
Heechul Jung
Dong-Gyu Lee
9
1
0
26 Jan 2023
Slice-and-Forge: Making Better Use of Caches for Graph Convolutional Network Accelerators
Min-hee Yoo
Jaeyong Song
Hyeyoon Lee
Jounghoo Lee
Namhyung Kim
Youngsok Kim
Jinho Lee
GNN
28
5
0
24 Jan 2023
A Theory of I/O-Efficient Sparse Neural Network Inference
Niels Gleinig
Tal Ben-Nun
Torsten Hoefler
19
0
0
03 Jan 2023
FSCNN: A Fast Sparse Convolution Neural Network Inference System
Bo Ji
Tianyi Chen
18
3
0
17 Dec 2022
Algorithm and Hardware Co-Design of Energy-Efficient LSTM Networks for Video Recognition with Hierarchical Tucker Tensor Decomposition
Yu Gong
Miao Yin
Lingyi Huang
Chunhua Deng
Yang Sui
Bo Yuan
19
6
0
05 Dec 2022
Signed Binary Weight Networks
Sachit Kuhar
Alexey Tumanov
Judy Hoffman
MQ
13
1
0
25 Nov 2022
Improved Projection Learning for Lower Dimensional Feature Maps
Ilan Price
Jared Tanner
16
2
0
27 Oct 2022
Gradient-based Weight Density Balancing for Robust Dynamic Sparse Training
Mathias Parger
Alexander Ertl
Paul Eibensteiner
J. H. Mueller
Martin Winter
M. Steinberger
29
0
0
25 Oct 2022
RSC: Accelerating Graph Neural Networks Training via Randomized Sparse Computations
Zirui Liu
Sheng-Wei Chen
Kaixiong Zhou
Daochen Zha
Xiao Huang
Xia Hu
29
14
0
19 Oct 2022
MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos
Mathias Parger
Chengcheng Tang
Thomas Neff
Christopher D. Twigg
Cem Keskin
Robert Y. Wang
M. Steinberger
19
6
0
18 Oct 2022
Compressed Gastric Image Generation Based on Soft-Label Dataset Distillation for Medical Data Sharing
Guang Li
Ren Togo
Takahiro Ogawa
Miki Haseyama
DD
25
40
0
29 Sep 2022
Efficient Quantized Sparse Matrix Operations on Tensor Cores
Shigang Li
Kazuki Osawa
Torsten Hoefler
72
31
0
14 Sep 2022
Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU
Genghan Zhang
Yuetong Zhao
Yanting Tao
Zhongming Yu
Guohao Dai
Sitao Huang
Yuanyuan Wen
Pavlos Petoumenos
Yu Wang
41
4
0
07 Sep 2022
Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU
Jian-He Liao
Mingzhen Li
Qingxiao Sun
Jiwei Hao
F. Yu
...
Ye Tao
Zicheng Zhang
Hailong Yang
Zhongzhi Luan
D. Qian
21
4
0
06 Sep 2022
Complexity-Driven CNN Compression for Resource-constrained Edge AI
Muhammad Zawish
Steven Davy
L. Abraham
28
16
0
26 Aug 2022
SBPF: Sensitiveness Based Pruning Framework For Convolutional Neural Network On Image Classification
Yihe Lu
Maoguo Gong
Wei Zhao
Kaiyuan Feng
Hao Li
VLM
29
0
0
09 Aug 2022
Implementation Of Tiny Machine Learning Models On Arduino 33 BLE For Gesture And Speech Recognition
V. Viswanatha
Ramachandra A.C
R. Prasanna
Prem Chowdary Kakarla
PJ VivekaSimha
Nishanth Mohan
17
15
0
23 Jul 2022
Associative Memory Based Experience Replay for Deep Reinforcement Learning
Mengyuan Li
Arman Kazemi
Ann Franchesca Laguna
Sharon Hu
VLM
11
8
0
16 Jul 2022
Fault-Tolerant Collaborative Inference through the Edge-PRUNE Framework
Jani Boutellier
Bo Tan
J. Nurmi
16
2
0
16 Jun 2022
QADAM: Quantization-Aware DNN Accelerator Modeling for Pareto-Optimality
A. Inci
Siri Garudanagiri Virupaksha
Aman Jain
Venkata Vivek Thallam
Ruizhou Ding
Diana Marculescu
MQ
22
2
0
20 May 2022
Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards
Youngeun Kwon
Minsoo Rhu
16
27
0
10 May 2022
Multiply-and-Fire (MNF): An Event-driven Sparse Neural Network Accelerator
Miao Yu
Tingting Xiang
Venkata Pavan Kumar Miriyala
Trevor E. Carlson
15
1
0
20 Apr 2022