Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,628 papers shown
Title
MST-compression: Compressing and Accelerating Binary Neural Networks with Minimum Spanning Tree
IEEE International Conference on Computer Vision (ICCV), 2023
Quang Hieu Vo
Linh-Tam Tran
Sung-Ho Bae
Lokwon Kim
Choong Seon Hong
MQ
183
1
0
26 Aug 2023
REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments
Humaid Ahmed Desai
Amr B. Hilal
Hoda Eldardiry
138
2
0
25 Aug 2023
Federated Learning in IoT: a Survey from a Resource-Constrained Perspective
Ishmeet Kaur
168
7
0
25 Aug 2023
Data-Side Efficiencies for Lightweight Convolutional Neural Networks
Bryan Bo Cao
Lawrence O'Gorman
Michael J. Coss
Shubham Jain
140
2
0
24 Aug 2023
Multi-stage feature decorrelation constraints for improving CNN classification performance
ACM Cloud and Autonomic Computing Conference (CAC), 2023
Qiuyu Zhu
Hao Wang
Xuewen Zu
Chengfei Liu
189
1
0
24 Aug 2023
Enhancing Energy-Awareness in Deep Learning through Fine-Grained Energy Measurement
ACM Transactions on Software Engineering and Methodology (TOSEM), 2023
S. Rajput
Tim Widmayer
Ziyuan Shang
M. Kechagia
Federica Sarro
Tushar Sharma
277
8
0
23 Aug 2023
Sampling From Autoencoders' Latent Space via Quantization And Probability Mass Function Concepts
Aymene Mohammed Bouayed
Adrian Iaccovelli
D. Naccache
144
0
0
21 Aug 2023
Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks
IEEE International Conference on Computer Vision (ICCV), 2023
Kaixin Xu
Zhe Wang
Xue Geng
Jie Lin
Ruibing Jin
Xiaoli Li
Weisi Lin
113
20
0
21 Aug 2023
Benchmarking Adversarial Robustness of Compressed Deep Learning Models
Brijesh Vora
Kartik Patwari
Syed Mahbub Hafiz
Zubair Shafiq
Chen-Nee Chuah
AAML
186
3
0
16 Aug 2023
A Survey on Model Compression for Large Language Models
Transactions of the Association for Computational Linguistics (TACL), 2023
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
306
345
0
15 Aug 2023
Ada-QPacknet -- adaptive pruning with bit width reduction as an efficient continual learning method without forgetting
Marcin Pietroñ
Dominik Zurek
Kamil Faber
Roberto Corizzo
CLL
179
2
0
14 Aug 2023
Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training
IEEE International Conference on Computer Vision (ICCV), 2023
Xiao-Ming Wu
Dian Zheng
Zuhao Liu
Weishi Zheng
MQ
279
25
0
13 Aug 2023
Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation
Seyedarmin Azizi
M. Nazemi
A. Fayyazi
Massoud Pedram
MQ
91
5
0
12 Aug 2023
SSL-Auth: An Authentication Framework by Fragile Watermarking for Pre-trained Encoders in Self-supervised Learning
Xiaobei Li
Changchun Yin
Liyue Zhu
Xiaogang Xu
Liming Fang
Run Wang
Chenhao Lin
AAML
294
1
0
09 Aug 2023
Resource Constrained Model Compression via Minimax Optimization for Spiking Neural Networks
ACM Multimedia (ACM MM), 2023
Jue Chen
Huan Yuan
Jianchao Tan
Bin Chen
Chengru Song
Chen Zhang
165
7
0
09 Aug 2023
Lossy and Lossless (L
2
^2
2
) Post-training Model Size Compression
IEEE International Conference on Computer Vision (ICCV), 2023
Yumeng Shi
Shihao Bai
Xiuying Wei
Yazhe Niu
Jianlei Yang
173
5
0
08 Aug 2023
D-Score: A Synapse-Inspired Approach for Filter Pruning
Doyoung Park
Jinsoo Kim
Ji-Min Nam
Jooyoung Chang
S. Park
98
0
0
08 Aug 2023
Pruning a neural network using Bayesian inference
Sunil Mathew
D. Rowe
129
0
0
04 Aug 2023
Survey on Computer Vision Techniques for Internet-of-Things Devices
Ishmeet Kaur
Adwaita Janardhan Jadhav
AI4CE
121
1
0
02 Aug 2023
An Introduction to Bi-level Optimization: Foundations and Applications in Signal Processing and Machine Learning
IEEE Signal Processing Magazine (IEEE Signal Process. Mag.), 2023
Yihua Zhang
Prashant Khanduri
Ioannis C. Tsaknakis
Yuguang Yao
Min-Fong Hong
Sijia Liu
AI4CE
329
46
0
01 Aug 2023
Evaluating Spiking Neural Network On Neuromorphic Platform For Human Activity Recognition
International Workshop on the Semantic Web (SW), 2023
Sizhen Bian
Michele Magno
118
11
0
01 Aug 2023
Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning
IEEE International Conference on Computer Vision (ICCV), 2023
Lingyao Li
Yongfeng Zhang
Xixu Hu
Xingxu Xie
G. Yang
AAML
140
35
0
01 Aug 2023
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy
IEEE International Conference on Computer Vision (ICCV), 2023
Shibo Jie
Haoqing Wang
Zhiwei Deng
170
41
0
31 Jul 2023
Alpha-GPT: Human-AI Interactive Alpha Mining for Quantitative Investment
Saizhuo Wang
Hang Yuan
Leon Zhou
L. Ni
H. Shum
Jian Guo
153
40
0
31 Jul 2023
Stable Adam Optimization for 16-bit Neural Networks Training
Juyoung Yun
85
1
0
30 Jul 2023
Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs
Or Sharir
Anima Anandkumar
119
0
0
27 Jul 2023
Object-based Probabilistic Similarity Evidence of Sparse Latent Features from Fully Convolutional Networks
Cyril Juliani
103
0
0
25 Jul 2023
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation
Stylianos I. Venieris
Javier Fernandez-Marques
Nicholas D. Lane
MQ
163
4
0
25 Jul 2023
An Estimator for the Sensitivity to Perturbations of Deep Neural Networks
Naman Maheshwari
Nicholas Malaya
Scott A. Moe
J. Kulkarni
S. Gurumurthi
AAML
129
0
0
24 Jul 2023
PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion Attacks
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Shiwei Ding
Lan Zhang
Miao Pan
Xiaoyong Yuan
AAML
193
10
0
20 Jul 2023
Communication-Efficient Split Learning via Adaptive Feature-Wise Compression
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yong-Nam Oh
Jaeho Lee
Christopher G. Brinton
Yo-Seb Jeon
MQ
317
15
0
20 Jul 2023
EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization
IEEE International Conference on Computer Vision (ICCV), 2023
Peijie Dong
Lujun Li
Zimian Wei
Xin-Yi Niu
Zhiliang Tian
H. Pan
MQ
215
47
0
20 Jul 2023
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications
ACM Computing Surveys (ACM Comput. Surv.), 2023
Vasileios Leon
Muhammad Abdullah Hanif
Giorgos Armeniakos
Xun Jiao
Mohamed Bennai
K. Pekmestzi
Dimitrios Soudris
233
16
0
20 Jul 2023
TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce Edge
International Conference on Machine Learning (ICML), 2023
Young D. Kwon
Rui Li
Stylianos I. Venieris
Jagmohan Chauhan
Nicholas D. Lane
Cecilia Mascolo
229
20
0
19 Jul 2023
Light-Weight Vision Transformer with Parallel Local and Global Self-Attention
Nikolas Ebert
Laurenz Reichardt
D. Stricker
Oliver Wasenmüller
ViT
224
3
0
18 Jul 2023
Neural Network Pruning as Spectrum Preserving Process
S. Yao
Dantong Yu
I. Koutis
CVBM
111
1
0
18 Jul 2023
UPSCALE: Unconstrained Channel Pruning
International Conference on Machine Learning (ICML), 2023
Alvin Wan
Hanxiang Hao
K. Patnaik
Yueyang Xu
Omer Hadad
David Guera
Zhile Ren
Qi Shan
158
4
0
17 Jul 2023
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks
Haobo Song
Soumajit Majumder
Tao Lin
VLM
256
0
0
16 Jul 2023
A Survey of Techniques for Optimizing Transformer Inference
Journal of systems architecture (JSA), 2023
Krishna Teja Chitty-Venkata
Sparsh Mittal
M. Emani
V. Vishwanath
Arun Somani
235
116
0
16 Jul 2023
TinyTracker: Ultra-Fast and Ultra-Low-Power Edge Vision In-Sensor for Gaze Estimation
Italian National Conference on Sensors (INS), 2023
Pietro Bonazzi
Thomas Rüegg
Sizhen Bian
Yawei Li
Michele Magno
231
19
0
15 Jul 2023
Learning Sparse Neural Networks with Identity Layers
International Conference on Image and Graphics (ICIG), 2023
Mingjian Ni
Guangyao Chen
Xiawu Zheng
Peixi Peng
Liuliang Yuan
Yonghong Tian
152
0
0
14 Jul 2023
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for Ultra-Low-Power Edge Systems
IEEE Access (IEEE Access), 2023
Julian Moosmann
H. Mueller
Nicky Zimmerman
Georg Rutishauser
Luca Benini
Michele Magno
257
14
0
12 Jul 2023
Search-time Efficient Device Constraints-Aware Neural Architecture Search
Pattern Recognition and Machine Intelligence (PRMI), 2023
Oshin Dutta
Tanu Kanvar
Sumeet Agarwal
157
4
0
10 Jul 2023
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication Kernels
Vikas Natesh
Andrew Sabot
H. T. Kung
Mark Ting
150
2
0
08 Jul 2023
Transfer Learning for the Efficient Detection of COVID-19 from Smartphone Audio Data
M. Campana
Franca Delmastro
Elena Pagani
141
20
0
06 Jul 2023
Pruning vs Quantization: Which is Better?
Neural Information Processing Systems (NeurIPS), 2023
Andrey Kuzmin
Markus Nagel
M. V. Baalen
Arash Behboodi
Tijmen Blankevoort
MQ
288
99
0
06 Jul 2023
SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference
Luciano Del Corro
Allison Del Giorno
Sahaj Agarwal
Ting Yu
Ahmed Hassan Awadallah
Subhabrata Mukherjee
264
77
0
05 Jul 2023
Make A Long Image Short: Adaptive Token Length for Vision Transformers
Yuqin Zhu
Yichen Zhu
ViT
193
21
0
05 Jul 2023
Why do CNNs excel at feature extraction? A mathematical explanation
V. Nandakumar
Arush Tagade
Tongliang Liu
FAtt
91
1
0
03 Jul 2023
Structured Network Pruning by Measuring Filter-wise Interactions
Wenting Tang
Xingxing Wei
Yue Liu
158
0
0
03 Jul 2023
Previous
1
2
3
...
15
16
17
...
71
72
73
Next