ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,628 papers shown
Title
MST-compression: Compressing and Accelerating Binary Neural Networks
  with Minimum Spanning Tree
MST-compression: Compressing and Accelerating Binary Neural Networks with Minimum Spanning TreeIEEE International Conference on Computer Vision (ICCV), 2023
Quang Hieu Vo
Linh-Tam Tran
Sung-Ho Bae
Lokwon Kim
Choong Seon Hong
MQ
183
1
0
26 Aug 2023
REFT: Resource-Efficient Federated Training Framework for Heterogeneous
  and Resource-Constrained Environments
REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments
Humaid Ahmed Desai
Amr B. Hilal
Hoda Eldardiry
138
2
0
25 Aug 2023
Federated Learning in IoT: a Survey from a Resource-Constrained
  Perspective
Federated Learning in IoT: a Survey from a Resource-Constrained Perspective
Ishmeet Kaur
168
7
0
25 Aug 2023
Data-Side Efficiencies for Lightweight Convolutional Neural Networks
Data-Side Efficiencies for Lightweight Convolutional Neural Networks
Bryan Bo Cao
Lawrence O'Gorman
Michael J. Coss
Shubham Jain
140
2
0
24 Aug 2023
Multi-stage feature decorrelation constraints for improving CNN
  classification performance
Multi-stage feature decorrelation constraints for improving CNN classification performanceACM Cloud and Autonomic Computing Conference (CAC), 2023
Qiuyu Zhu
Hao Wang
Xuewen Zu
Chengfei Liu
189
1
0
24 Aug 2023
Enhancing Energy-Awareness in Deep Learning through Fine-Grained Energy
  Measurement
Enhancing Energy-Awareness in Deep Learning through Fine-Grained Energy MeasurementACM Transactions on Software Engineering and Methodology (TOSEM), 2023
S. Rajput
Tim Widmayer
Ziyuan Shang
M. Kechagia
Federica Sarro
Tushar Sharma
277
8
0
23 Aug 2023
Sampling From Autoencoders' Latent Space via Quantization And
  Probability Mass Function Concepts
Sampling From Autoencoders' Latent Space via Quantization And Probability Mass Function Concepts
Aymene Mohammed Bouayed
Adrian Iaccovelli
D. Naccache
144
0
0
21 Aug 2023
Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep
  Neural Networks
Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural NetworksIEEE International Conference on Computer Vision (ICCV), 2023
Kaixin Xu
Zhe Wang
Xue Geng
Jie Lin
Ruibing Jin
Xiaoli Li
Weisi Lin
113
20
0
21 Aug 2023
Benchmarking Adversarial Robustness of Compressed Deep Learning Models
Benchmarking Adversarial Robustness of Compressed Deep Learning Models
Brijesh Vora
Kartik Patwari
Syed Mahbub Hafiz
Zubair Shafiq
Chen-Nee Chuah
AAML
186
3
0
16 Aug 2023
A Survey on Model Compression for Large Language Models
A Survey on Model Compression for Large Language ModelsTransactions of the Association for Computational Linguistics (TACL), 2023
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
306
345
0
15 Aug 2023
Ada-QPacknet -- adaptive pruning with bit width reduction as an
  efficient continual learning method without forgetting
Ada-QPacknet -- adaptive pruning with bit width reduction as an efficient continual learning method without forgetting
Marcin Pietroñ
Dominik Zurek
Kamil Faber
Roberto Corizzo
CLL
179
2
0
14 Aug 2023
Estimator Meets Equilibrium Perspective: A Rectified Straight Through
  Estimator for Binary Neural Networks Training
Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks TrainingIEEE International Conference on Computer Vision (ICCV), 2023
Xiao-Ming Wu
Dian Zheng
Zuhao Liu
Weishi Zheng
MQ
279
25
0
13 Aug 2023
Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of
  Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation
Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation
Seyedarmin Azizi
M. Nazemi
A. Fayyazi
Massoud Pedram
MQ
91
5
0
12 Aug 2023
SSL-Auth: An Authentication Framework by Fragile Watermarking for
  Pre-trained Encoders in Self-supervised Learning
SSL-Auth: An Authentication Framework by Fragile Watermarking for Pre-trained Encoders in Self-supervised Learning
Xiaobei Li
Changchun Yin
Liyue Zhu
Xiaogang Xu
Liming Fang
Run Wang
Chenhao Lin
AAML
294
1
0
09 Aug 2023
Resource Constrained Model Compression via Minimax Optimization for
  Spiking Neural Networks
Resource Constrained Model Compression via Minimax Optimization for Spiking Neural NetworksACM Multimedia (ACM MM), 2023
Jue Chen
Huan Yuan
Jianchao Tan
Bin Chen
Chengru Song
Chen Zhang
165
7
0
09 Aug 2023
Lossy and Lossless (L$^2$) Post-training Model Size Compression
Lossy and Lossless (L2^22) Post-training Model Size CompressionIEEE International Conference on Computer Vision (ICCV), 2023
Yumeng Shi
Shihao Bai
Xiuying Wei
Yazhe Niu
Jianlei Yang
173
5
0
08 Aug 2023
D-Score: A Synapse-Inspired Approach for Filter Pruning
D-Score: A Synapse-Inspired Approach for Filter Pruning
Doyoung Park
Jinsoo Kim
Ji-Min Nam
Jooyoung Chang
S. Park
98
0
0
08 Aug 2023
Pruning a neural network using Bayesian inference
Pruning a neural network using Bayesian inference
Sunil Mathew
D. Rowe
129
0
0
04 Aug 2023
Survey on Computer Vision Techniques for Internet-of-Things Devices
Survey on Computer Vision Techniques for Internet-of-Things Devices
Ishmeet Kaur
Adwaita Janardhan Jadhav
AI4CE
121
1
0
02 Aug 2023
An Introduction to Bi-level Optimization: Foundations and Applications
  in Signal Processing and Machine Learning
An Introduction to Bi-level Optimization: Foundations and Applications in Signal Processing and Machine LearningIEEE Signal Processing Magazine (IEEE Signal Process. Mag.), 2023
Yihua Zhang
Prashant Khanduri
Ioannis C. Tsaknakis
Yuguang Yao
Min-Fong Hong
Sijia Liu
AI4CE
329
46
0
01 Aug 2023
Evaluating Spiking Neural Network On Neuromorphic Platform For Human
  Activity Recognition
Evaluating Spiking Neural Network On Neuromorphic Platform For Human Activity RecognitionInternational Workshop on the Semantic Web (SW), 2023
Sizhen Bian
Michele Magno
118
11
0
01 Aug 2023
Improving Generalization of Adversarial Training via Robust Critical
  Fine-Tuning
Improving Generalization of Adversarial Training via Robust Critical Fine-TuningIEEE International Conference on Computer Vision (ICCV), 2023
Lingyao Li
Yongfeng Zhang
Xixu Hu
Xingxu Xie
G. Yang
AAML
140
35
0
01 Aug 2023
Revisiting the Parameter Efficiency of Adapters from the Perspective of
  Precision Redundancy
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision RedundancyIEEE International Conference on Computer Vision (ICCV), 2023
Shibo Jie
Haoqing Wang
Zhiwei Deng
170
41
0
31 Jul 2023
Alpha-GPT: Human-AI Interactive Alpha Mining for Quantitative Investment
Alpha-GPT: Human-AI Interactive Alpha Mining for Quantitative Investment
Saizhuo Wang
Hang Yuan
Leon Zhou
L. Ni
H. Shum
Jian Guo
153
40
0
31 Jul 2023
Stable Adam Optimization for 16-bit Neural Networks Training
Juyoung Yun
85
1
0
30 Jul 2023
Incrementally-Computable Neural Networks: Efficient Inference for
  Dynamic Inputs
Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs
Or Sharir
Anima Anandkumar
119
0
0
27 Jul 2023
Object-based Probabilistic Similarity Evidence of Sparse Latent Features
  from Fully Convolutional Networks
Object-based Probabilistic Similarity Evidence of Sparse Latent Features from Fully Convolutional Networks
Cyril Juliani
103
0
0
25 Jul 2023
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights
  Generation
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation
Stylianos I. Venieris
Javier Fernandez-Marques
Nicholas D. Lane
MQ
163
4
0
25 Jul 2023
An Estimator for the Sensitivity to Perturbations of Deep Neural
  Networks
An Estimator for the Sensitivity to Perturbations of Deep Neural Networks
Naman Maheshwari
Nicholas Malaya
Scott A. Moe
J. Kulkarni
S. Gurumurthi
AAML
129
0
0
24 Jul 2023
PATROL: Privacy-Oriented Pruning for Collaborative Inference Against
  Model Inversion Attacks
PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion AttacksIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Shiwei Ding
Lan Zhang
Miao Pan
Xiaoyong Yuan
AAML
193
10
0
20 Jul 2023
Communication-Efficient Split Learning via Adaptive Feature-Wise Compression
Communication-Efficient Split Learning via Adaptive Feature-Wise CompressionIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Yong-Nam Oh
Jaeho Lee
Christopher G. Brinton
Yo-Seb Jeon
MQ
317
15
0
20 Jul 2023
EMQ: Evolving Training-free Proxies for Automated Mixed Precision
  Quantization
EMQ: Evolving Training-free Proxies for Automated Mixed Precision QuantizationIEEE International Conference on Computer Vision (ICCV), 2023
Peijie Dong
Lujun Li
Zimian Wei
Xin-Yi Niu
Zhiliang Tian
H. Pan
MQ
215
47
0
20 Jul 2023
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and ApplicationsACM Computing Surveys (ACM Comput. Surv.), 2023
Vasileios Leon
Muhammad Abdullah Hanif
Giorgos Armeniakos
Xun Jiao
Mohamed Bennai
K. Pekmestzi
Dimitrios Soudris
233
16
0
20 Jul 2023
TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the
  Data-Scarce Edge
TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce EdgeInternational Conference on Machine Learning (ICML), 2023
Young D. Kwon
Rui Li
Stylianos I. Venieris
Jagmohan Chauhan
Nicholas D. Lane
Cecilia Mascolo
229
20
0
19 Jul 2023
Light-Weight Vision Transformer with Parallel Local and Global
  Self-Attention
Light-Weight Vision Transformer with Parallel Local and Global Self-Attention
Nikolas Ebert
Laurenz Reichardt
D. Stricker
Oliver Wasenmüller
ViT
224
3
0
18 Jul 2023
Neural Network Pruning as Spectrum Preserving Process
Neural Network Pruning as Spectrum Preserving Process
S. Yao
Dantong Yu
I. Koutis
CVBM
111
1
0
18 Jul 2023
UPSCALE: Unconstrained Channel Pruning
UPSCALE: Unconstrained Channel PruningInternational Conference on Machine Learning (ICML), 2023
Alvin Wan
Hanxiang Hao
K. Patnaik
Yueyang Xu
Omer Hadad
David Guera
Zhile Ren
Qi Shan
158
4
0
17 Jul 2023
Revisiting Implicit Models: Sparsity Trade-offs Capability in
  Weight-tied Model for Vision Tasks
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks
Haobo Song
Soumajit Majumder
Tao Lin
VLM
256
0
0
16 Jul 2023
A Survey of Techniques for Optimizing Transformer Inference
A Survey of Techniques for Optimizing Transformer InferenceJournal of systems architecture (JSA), 2023
Krishna Teja Chitty-Venkata
Sparsh Mittal
M. Emani
V. Vishwanath
Arun Somani
235
116
0
16 Jul 2023
TinyTracker: Ultra-Fast and Ultra-Low-Power Edge Vision In-Sensor for
  Gaze Estimation
TinyTracker: Ultra-Fast and Ultra-Low-Power Edge Vision In-Sensor for Gaze EstimationItalian National Conference on Sensors (INS), 2023
Pietro Bonazzi
Thomas Rüegg
Sizhen Bian
Yawei Li
Michele Magno
231
19
0
15 Jul 2023
Learning Sparse Neural Networks with Identity Layers
Learning Sparse Neural Networks with Identity LayersInternational Conference on Image and Graphics (ICIG), 2023
Mingjian Ni
Guangyao Chen
Xiawu Zheng
Peixi Peng
Liuliang Yuan
Yonghong Tian
152
0
0
14 Jul 2023
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for
  Ultra-Low-Power Edge Systems
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for Ultra-Low-Power Edge SystemsIEEE Access (IEEE Access), 2023
Julian Moosmann
H. Mueller
Nicky Zimmerman
Georg Rutishauser
Luca Benini
Michele Magno
257
14
0
12 Jul 2023
Search-time Efficient Device Constraints-Aware Neural Architecture
  Search
Search-time Efficient Device Constraints-Aware Neural Architecture SearchPattern Recognition and Machine Intelligence (PRMI), 2023
Oshin Dutta
Tanu Kanvar
Sumeet Agarwal
157
4
0
10 Jul 2023
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication
  Kernels
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication Kernels
Vikas Natesh
Andrew Sabot
H. T. Kung
Mark Ting
150
2
0
08 Jul 2023
Transfer Learning for the Efficient Detection of COVID-19 from
  Smartphone Audio Data
Transfer Learning for the Efficient Detection of COVID-19 from Smartphone Audio Data
M. Campana
Franca Delmastro
Elena Pagani
141
20
0
06 Jul 2023
Pruning vs Quantization: Which is Better?
Pruning vs Quantization: Which is Better?Neural Information Processing Systems (NeurIPS), 2023
Andrey Kuzmin
Markus Nagel
M. V. Baalen
Arash Behboodi
Tijmen Blankevoort
MQ
288
99
0
06 Jul 2023
SkipDecode: Autoregressive Skip Decoding with Batching and Caching for
  Efficient LLM Inference
SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference
Luciano Del Corro
Allison Del Giorno
Sahaj Agarwal
Ting Yu
Ahmed Hassan Awadallah
Subhabrata Mukherjee
264
77
0
05 Jul 2023
Make A Long Image Short: Adaptive Token Length for Vision Transformers
Make A Long Image Short: Adaptive Token Length for Vision Transformers
Yuqin Zhu
Yichen Zhu
ViT
193
21
0
05 Jul 2023
Why do CNNs excel at feature extraction? A mathematical explanation
Why do CNNs excel at feature extraction? A mathematical explanation
V. Nandakumar
Arush Tagade
Tongliang Liu
FAtt
91
1
0
03 Jul 2023
Structured Network Pruning by Measuring Filter-wise Interactions
Structured Network Pruning by Measuring Filter-wise Interactions
Wenting Tang
Xingxing Wei
Yue Liu
158
0
0
03 Jul 2023
Previous
123...151617...717273
Next