Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
arXiv:1510.00149 (v5, latest)

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,629 papers shown
Distributed Machine Learning for UAV Swarms: Computing, Sensing, and Semantics
IEEE Internet of Things Journal (IEEE IoT J.), 2023
Yahao Ding
Zhaohui Yang
Quoc-Viet Pham
Zhaoyang Zhang
M. Shikh-Bahaei
195
65
0
03 Jan 2023
SAFEMYRIDES: Application of Decentralized Control Edge-Computing to Ridesharing Monitoring Services
S. Elnagar
Manoj A. Thomas
Kweku-Muata A. Osei-Bryson
125
0
0
02 Jan 2023
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
International Conference on Machine Learning (ICML), 2023
Elias Frantar
Dan Alistarh
VLM
560
1,037
0
02 Jan 2023
Holistic Network Virtualization and Pervasive Network Intelligence for 6G
IEEE Communications Surveys and Tutorials (COMST), 2023
Xuemin Shen
Jie Gao
Wen Wu
Mushu Li
Conghao Zhou
W. Zhuang
296
291
0
02 Jan 2023
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning
IEEE Transactions on Industrial Informatics (TII), 2021
Wen Wu
Peng Yang
Weiting Zhang
Conghao Zhou
Xuemin Shen
236
128
0
31 Dec 2022
FlatENN: Train Flat for Enhanced Fault Tolerance of Quantized Deep Neural Networks
Akul Malhotra
S. Gupta
103
0
0
29 Dec 2022
QuickNets: Saving Training and Preventing Overconfidence in Early-Exit Neural Architectures
Devdhar Patel
H. Siegelmann
OnRL
190
1
0
25 Dec 2022
Hyperspherical Quantization: Toward Smaller and More Accurate Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Dan Liu
X. Chen
Chen Ma
Xue Liu
MQ
179
4
0
24 Dec 2022
Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning
Danyang Liu
Xue Liu
131
1
0
24 Dec 2022
Hyperspherical Loss-Aware Ternary Quantization
Dan Liu
Xue Liu
MQ
164
0
0
24 Dec 2022
Exploring Content Relationships for Distilling Efficient GANs
Lizhou You
Mingbao Lin
Tie Hu
Jiayi Ji
Rongrong Ji
160
4
0
21 Dec 2022
Redistribution of Weights and Activations for AdderNet Quantization
Neural Information Processing Systems (NeurIPS), 2022
Ying Nie
Kai Han
Haikang Diao
Chuanjian Liu
Enhua Wu
Yunhe Wang
MQ
252
11
0
20 Dec 2022
The case for 4-bit precision: k-bit Inference Scaling Laws
International Conference on Machine Learning (ICML), 2022
Tim Dettmers
Luke Zettlemoyer
MQ
376
287
0
19 Dec 2022
Training Lightweight Graph Convolutional Networks with Phase-field Models
H. Sahbi
150
0
0
19 Dec 2022
FSCNN: A Fast Sparse Convolution Neural Network Inference System
Bo Ji
Tianyi Chen
140
3
0
17 Dec 2022
Atrous Space Bender U-Net (ASBU-Net/LogiNet)
Anurag Bansal
O. Ostap
Miguel Maestre Trueba
Kristopher Perry
SSeg
224
1
0
16 Dec 2022
Can We Find Strong Lottery Tickets in Generative Models?
AAAI Conference on Artificial Intelligence (AAAI), 2022
Sangyeop Yeo
Yoojin Jang
Jy-yong Sohn
Dongyoon Han
Jaejun Yoo
136
8
0
16 Dec 2022
Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners
Zitian Chen
Songlin Yang
Mingyu Ding
Zhenfang Chen
Hengshuang Zhao
E. Learned-Miller
Chuang Gan
MoE
124
18
0
15 Dec 2022
Towards Hardware-Specific Automatic Compression of Neural Networks
Torben Krieger
Bernhard Klein
Holger Fröning
MQ
151
4
0
15 Dec 2022
Quant 4.0: Engineering Quantitative Investment with Automated, Explainable and Knowledge-driven Artificial Intelligence
Jian Guo
Saizhuo Wang
L. Ni
H. Shum
AIFin
218
16
0
13 Dec 2022
ResFed: Communication Efficient Federated Learning by Transmitting Deep Compressed Residuals
Rui Song
Liguo Zhou
Lingjuan Lyu
Andreas Festag
Alois Knoll
FedML
186
5
0
11 Dec 2022
Statistical guarantees for sparse deep learning
AStA Advances in Statistical Analysis (AStA), 2022
Johannes Lederer
145
14
0
11 Dec 2022
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Hai Wu
Ruifei He
Hao Hao Tan
Xiaojuan Qi
Kaibin Huang
MQ
209
7
0
10 Dec 2022
QVIP: An ILP-based Formal Verification Approach for Quantized Neural Networks
International Conference on Automated Software Engineering (ASE), 2022
Yedi Zhang
Zhe Zhao
Fu Song
Hao Fei
Tao Chen
Jun Sun
165
23
0
10 Dec 2022
Optimizing Learning Rate Schedules for Iterative Pruning of Deep Neural Networks
Shiyu Liu
Rohan Ghosh
John Tan Chong Min
Mehul Motani
238
0
0
09 Dec 2022
Analysis of Deep Learning Architectures and Efficacy of Detecting Forest Fires
Ryan Marinelli
139
0
0
08 Dec 2022
Efficient Stein Variational Inference for Reliable Distribution-lossless Network Pruning
Yingchun Wang
Song Guo
Jingcai Guo
Weizhan Zhang
Yi Tian Xu
Jiewei Zhang
Yi Liu
205
17
0
07 Dec 2022
Slimmable Pruned Neural Networks
Hideaki Kuratsu
Atsuyoshi Nakamura
209
3
0
07 Dec 2022
Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition
International Symposium on Chinese Spoken Language Processing (ISCSLP), 2022
Zhiyuan Peng
Xuanji He
Ke Ding
Tan Lee
Guanglu Wan
163
6
0
06 Dec 2022
QEBVerif: Quantization Error Bound Verification of Neural Networks
International Conference on Computer Aided Verification (CAV), 2022
Yedi Zhang
Fu Song
Jun Sun
MQ
303
12
0
06 Dec 2022
MobileTL: On-device Transfer Learning with Inverted Residual Blocks
AAAI Conference on Artificial Intelligence (AAAI), 2022
HungYueh Chiang
N. Frumkin
Feng Liang
Diana Marculescu
MQ
206
17
0
05 Dec 2022
Distributed Pruning Towards Tiny Neural Networks in Federated Learning
IEEE International Conference on Distributed Computing Systems (ICDCS), 2022
Hong Huang
Lan Zhang
Chaoyue Sun
R. Fang
Xiaoyong Yuan
Dapeng Wu
FedML
216
28
0
05 Dec 2022
Exploiting Kernel Compression on BNNs
Design, Automation and Test in Europe (DATE), 2022
Franyell Silfa
J. Arnau
Antonio González
MQ
170
0
0
01 Dec 2022
Boosted Dynamic Neural Networks
AAAI Conference on Artificial Intelligence (AAAI), 2022
Haichao Yu
Haoxiang Li
G. Hua
Gao Huang
Humphrey Shi
183
15
0
30 Nov 2022
Compressing Volumetric Radiance Fields to 1 MB
Computer Vision and Pattern Recognition (CVPR), 2022
Lingzhi Li
Zhen Shen
Zhongshu Wang
Li Shen
Liefeng Bo
170
84
0
29 Nov 2022
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2022
Yijiang Liu
Huanrui Yang
Zhen Dong
Kurt Keutzer
Li Du
Shanghang Zhang
MQ
233
68
0
29 Nov 2022
Feature-domain Adaptive Contrastive Distillation for Efficient Single Image Super-Resolution
IEEE Access, 2022
Hye-Min Moon
Jinwoo Jeong
Sungjei Kim
205
2
0
29 Nov 2022
On the Effectiveness of Parameter-Efficient Fine-Tuning
AAAI Conference on Artificial Intelligence (AAAI), 2022
Z. Fu
Haoran Yang
Anthony Man-Cho So
Wai Lam
Lidong Bing
Nigel Collier
214
204
0
28 Nov 2022
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
Hongjie Zhang
OffRL
124
0
0
28 Nov 2022
Class-based Quantization for Neural Networks
Design, Automation and Test in Europe (DATE), 2022
Wenhao Sun
Grace Li Zhang
Huaxi Gu
Bing Li
Ulf Schlichtmann
MQ
151
10
0
27 Nov 2022
SteppingNet: A Stepping Neural Network with Incremental Accuracy Enhancement
Design, Automation and Test in Europe (DATE), 2022
Wenhao Sun
Grace Li Zhang
Xunzhao Yin
Cheng Zhuo
Huaxi Gu
Bing Li
Ulf Schlichtmann
137
3
0
27 Nov 2022
Diffusion Probabilistic Model Made Slim
Computer Vision and Pattern Recognition (CVPR), 2022
Xingyi Yang
Daquan Zhou
Jiashi Feng
Xinchao Wang
DiffM
322
131
0
27 Nov 2022
Medical Image Segmentation Review: The success of U-Net
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Reza Azad
Ehsan Khodapanah Aghdam
Amelie Rauland
Yiwei Jia
Atlas Haddadi Avval
Afshin Bozorgpour
Sanaz Karimijafarbigloo
Joseph Paul Cohen
Ehsan Adeli
Dorit Merhof
SSeg
277
637
0
27 Nov 2022
Fast and Efficient Malware Detection with Joint Static and Dynamic Features Through Transfer Learning
International Conference on Applied Cryptography and Network Security (ACNS), 2022
Mao V. Ngo
Tram Truong-Huu
Dima Rabadi
Jia Yi Loo
Sin Gee Teo
80
8
0
25 Nov 2022
Signed Binary Weight Networks
Sachit Kuhar
Alexey Tumanov
Judy Hoffman
MQ
322
1
0
25 Nov 2022
PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization
Neural Information Processing Systems (NeurIPS), 2022
Sanae Lotfi
Marc Finzi
Sanyam Kapoor
Andres Potapczynski
Micah Goldblum
A. Wilson
BDL, MLT, AI4CE
202
75
0
24 Nov 2022
Structural Knowledge Distillation for Object Detection
Neural Information Processing Systems (NeurIPS), 2022
Philip de Rijk
Lukas Schneider
Marius Cordts
D. Gavrila
190
40
0
23 Nov 2022
Join the High Accuracy Club on ImageNet with A Binary Neural Network Ticket
Nianhui Guo
Joseph Bethge
Christoph Meinel
Haojin Yang
MQ
404
23
0
23 Nov 2022
Developmental Plasticity-inspired Adaptive Pruning for Deep Spiking and Artificial Neural Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Bing Han
Feifei Zhao
Yi Zeng
Guobin Shen
162
11
0
23 Nov 2022
FedDCT: Federated Learning of Large Convolutional Neural Networks on Resource Constrained Devices using Divide and Collaborative Training
IEEE Transactions on Network and Service Management (IEEE TNSM), 2022
Quan Nguyen
Hieu H. Pham
Kok-Seng Wong
Phi Le Nguyen
Truong Thao Nguyen
Minh N. Do
FedML
261
9
0
20 Nov 2022