ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXivPDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown
Title
Wide Compression: Tensor Ring Nets
Wide Compression: Tensor Ring Nets
Wenqi Wang
Yifan Sun
Brian Eriksson
Wenlin Wang
Vaneet Aggarwal
13
168
0
25 Feb 2018
Loss-aware Weight Quantization of Deep Networks
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
15
127
0
23 Feb 2018
Training wide residual networks for deployment using a single bit for
  each weight
Training wide residual networks for deployment using a single bit for each weight
Mark D Mcdonnell
MQ
22
71
0
23 Feb 2018
Approximation Algorithms for Cascading Prediction Models
Approximation Algorithms for Cascading Prediction Models
Matthew J. Streeter
TPM
8
19
0
21 Feb 2018
Building Efficient ConvNets using Redundant Feature Pruning
Building Efficient ConvNets using Redundant Feature Pruning
B. Ayinde
J. Zurada
VLM
3DPC
16
47
0
21 Feb 2018
3LC: Lightweight and Effective Traffic Compression for Distributed
  Machine Learning
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning
Hyeontaek Lim
D. Andersen
M. Kaminsky
11
70
0
21 Feb 2018
The Description Length of Deep Learning Models
The Description Length of Deep Learning Models
Léonard Blier
Yann Ollivier
24
95
0
20 Feb 2018
DeepThin: A Self-Compressing Library for Deep Neural Networks
DeepThin: A Self-Compressing Library for Deep Neural Networks
Matthew Sotoudeh
Sara S. Baghsorkhi
16
4
0
20 Feb 2018
Layer-wise synapse optimization for implementing neural networks on
  general neuromorphic architectures
Layer-wise synapse optimization for implementing neural networks on general neuromorphic architectures
John Mern
Jayesh K. Gupta
Mykel Kochenderfer
30
1
0
20 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on
  Large In-Memory Datasets
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets
Fabian Schuiki
Michael Schaffner
Frank K. Gürkaynak
Luca Benini
21
70
0
19 Feb 2018
Towards Ultra-High Performance and Energy Efficiency of Deep Learning
  Systems: An Algorithm-Hardware Co-Optimization Framework
Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework
Yanzhi Wang
Caiwen Ding
Zhe Li
Geng Yuan
Siyu Liao
...
Bo Yuan
Xuehai Qian
Jian Tang
Qinru Qiu
X. Lin
18
33
0
18 Feb 2018
Efficient Sparse-Winograd Convolutional Neural Networks
Efficient Sparse-Winograd Convolutional Neural Networks
Xingyu Liu
Jeff Pool
Song Han
W. Dally
11
122
0
18 Feb 2018
Towards Principled Design of Deep Convolutional Networks: Introducing
  SimpNet
Towards Principled Design of Deep Convolutional Networks: Introducing SimpNet
S. H. HasanPour
Mohammad Rouhani
Mohsen Fayyaz
Mohammad Sabokrou
Ehsan Adeli
42
45
0
17 Feb 2018
Systematic Weight Pruning of DNNs using Alternating Direction Method of
  Multipliers
Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers
Tianyun Zhang
Shaokai Ye
Yipeng Zhang
Yanzhi Wang
M. Fardad
17
21
0
15 Feb 2018
Model compression via distillation and quantization
Model compression via distillation and quantization
A. Polino
Razvan Pascanu
Dan Alistarh
MQ
17
718
0
15 Feb 2018
Security Analysis and Enhancement of Model Compressed Deep Learning
  Systems under Adversarial Attacks
Security Analysis and Enhancement of Model Compressed Deep Learning Systems under Adversarial Attacks
Qi Liu
Tao Liu
Zihao Liu
Yanzhi Wang
Yier Jin
Wujie Wen
AAML
27
48
0
14 Feb 2018
Paraphrasing Complex Network: Network Compression via Factor Transfer
Paraphrasing Complex Network: Network Compression via Factor Transfer
Jangho Kim
Seonguk Park
Nojun Kwak
16
543
0
14 Feb 2018
SLAQ: Quality-Driven Scheduling for Distributed Machine Learning
SLAQ: Quality-Driven Scheduling for Distributed Machine Learning
Haoyu Zhang
Logan Stafman
Andrew Or
M. Freedman
16
141
0
13 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
19
389
0
13 Feb 2018
Attention-Based Guided Structured Sparsity of Deep Neural Networks
Attention-Based Guided Structured Sparsity of Deep Neural Networks
A. Torfi
Rouzbeh A. Shirvani
Sobhan Soleymani
Nasser M. Nasrabadi
21
23
0
13 Feb 2018
DCFNet: Deep Neural Network with Decomposed Convolutional Filters
DCFNet: Deep Neural Network with Decomposed Convolutional Filters
Qiang Qiu
Xiuyuan Cheng
Robert Calderbank
Guillermo Sapiro
33
69
0
12 Feb 2018
ClosNets: a Priori Sparse Topologies for Faster DNN Training
ClosNets: a Priori Sparse Topologies for Faster DNN Training
Mihailo Isakov
Michel A. Kinsy
CVBM
16
0
0
12 Feb 2018
Edge-Host Partitioning of Deep Neural Networks with Feature Space
  Encoding for Resource-Constrained Internet-of-Things Platforms
Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms
J. Ko
Taesik Na
M. Amir
Saibal Mukhopadhyay
16
148
0
11 Feb 2018
ThUnderVolt: Enabling Aggressive Voltage Underscaling and Timing Error
  Resilience for Energy Efficient Deep Neural Network Accelerators
ThUnderVolt: Enabling Aggressive Voltage Underscaling and Timing Error Resilience for Energy Efficient Deep Neural Network Accelerators
Jeff Zhang
Kartheek Rangineni
Zahra Ghodsi
S. Garg
20
117
0
11 Feb 2018
Analyzing and Mitigating the Impact of Permanent Faults on a Systolic
  Array Based Neural Network Accelerator
Analyzing and Mitigating the Impact of Permanent Faults on a Systolic Array Based Neural Network Accelerator
Jeff Zhang
Tianyu Gu
K. Basu
S. Garg
6
134
0
11 Feb 2018
The Need for Speed of AI Applications: Performance Comparison of Native
  vs. Browser-based Algorithm Implementations
The Need for Speed of AI Applications: Performance Comparison of Native vs. Browser-based Algorithm Implementations
Bernd Malle
Nicola Giuliani
Peter Kieseberg
Andreas Holzinger
8
8
0
11 Feb 2018
On the Universal Approximability and Complexity Bounds of Quantized ReLU
  Neural Networks
On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks
Yukun Ding
Jinglan Liu
Jinjun Xiong
Yiyu Shi
MQ
21
21
0
10 Feb 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li-Jia Li
Song Han
33
1,339
0
10 Feb 2018
Nature vs. Nurture: The Role of Environmental Resources in Evolutionary
  Deep Intelligence
Nature vs. Nurture: The Role of Environmental Resources in Evolutionary Deep Intelligence
A. Chung
Paul Fieguth
A. Wong
14
1
0
09 Feb 2018
Going Deeper in Spiking Neural Networks: VGG and Residual Architectures
Going Deeper in Spiking Neural Networks: VGG and Residual Architectures
Abhronil Sengupta
Yuting Ye
Robert Y. Wang
Chiao Liu
Kaushik Roy
15
978
0
07 Feb 2018
Effective Quantization Approaches for Recurrent Neural Networks
Effective Quantization Approaches for Recurrent Neural Networks
Md. Zahangir Alom
A. Moody
N. Maruyama
B. Van Essen
T. Taha
MQ
8
33
0
07 Feb 2018
CryptoRec: Privacy-preserving Recommendation as a Service
CryptoRec: Privacy-preserving Recommendation as a Service
Jun Wang
Afonso Arriaga
Qiang Tang
Peter Y. A. Ryan
13
3
0
07 Feb 2018
Universal Deep Neural Network Compression
Universal Deep Neural Network Compression
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
81
85
0
07 Feb 2018
Digital Watermarking for Deep Neural Networks
Digital Watermarking for Deep Neural Networks
Yuki Nagai
Yusuke Uchida
S. Sakazawa
Shiníchi Satoh
WIGM
23
143
0
06 Feb 2018
Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT
  Devices
Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices
Ramyad Hadidi
Jiashen Cao
M. Woodward
Michael S. Ryoo
Hyesoon Kim
14
34
0
05 Feb 2018
Learning Compact Neural Networks with Regularization
Learning Compact Neural Networks with Regularization
Samet Oymak
MLT
25
39
0
05 Feb 2018
Recent Advances in Efficient Computation of Deep Convolutional Neural
  Networks
Recent Advances in Efficient Computation of Deep Convolutional Neural Networks
Jian Cheng
Peisong Wang
Gang Li
Qinghao Hu
Hanqing Lu
16
3
0
03 Feb 2018
Build a Compact Binary Neural Network through Bit-level Sensitivity and
  Data Pruning
Build a Compact Binary Neural Network through Bit-level Sensitivity and Data Pruning
Yixing Li
Fengbo Ren
MQ
14
12
0
03 Feb 2018
Intriguing Properties of Randomly Weighted Networks: Generalizing While
  Learning Next to Nothing
Intriguing Properties of Randomly Weighted Networks: Generalizing While Learning Next to Nothing
Amir Rosenfeld
John K. Tsotsos
MLT
24
51
0
02 Feb 2018
VIBNN: Hardware Acceleration of Bayesian Neural Networks
VIBNN: Hardware Acceleration of Bayesian Neural Networks
R. Cai
Ao Ren
Ning Liu
Caiwen Ding
Luhao Wang
Xuehai Qian
Massoud Pedram
Yanzhi Wang
BDL
21
87
0
02 Feb 2018
Adaptive Memory Networks
Adaptive Memory Networks
Daniel Li
Asim Kadav
21
5
0
01 Feb 2018
Alternating Multi-bit Quantization for Recurrent Neural Networks
Alternating Multi-bit Quantization for Recurrent Neural Networks
Chen Xu
Jianqiang Yao
Zhouchen Lin
Wenwu Ou
Yuanbin Cao
Zhirong Wang
H. Zha
MQ
27
115
0
01 Feb 2018
Model compression for faster structural separation of macromolecules
  captured by Cellular Electron Cryo-Tomography
Model compression for faster structural separation of macromolecules captured by Cellular Electron Cryo-Tomography
JiaLiang Guo
Bo Zhou
Xiangrui Zeng
Z. Freyberg
Min Xu
17
10
0
31 Jan 2018
Recovering from Random Pruning: On the Plasticity of Deep Convolutional
  Neural Networks
Recovering from Random Pruning: On the Plasticity of Deep Convolutional Neural Networks
Deepak Mittal
S. Bhardwaj
Mitesh M. Khapra
Balaraman Ravindran
VLM
22
65
0
31 Jan 2018
On Psychoacoustically Weighted Cost Functions Towards Resource-Efficient
  Deep Neural Networks for Speech Denoising
On Psychoacoustically Weighted Cost Functions Towards Resource-Efficient Deep Neural Networks for Speech Denoising
Kai Zhen
Aswin Sivaraman
Jongmo Sung
Minje Kim
9
7
0
29 Jan 2018
Stacked Filters Stationary Flow For Hardware-Oriented Acceleration Of
  Deep Convolutional Neural Networks
Stacked Filters Stationary Flow For Hardware-Oriented Acceleration Of Deep Convolutional Neural Networks
Yuechao Gao
Nianhong Liu
Shenmin Zhang
13
0
0
23 Jan 2018
Learning to Prune Filters in Convolutional Neural Networks
Learning to Prune Filters in Convolutional Neural Networks
Qiangui Huang
S. Kevin Zhou
Suya You
Ulrich Neumann
VLM
23
176
0
23 Jan 2018
Binary output layer of feedforward neural networks for solving
  multi-class classification problems
Binary output layer of feedforward neural networks for solving multi-class classification problems
Sibo Yang
Chao Zhang
Wei Wu
MQ
14
8
0
22 Jan 2018
Bayesian Deep Convolutional Encoder-Decoder Networks for Surrogate
  Modeling and Uncertainty Quantification
Bayesian Deep Convolutional Encoder-Decoder Networks for Surrogate Modeling and Uncertainty Quantification
Yinhao Zhu
N. Zabaras
UQCV
BDL
17
636
0
21 Jan 2018
Toward Scalable Verification for Safety-Critical Deep Networks
Toward Scalable Verification for Safety-Critical Deep Networks
L. Kuper
Guy Katz
Justin Emile Gottschlich
Kyle D. Julian
Clark W. Barrett
Mykel Kochenderfer
29
40
0
18 Jan 2018
Previous
123...626364...676869
Next