ResearchTrend.AI
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization
Frederick Tung
S. Muralidharan
Greg Mori
25
35
0
28 Jul 2017
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks
Nikolaos Passalis
Anastasios Tefas
18
70
0
25 Jul 2017
Towards Evolutional Compression
Yunhe Wang
Chang Xu
Jiayan Qiu
Chao Xu
Dacheng Tao
14
14
0
25 Jul 2017
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM
Cong Leng
Hao Li
Shenghuo Zhu
R. L. Jin
MQ
19
286
0
24 Jul 2017
Neuron Pruning for Compressing Deep Networks using Maxout Architectures
Fernando Moya Rueda
René Grzeszick
G. Fink
CVBM
9
17
0
21 Jul 2017
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
Jian-Hao Luo
Jianxin Wu
Weiyao Lin
17
1,740
0
20 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian-jun Sun
32
2,503
0
19 Jul 2017
Pruning Convolutional Neural Networks for Image Instance Retrieval
Gaurav Manek
Jie Lin
V. Chandrasekhar
Ling-Yu Duan
Sateesh Giduthuri
Xiaoli Li
T. Poggio
22
2
0
18 Jul 2017
Fast and Accurate Image Super Resolution by Deep CNN with Skip Connection and Network in Network
Jin Yamanaka
S. Kuwashima
Takio Kurita
SupR
17
213
0
18 Jul 2017
Ternary Residual Networks
Abhisek Kundu
K. Banerjee
Naveen Mellempudi
Dheevatsa Mudigere
Dipankar Das
Bharat Kaul
Pradeep Dubey
12
8
0
15 Jul 2017
Interleaved Group Convolutions for Deep Neural Networks
Ting Zhang
Guo-Jun Qi
Bin Xiao
Jingdong Wang
20
81
0
10 Jul 2017
An Embedded Deep Learning based Word Prediction
Seunghak Yu
Nilesh Kulkarni
Haejun Lee
J. Kim
29
0
0
06 Jul 2017
Model compression as constrained optimization, with application to neural nets. Part I: general framework
Miguel Á. Carreira-Perpiñán
MQ
12
32
0
05 Jul 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian-jun Sun
AI4TS
17
6,772
0
04 Jul 2017
Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations
Yoonho Boo
Wonyong Sung
MQ
14
11
0
01 Jul 2017
Irregular Convolutional Neural Networks
Jiabin Ma
Wei Wang
Liang Wang
23
12
0
24 Jun 2017
Balanced Quantization: An Effective and Efficient Approach to Quantized Neural Networks
Shuchang Zhou
Yuzhi Wang
He Wen
Qinyao He
Yuheng Zou
MQ
14
110
0
22 Jun 2017
MEC: Memory-efficient Convolution for Deep Neural Network
Minsik Cho
D. Brand
6
85
0
21 Jun 2017
Using Convolutional Neural Networks in Robots with Limited Computational Resources: Detecting NAO Robots while Playing Soccer
Nicolás Cruz
Kenzo Lobos-Tsunekawa
Javier Ruiz-del-Solar
19
35
0
20 Jun 2017
An Entropy-based Pruning Method for CNN Compression
Jian-Hao Luo
Jianxin Wu
14
180
0
19 Jun 2017
Sobolev Training for Neural Networks
Wojciech M. Czarnecki
Simon Osindero
Max Jaderberg
G. Swirszcz
Razvan Pascanu
11
240
0
15 Jun 2017
LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation
Abhishek Chaurasia
Eugenio Culurciello
SSeg
16
1,361
0
14 Jun 2017
Getting deep recommenders fit: Bloom embeddings for sparse binary input/output networks
Joan Serra
Alexandros Karatzoglou
12
52
0
13 Jun 2017
SEP-Nets: Small and Effective Pattern Networks
Zhe Li
Xiaoyu Wang
Xutao Lv
Tianbao Yang
19
12
0
13 Jun 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks
Denis A. Gudovskiy
Luca Rigazio
MQ
11
52
0
07 Jun 2017
DeepIoT: Compressing Deep Neural Network Structures for Sensing Systems with a Compressor-Critic Framework
Shuochao Yao
Yiran Zhao
Aston Zhang
Lu Su
Tarek F. Abdelzaher
23
183
0
05 Jun 2017
IDK Cascades: Fast Deep Learning by Learning not to Overthink
Xin Wang
Yujia Luo
D. Crankshaw
Alexey Tumanov
Fisher Yu
Joseph E. Gonzalez
12
107
0
03 Jun 2017
MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU
Qingqing Cao
Niranjan Balasubramanian
A. Balasubramanian
16
61
0
03 Jun 2017
Tensor Contraction Layers for Parsimonious Deep Nets
Jean Kossaifi
Aran Khanna
Zachary Chase Lipton
Tommaso Furlanello
Anima Anandkumar
21
60
0
01 Jun 2017
Deep Mutual Learning
Ying Zhang
Tao Xiang
Timothy M. Hospedales
Huchuan Lu
FedML
21
1,633
0
01 Jun 2017
Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks
Tom Véniat
Ludovic Denoyer
10
21
0
31 May 2017
Computation-Performance Optimization of Convolutional Neural Networks with Redundant Kernel Removal
Chih-Ting Liu
Yi-Heng Wu
Yu-Sheng Lin
Shao-Yi Chien
SupR
14
5
0
30 May 2017
Iterative Machine Teaching
Weiyang Liu
Bo Dai
Ahmad Humayun
C. Tay
Chen Yu
Linda B. Smith
James M. Rehg
Le Song
18
140
0
30 May 2017
GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework
Lei Deng
Peng Jiao
Jing Pei
Zhenzhi Wu
Guoqi Li
MQ
15
20
0
25 May 2017
Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Huizi Mao
Song Han
Jeff Pool
Wenshuo Li
Xingyu Liu
Yu Wang
W. Dally
11
241
0
24 May 2017
Bayesian Compression for Deep Learning
Christos Louizos
Karen Ullrich
Max Welling
UQCV
BDL
15
479
0
24 May 2017
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
A. Parashar
Minsoo Rhu
Anurag Mukkara
A. Puglielli
Rangharajan Venkatesan
Brucek Khailany
J. Emer
S. Keckler
W. Dally
19
1,113
0
23 May 2017
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning
W. Wen
Cong Xu
Feng Yan
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
18
980
0
22 May 2017
Structural Compression of Convolutional Neural Networks
R. Abbasi-Asl
Bin-Xia Yu
17
16
0
20 May 2017
Structured Bayesian Pruning via Log-Normal Multiplicative Noise
Kirill Neklyudov
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
11
188
0
20 May 2017
The High-Dimensional Geometry of Binary Neural Networks
Alexander G. Anderson
C. P. Berg
MQ
19
75
0
19 May 2017
Espresso: Efficient Forward Propagation for BCNNs
Fabrizio Pedersoli
George Tzanetakis
Andrea Tagliasacchi
MQ
11
13
0
19 May 2017
Building effective deep neural network architectures one feature at a time
Martin Mundt
Tobias Weis
K. Konda
Visvanathan Ramesh
17
1
0
18 May 2017
Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling
Xuefeng Xiao
Yafeng Yang
Tasweer Ahmad
Lianwen Jin
Tianhai Chang
18
21
0
15 May 2017
Incremental Learning Through Deep Adaptation
Amir Rosenfeld
John K. Tsotsos
CLL
11
275
0
11 May 2017
Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU
Jacob Devlin
14
36
0
04 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
17
176
0
03 May 2017
Image reconstruction by domain transform manifold learning
Bo Zhu
Jeremiah Zhe Liu
Bruce Rosen
M. Rosen
18
1,515
0
28 Apr 2017
ICNet for Real-Time Semantic Segmentation on High-Resolution Images
Hengshuang Zhao
Xiaojuan Qi
Xiaoyong Shen
Jianping Shi
Jiaya Jia
SSeg
24
1,402
0
27 Apr 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
S. Shi
Xiaowen Chu
13
43
0
25 Apr 2017