arXiv:1510.00149
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,434 papers shown
Title
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization
Frederick Tung
S. Muralidharan
Greg Mori
25
35
0
28 Jul 2017
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks
Nikolaos Passalis
Anastasios Tefas
18
70
0
25 Jul 2017
Towards Evolutional Compression
Yunhe Wang
Chang Xu
Jiayan Qiu
Chao Xu
Dacheng Tao
14
14
0
25 Jul 2017
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM
Cong Leng
Hao Li
Shenghuo Zhu
R. L. Jin
MQ
19
286
0
24 Jul 2017
Neuron Pruning for Compressing Deep Networks using Maxout Architectures
Fernando Moya Rueda
René Grzeszick
G. Fink
CVBM
9
17
0
21 Jul 2017
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
Jian-Hao Luo
Jianxin Wu
Weiyao Lin
17
1,740
0
20 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian-jun Sun
32
2,503
0
19 Jul 2017
Pruning Convolutional Neural Networks for Image Instance Retrieval
Gaurav Manek
Jie Lin
V. Chandrasekhar
Ling-Yu Duan
Sateesh Giduthuri
Xiaoli Li
T. Poggio
22
2
0
18 Jul 2017
Fast and Accurate Image Super Resolution by Deep CNN with Skip Connection and Network in Network
Jin Yamanaka
S. Kuwashima
Takio Kurita
SupR
17
213
0
18 Jul 2017
Ternary Residual Networks
Abhisek Kundu
K. Banerjee
Naveen Mellempudi
Dheevatsa Mudigere
Dipankar Das
Bharat Kaul
Pradeep Dubey
12
8
0
15 Jul 2017
Interleaved Group Convolutions for Deep Neural Networks
Ting Zhang
Guo-Jun Qi
Bin Xiao
Jingdong Wang
20
81
0
10 Jul 2017
An Embedded Deep Learning based Word Prediction
Seunghak Yu
Nilesh Kulkarni
Haejun Lee
J. Kim
29
0
0
06 Jul 2017
Model compression as constrained optimization, with application to neural nets. Part I: general framework
Miguel Á. Carreira-Perpiñán
MQ
12
32
0
05 Jul 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian-jun Sun
AI4TS
17
6,772
0
04 Jul 2017
Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations
Yoonho Boo
Wonyong Sung
MQ
14
11
0
01 Jul 2017
Irregular Convolutional Neural Networks
Jiabin Ma
Wei Wang
Liang Wang
23
12
0
24 Jun 2017
Balanced Quantization: An Effective and Efficient Approach to Quantized Neural Networks
Shuchang Zhou
Yuzhi Wang
He Wen
Qinyao He
Yuheng Zou
MQ
14
110
0
22 Jun 2017
MEC: Memory-efficient Convolution for Deep Neural Network
Minsik Cho
D. Brand
6
85
0
21 Jun 2017
Using Convolutional Neural Networks in Robots with Limited Computational Resources: Detecting NAO Robots while Playing Soccer
Nicolás Cruz
Kenzo Lobos-Tsunekawa
Javier Ruiz-del-Solar
19
35
0
20 Jun 2017
An Entropy-based Pruning Method for CNN Compression
Jian-Hao Luo
Jianxin Wu
14
180
0
19 Jun 2017
Sobolev Training for Neural Networks
Wojciech M. Czarnecki
Simon Osindero
Max Jaderberg
G. Swirszcz
Razvan Pascanu
11
240
0
15 Jun 2017
LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation
Abhishek Chaurasia
Eugenio Culurciello
SSeg
16
1,361
0
14 Jun 2017
Getting deep recommenders fit: Bloom embeddings for sparse binary input/output networks
Joan Serra
Alexandros Karatzoglou
12
52
0
13 Jun 2017
SEP-Nets: Small and Effective Pattern Networks
Zhe Li
Xiaoyu Wang
Xutao Lv
Tianbao Yang
19
12
0
13 Jun 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks
Denis A. Gudovskiy
Luca Rigazio
MQ
11
52
0
07 Jun 2017
DeepIoT: Compressing Deep Neural Network Structures for Sensing Systems with a Compressor-Critic Framework
Shuochao Yao
Yiran Zhao
Aston Zhang
Lu Su
Tarek F. Abdelzaher
23
183
0
05 Jun 2017
IDK Cascades: Fast Deep Learning by Learning not to Overthink
Xin Wang
Yujia Luo
D. Crankshaw
Alexey Tumanov
Fisher Yu
Joseph E. Gonzalez
12
107
0
03 Jun 2017
MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU
Qingqing Cao
Niranjan Balasubramanian
A. Balasubramanian
16
61
0
03 Jun 2017
Tensor Contraction Layers for Parsimonious Deep Nets
Jean Kossaifi
Aran Khanna
Zachary Chase Lipton
Tommaso Furlanello
Anima Anandkumar
21
60
0
01 Jun 2017
Deep Mutual Learning
Ying Zhang
Tao Xiang
Timothy M. Hospedales
Huchuan Lu
FedML
21
1,633
0
01 Jun 2017
Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks
Tom Véniat
Ludovic Denoyer
10
21
0
31 May 2017
Computation-Performance Optimization of Convolutional Neural Networks with Redundant Kernel Removal
Chih-Ting Liu
Yi-Heng Wu
Yu-Sheng Lin
Shao-Yi Chien
SupR
14
5
0
30 May 2017
Iterative Machine Teaching
Weiyang Liu
Bo Dai
Ahmad Humayun
C. Tay
Chen Yu
Linda B. Smith
James M. Rehg
Le Song
18
140
0
30 May 2017
GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework
Lei Deng
Peng Jiao
Jing Pei
Zhenzhi Wu
Guoqi Li
MQ
15
20
0
25 May 2017
Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Huizi Mao
Song Han
Jeff Pool
Wenshuo Li
Xingyu Liu
Yu Wang
W. Dally
11
241
0
24 May 2017
Bayesian Compression for Deep Learning
Christos Louizos
Karen Ullrich
Max Welling
UQCV
BDL
15
479
0
24 May 2017
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
A. Parashar
Minsoo Rhu
Anurag Mukkara
A. Puglielli
Rangharajan Venkatesan
Brucek Khailany
J. Emer
S. Keckler
W. Dally
19
1,113
0
23 May 2017
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning
W. Wen
Cong Xu
Feng Yan
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
18
980
0
22 May 2017
Structural Compression of Convolutional Neural Networks
R. Abbasi-Asl
Bin-Xia Yu
17
16
0
20 May 2017
Structured Bayesian Pruning via Log-Normal Multiplicative Noise
Kirill Neklyudov
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
11
188
0
20 May 2017
The High-Dimensional Geometry of Binary Neural Networks
Alexander G. Anderson
C. P. Berg
MQ
19
75
0
19 May 2017
Espresso: Efficient Forward Propagation for BCNNs
Fabrizio Pedersoli
George Tzanetakis
Andrea Tagliasacchi
MQ
11
13
0
19 May 2017
Building effective deep neural network architectures one feature at a time
Martin Mundt
Tobias Weis
K. Konda
Visvanathan Ramesh
17
1
0
18 May 2017
Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling
Xuefeng Xiao
Yafeng Yang
Tasweer Ahmad
Lianwen Jin
Tianhai Chang
18
21
0
15 May 2017
Incremental Learning Through Deep Adaptation
Amir Rosenfeld
John K. Tsotsos
CLL
11
275
0
11 May 2017
Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU
Jacob Devlin
14
36
0
04 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
17
176
0
03 May 2017
Image reconstruction by domain transform manifold learning
Bo Zhu
Jeremiah Zhe Liu
Bruce Rosen
M. Rosen
18
1,515
0
28 Apr 2017
ICNet for Real-Time Semantic Segmentation on High-Resolution Images
Hengshuang Zhao
Xiaojuan Qi
Xiaoyong Shen
Jianping Shi
Jiaya Jia
SSeg
24
1,402
0
27 Apr 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
S. Shi
Xiaowen Chu
13
43
0
25 Apr 2017