Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown

Title | Authors | Tags | Counts | Date
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization | Frederick Tung, S. Muralidharan, Greg Mori | | 25 35 0 | 28 Jul 2017
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks | Nikolaos Passalis, Anastasios Tefas | | 18 70 0 | 25 Jul 2017
Towards Evolutional Compression | Yunhe Wang, Chang Xu, Jiayan Qiu, Chao Xu, Dacheng Tao | | 14 14 0 | 25 Jul 2017
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM | Cong Leng, Hao Li, Shenghuo Zhu, R. L. Jin | MQ | 19 286 0 | 24 Jul 2017
Neuron Pruning for Compressing Deep Networks using Maxout Architectures | Fernando Moya Rueda, René Grzeszick, G. Fink | CVBM | 9 17 0 | 21 Jul 2017
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression | Jian-Hao Luo, Jianxin Wu, Weiyao Lin | | 17 1,740 0 | 20 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks | Yihui He, Xiangyu Zhang, Jian-jun Sun | | 32 2,503 0 | 19 Jul 2017
Pruning Convolutional Neural Networks for Image Instance Retrieval | Gaurav Manek, Jie Lin, V. Chandrasekhar, Ling-Yu Duan, Sateesh Giduthuri, Xiaoli Li, T. Poggio | | 22 2 0 | 18 Jul 2017
Fast and Accurate Image Super Resolution by Deep CNN with Skip Connection and Network in Network | Jin Yamanaka, S. Kuwashima, Takio Kurita | SupR | 17 213 0 | 18 Jul 2017
Ternary Residual Networks | Abhisek Kundu, K. Banerjee, Naveen Mellempudi, Dheevatsa Mudigere, Dipankar Das, Bharat Kaul, Pradeep Dubey | | 12 8 0 | 15 Jul 2017
Interleaved Group Convolutions for Deep Neural Networks | Ting Zhang, Guo-Jun Qi, Bin Xiao, Jingdong Wang | | 20 81 0 | 10 Jul 2017
An Embedded Deep Learning based Word Prediction | Seunghak Yu, Nilesh Kulkarni, Haejun Lee, J. Kim | | 29 0 0 | 06 Jul 2017
Model compression as constrained optimization, with application to neural nets. Part I: general framework | Miguel Á. Carreira-Perpiñán | MQ | 12 32 0 | 05 Jul 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices | Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, Jian-jun Sun | AI4TS | 17 6,772 0 | 04 Jul 2017
Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations | Yoonho Boo, Wonyong Sung | MQ | 14 11 0 | 01 Jul 2017
Irregular Convolutional Neural Networks | Jiabin Ma, Wei Wang, Liang Wang | | 23 12 0 | 24 Jun 2017
Balanced Quantization: An Effective and Efficient Approach to Quantized Neural Networks | Shuchang Zhou, Yuzhi Wang, He Wen, Qinyao He, Yuheng Zou | MQ | 14 110 0 | 22 Jun 2017
MEC: Memory-efficient Convolution for Deep Neural Network | Minsik Cho, D. Brand | | 6 85 0 | 21 Jun 2017
Using Convolutional Neural Networks in Robots with Limited Computational Resources: Detecting NAO Robots while Playing Soccer | Nicolás Cruz, Kenzo Lobos-Tsunekawa, Javier Ruiz-del-Solar | | 19 35 0 | 20 Jun 2017
An Entropy-based Pruning Method for CNN Compression | Jian-Hao Luo, Jianxin Wu | | 14 180 0 | 19 Jun 2017
Sobolev Training for Neural Networks | Wojciech M. Czarnecki, Simon Osindero, Max Jaderberg, G. Swirszcz, Razvan Pascanu | | 11 240 0 | 15 Jun 2017
LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation | Abhishek Chaurasia, Eugenio Culurciello | SSeg | 16 1,361 0 | 14 Jun 2017
Getting deep recommenders fit: Bloom embeddings for sparse binary input/output networks | Joan Serra, Alexandros Karatzoglou | | 12 52 0 | 13 Jun 2017
SEP-Nets: Small and Effective Pattern Networks | Zhe Li, Xiaoyu Wang, Xutao Lv, Tianbao Yang | | 19 12 0 | 13 Jun 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks | Denis A. Gudovskiy, Luca Rigazio | MQ | 11 52 0 | 07 Jun 2017
DeepIoT: Compressing Deep Neural Network Structures for Sensing Systems with a Compressor-Critic Framework | Shuochao Yao, Yiran Zhao, Aston Zhang, Lu Su, Tarek F. Abdelzaher | | 23 183 0 | 05 Jun 2017
IDK Cascades: Fast Deep Learning by Learning not to Overthink | Xin Wang, Yujia Luo, D. Crankshaw, Alexey Tumanov, Fisher Yu, Joseph E. Gonzalez | | 12 107 0 | 03 Jun 2017
MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU | Qingqing Cao, Niranjan Balasubramanian, A. Balasubramanian | | 16 61 0 | 03 Jun 2017
Tensor Contraction Layers for Parsimonious Deep Nets | Jean Kossaifi, Aran Khanna, Zachary Chase Lipton, Tommaso Furlanello, Anima Anandkumar | | 21 60 0 | 01 Jun 2017
Deep Mutual Learning | Ying Zhang, Tao Xiang, Timothy M. Hospedales, Huchuan Lu | FedML | 21 1,633 0 | 01 Jun 2017
Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks | Tom Véniat, Ludovic Denoyer | | 10 21 0 | 31 May 2017
Computation-Performance Optimization of Convolutional Neural Networks with Redundant Kernel Removal | Chih-Ting Liu, Yi-Heng Wu, Yu-Sheng Lin, Shao-Yi Chien | SupR | 14 5 0 | 30 May 2017
Iterative Machine Teaching | Weiyang Liu, Bo Dai, Ahmad Humayun, C. Tay, Chen Yu, Linda B. Smith, James M. Rehg, Le Song | | 18 140 0 | 30 May 2017
GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework | Lei Deng, Peng Jiao, Jing Pei, Zhenzhi Wu, Guoqi Li | MQ | 15 20 0 | 25 May 2017
Exploring the Regularity of Sparse Structure in Convolutional Neural Networks | Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, W. Dally | | 11 241 0 | 24 May 2017
Bayesian Compression for Deep Learning | Christos Louizos, Karen Ullrich, Max Welling | UQCV BDL | 15 479 0 | 24 May 2017
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks | A. Parashar, Minsoo Rhu, Anurag Mukkara, A. Puglielli, Rangharajan Venkatesan, Brucek Khailany, J. Emer, S. Keckler, W. Dally | | 19 1,113 0 | 23 May 2017
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning | W. Wen, Cong Xu, Feng Yan, Chunpeng Wu, Yandan Wang, Yiran Chen, Hai Helen Li | | 18 980 0 | 22 May 2017
Structural Compression of Convolutional Neural Networks | R. Abbasi-Asl, Bin-Xia Yu | | 17 16 0 | 20 May 2017
Structured Bayesian Pruning via Log-Normal Multiplicative Noise | Kirill Neklyudov, Dmitry Molchanov, Arsenii Ashukha, Dmitry Vetrov | BDL | 11 188 0 | 20 May 2017
The High-Dimensional Geometry of Binary Neural Networks | Alexander G. Anderson, C. P. Berg | MQ | 19 75 0 | 19 May 2017
Espresso: Efficient Forward Propagation for BCNNs | Fabrizio Pedersoli, George Tzanetakis, Andrea Tagliasacchi | MQ | 11 13 0 | 19 May 2017
Building effective deep neural network architectures one feature at a time | Martin Mundt, Tobias Weis, K. Konda, Visvanathan Ramesh | | 17 1 0 | 18 May 2017
Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling | Xuefeng Xiao, Yafeng Yang, Tasweer Ahmad, Lianwen Jin, Tianhai Chang | | 18 21 0 | 15 May 2017
Incremental Learning Through Deep Adaptation | Amir Rosenfeld, John K. Tsotsos | CLL | 11 275 0 | 11 May 2017
Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU | Jacob Devlin | | 14 36 0 | 04 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks | Minsoo Rhu, Mike O'Connor, Niladrish Chatterjee, Jeff Pool, S. Keckler | | 17 176 0 | 03 May 2017
Image reconstruction by domain transform manifold learning | Bo Zhu, Jeremiah Zhe Liu, Bruce Rosen, M. Rosen | | 18 1,515 0 | 28 Apr 2017
ICNet for Real-Time Semantic Segmentation on High-Resolution Images | Hengshuang Zhao, Xiaojuan Qi, Xiaoyong Shen, Jianping Shi, Jiaya Jia | SSeg | 24 1,402 0 | 27 Apr 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units | S. Shi, Xiaowen Chu | | 13 43 0 | 25 Apr 2017