Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown

Title
ReBNet: Residual Binarized Neural Network M. Ghasemzadeh Mohammad Samragh F. Koushanfar MQ 22 4 0 03 Nov 2017
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting Input and Output Sparsity Jingyang Zhu Jingbo Jiang Xizi Chen Chi-Ying Tsui 18 36 0 03 Nov 2017
Compressing Word Embeddings via Deep Compositional Code Learning Raphael Shu Hideki Nakayama 21 129 0 03 Nov 2017
Efficient Inferencing of Compressed Deep Neural Networks Dharma Teja Vooturi Saurabh Goyal Anamitra R. Choudhury Yogish Sabharwal Ashish Verma 16 6 0 01 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks Bohan Zhuang Chunhua Shen Mingkui Tan Lingqiao Liu Ian Reid MQ 23 231 0 01 Nov 2017
Tensorizing Generative Adversarial Nets Xingwei Cao Xuyang Zhao Qibin Zhao GAN 17 9 0 30 Oct 2017
Knowledge Projection for Deep Neural Networks Zhi Zhang G. Ning Zhihai He 28 15 0 26 Oct 2017
Trace norm regularization and faster inference for embedded speech recognition RNNs Markus Kliegl Siddharth Goyal Kexin Zhao Kavya Srinet M. Shoeybi 15 8 0 25 Oct 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks Yu Cheng Duo Wang Pan Zhou Zhang Tao 15 1,085 0 23 Oct 2017
Learning Discrete Weights Using the Local Reparameterization Trick Oran Shayer Dan Levi Ethan Fetaya 11 88 0 21 Oct 2017
Data-Free Knowledge Distillation for Deep Neural Networks Raphael Gontijo-Lopes Stefano Fenu Thad Starner 12 270 0 19 Oct 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization D. Loroch Norbert Wehn Franz-Josef Pfreundt J. Keuper MQ 25 23 0 13 Oct 2017
STDP Based Pruning of Connections and Weight Quantization in Spiking Neural Networks for Energy Efficient Recognition Nitin Rathi Priyadarshini Panda Kaushik Roy 16 111 0 12 Oct 2017
Energy-efficient Amortized Inference with Cascaded Deep Classifiers Jiaqi Guan Yang Liu Qiang Liu Jian-wei Peng 14 33 0 10 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial Mingzhe Chen Ursula Challita Walid Saad Changchuan Yin Mérouane Debbah 15 207 0 09 Oct 2017
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with Small Deep-Neural-Network Architectures F. Iandola Kurt Keutzer 15 37 0 07 Oct 2017
To prune, or not to prune: exploring the efficacy of pruning for model compression Michael Zhu Suyog Gupta 37 1,248 0 05 Oct 2017
Improving Efficiency in Convolutional Neural Network with Multilinear Filters D. Tran Alexandros Iosifidis M. Gabbouj 16 40 0 28 Sep 2017
Connectivity Learning in Multi-Branch Networks Karim Ahmed Lorenzo Torresani 16 26 0 27 Sep 2017
Machine Learning Models that Remember Too Much Congzheng Song Thomas Ristenpart Vitaly Shmatikov VLM 16 502 0 22 Sep 2017
Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design Zhourui Song Zhenyu Liu Dongsheng Wang 13 41 0 22 Sep 2017
Structured Probabilistic Pruning for Convolutional Neural Network Acceleration Huan Wang Qiming Zhang Yuehai Wang Roland Hu 13 11 0 20 Sep 2017
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks Julian Faraone Nicholas J. Fraser Giulio Gambardella Michaela Blott Philip H. W. Leong MQ UQCV 13 12 0 19 Sep 2017
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning A. Ashok Nicholas Rhinehart Fares N. Beainy Kris M. Kitani 16 169 0 18 Sep 2017
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage Requirement Tianchan Guan Xiaoyang Zeng Mingoo Seok MQ 20 6 0 15 Sep 2017
A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications Yuan Du Li Du Yilei Li Junjie Su Mau-Chung Frank Chang 14 6 0 15 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory W. Wen Yuxiong He Samyam Rajbhandari Minjia Zhang Wenhan Wang Fang Liu Bin Hu Yiran Chen H. Li MQ 21 140 0 15 Sep 2017
Supervising Unsupervised Learning Vikas K. Garg Adam Kalai SSL FedML 16 29 0 14 Sep 2017
Binary-decomposed DCNN for accelerating computation and compressing model without retraining Ryuji Kamiya Takayoshi Yamashita Mitsuru Ambai Ikuro Sato Yuji Yamauchi H. Fujiyoshi MQ 12 4 0 14 Sep 2017
Flexible Network Binarization with Layer-wise Priority He Wang Yi Tian Xu Bingbing Ni Hongteng Xu MQ 23 10 0 13 Sep 2017
Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification Chong-Jun Wang Xipeng Lan Yang Zhang CVBM 15 26 0 09 Sep 2017
Real-time convolutional networks for sonar image classification in low-power embedded systems Matias Valdenegro-Toro 23 10 0 07 Sep 2017
The Mating Rituals of Deep Neural Networks: Learning Compact Feature Representations through Sexual Evolutionary Synthesis A. Chung M. Shafiee Paul Fieguth A. Wong 10 4 0 07 Sep 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks Surat Teerapittayanon Bradley McDanel H. T. Kung UQCV 11 1,109 0 06 Sep 2017
Domain-adaptive deep network compression Marc Masana Joost van de Weijer Luis Herranz Andrew D. Bagdanov J. Álvarez 36 62 0 04 Sep 2017
Fast Image Processing with Fully-Convolutional Networks Qifeng Chen Jia Xu V. Koltun 10 322 0 02 Sep 2017
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network Simulation Expansion and STDP Convergence Predictions Toby Lightheart S. Grainger Tien-Fu Lu 8 0 0 30 Aug 2017
Performance Guaranteed Network Acceleration via High-Order Residual Quantization Zefan Li Bingbing Ni Wenjun Zhang Xiaokang Yang Wen Gao MQ 16 105 0 29 Aug 2017
CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices Caiwen Ding Siyu Liao Yanzhi Wang Zhe Li Ning Liu ... Yipeng Zhang Jian Tang Qinru Qiu X. Lin Bo Yuan GNN 19 258 0 29 Aug 2017
Deep Learning Sparse Ternary Projections for Compressed Sensing of Images Duc Minh Nguyen Evaggelia Tsiligianni Nikos Deligiannis 13 26 0 28 Aug 2017
The Convergence of Machine Learning and Communications Wojciech Samek S. Stańczak Thomas Wiegand AI4CE 24 29 0 28 Aug 2017
Learning Efficient Convolutional Networks through Network Slimming Zhuang Liu Jianguo Li Zhiqiang Shen Gao Huang Shoumeng Yan Changshui Zhang 24 2,383 0 22 Aug 2017
Neural Networks Compression for Language Modeling Artem M. Grachev D. Ignatov Andrey V. Savchenko 14 30 0 20 Aug 2017
Deep Neural Network Capacity Aosen Wang Huan Zhou Wenyao Xu Xin Chen 11 4 0 16 Aug 2017
BitNet: Bit-Regularized Deep Neural Networks Aswin Raghavan Mohamed R. Amer S. Chai Graham Taylor MQ 22 10 0 16 Aug 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices Dawei Li Xiaolong Wang Deguang Kong 15 97 0 16 Aug 2017
Enabling Massive Deep Neural Networks with the GraphBLAS J. Kepner Manoj Kumar José Moreira P. Pattnaik M. Serrano H. Tufo GNN 14 33 0 09 Aug 2017
Prune the Convolutional Neural Networks with Sparse Shrink X. Li Changsong Liu CVBM 11 4 0 08 Aug 2017
Natural Language Processing with Small Feed-Forward Networks Jan A. Botha Emily Pitler Ji Ma A. Bakalov Alexandru Salcianu David J. Weiss Ryan T. McDonald Slav Petrov HAI 17 38 0 01 Aug 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform Chaim Baskin Natan Liss Evgenii Zheltonozhskii A. Bronstein A. Mendelson GNN MQ 28 35 0 31 Jul 2017