Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown

Title
Wide Compression: Tensor Ring Nets Wenqi Wang Yifan Sun Brian Eriksson Wenlin Wang Vaneet Aggarwal 13 168 0 25 Feb 2018
Loss-aware Weight Quantization of Deep Networks Lu Hou James T. Kwok MQ 15 127 0 23 Feb 2018
Training wide residual networks for deployment using a single bit for each weight Mark D Mcdonnell MQ 22 71 0 23 Feb 2018
Approximation Algorithms for Cascading Prediction Models Matthew J. Streeter TPM 8 19 0 21 Feb 2018
Building Efficient ConvNets using Redundant Feature Pruning B. Ayinde J. Zurada VLM 3DPC 16 47 0 21 Feb 2018
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning Hyeontaek Lim D. Andersen M. Kaminsky 11 70 0 21 Feb 2018
The Description Length of Deep Learning Models Léonard Blier Yann Ollivier 24 95 0 20 Feb 2018
DeepThin: A Self-Compressing Library for Deep Neural Networks Matthew Sotoudeh Sara S. Baghsorkhi 16 4 0 20 Feb 2018
Layer-wise synapse optimization for implementing neural networks on general neuromorphic architectures John Mern Jayesh K. Gupta Mykel Kochenderfer 30 1 0 20 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets Fabian Schuiki Michael Schaffner Frank K. Gürkaynak Luca Benini 21 70 0 19 Feb 2018
Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework Yanzhi Wang Caiwen Ding Zhe Li Geng Yuan Siyu Liao ... Bo Yuan Xuehai Qian Jian Tang Qinru Qiu X. Lin 18 33 0 18 Feb 2018
Efficient Sparse-Winograd Convolutional Neural Networks Xingyu Liu Jeff Pool Song Han W. Dally 11 122 0 18 Feb 2018
Towards Principled Design of Deep Convolutional Networks: Introducing SimpNet S. H. HasanPour Mohammad Rouhani Mohsen Fayyaz Mohammad Sabokrou Ehsan Adeli 42 45 0 17 Feb 2018
Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers Tianyun Zhang Shaokai Ye Yipeng Zhang Yanzhi Wang M. Fardad 17 21 0 15 Feb 2018
Model compression via distillation and quantization A. Polino Razvan Pascanu Dan Alistarh MQ 17 718 0 15 Feb 2018
Security Analysis and Enhancement of Model Compressed Deep Learning Systems under Adversarial Attacks Qi Liu Tao Liu Zihao Liu Yanzhi Wang Yier Jin Wujie Wen AAML 27 48 0 14 Feb 2018
Paraphrasing Complex Network: Network Compression via Factor Transfer Jangho Kim Seonguk Park Nojun Kwak 16 543 0 14 Feb 2018
SLAQ: Quality-Driven Scheduling for Distributed Machine Learning Haoyu Zhang Logan Stafman Andrew Or M. Freedman 16 141 0 13 Feb 2018
Training and Inference with Integers in Deep Neural Networks Shuang Wu Guoqi Li F. Chen Luping Shi MQ 19 389 0 13 Feb 2018
Attention-Based Guided Structured Sparsity of Deep Neural Networks A. Torfi Rouzbeh A. Shirvani Sobhan Soleymani Nasser M. Nasrabadi 21 23 0 13 Feb 2018
DCFNet: Deep Neural Network with Decomposed Convolutional Filters Qiang Qiu Xiuyuan Cheng Robert Calderbank Guillermo Sapiro 33 69 0 12 Feb 2018
ClosNets: a Priori Sparse Topologies for Faster DNN Training Mihailo Isakov Michel A. Kinsy CVBM 16 0 0 12 Feb 2018
Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms J. Ko Taesik Na M. Amir Saibal Mukhopadhyay 16 148 0 11 Feb 2018
ThUnderVolt: Enabling Aggressive Voltage Underscaling and Timing Error Resilience for Energy Efficient Deep Neural Network Accelerators Jeff Zhang Kartheek Rangineni Zahra Ghodsi S. Garg 20 117 0 11 Feb 2018
Analyzing and Mitigating the Impact of Permanent Faults on a Systolic Array Based Neural Network Accelerator Jeff Zhang Tianyu Gu K. Basu S. Garg 6 134 0 11 Feb 2018
The Need for Speed of AI Applications: Performance Comparison of Native vs. Browser-based Algorithm Implementations Bernd Malle Nicola Giuliani Peter Kieseberg Andreas Holzinger 8 8 0 11 Feb 2018
On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks Yukun Ding Jinglan Liu Jinjun Xiong Yiyu Shi MQ 21 21 0 10 Feb 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices Yihui He Ji Lin Zhijian Liu Hanrui Wang Li-Jia Li Song Han 33 1,339 0 10 Feb 2018
Nature vs. Nurture: The Role of Environmental Resources in Evolutionary Deep Intelligence A. Chung Paul Fieguth A. Wong 14 1 0 09 Feb 2018
Going Deeper in Spiking Neural Networks: VGG and Residual Architectures Abhronil Sengupta Yuting Ye Robert Y. Wang Chiao Liu Kaushik Roy 15 978 0 07 Feb 2018
Effective Quantization Approaches for Recurrent Neural Networks Md. Zahangir Alom A. Moody N. Maruyama B. Van Essen T. Taha MQ 8 33 0 07 Feb 2018
CryptoRec: Privacy-preserving Recommendation as a Service Jun Wang Afonso Arriaga Qiang Tang Peter Y. A. Ryan 13 3 0 07 Feb 2018
Universal Deep Neural Network Compression Yoojin Choi Mostafa El-Khamy Jungwon Lee MQ 81 85 0 07 Feb 2018
Digital Watermarking for Deep Neural Networks Yuki Nagai Yusuke Uchida S. Sakazawa Shiníchi Satoh WIGM 23 143 0 06 Feb 2018
Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices Ramyad Hadidi Jiashen Cao M. Woodward Michael S. Ryoo Hyesoon Kim 14 34 0 05 Feb 2018
Learning Compact Neural Networks with Regularization Samet Oymak MLT 25 39 0 05 Feb 2018
Recent Advances in Efficient Computation of Deep Convolutional Neural Networks Jian Cheng Peisong Wang Gang Li Qinghao Hu Hanqing Lu 16 3 0 03 Feb 2018
Build a Compact Binary Neural Network through Bit-level Sensitivity and Data Pruning Yixing Li Fengbo Ren MQ 14 12 0 03 Feb 2018
Intriguing Properties of Randomly Weighted Networks: Generalizing While Learning Next to Nothing Amir Rosenfeld John K. Tsotsos MLT 24 51 0 02 Feb 2018
VIBNN: Hardware Acceleration of Bayesian Neural Networks R. Cai Ao Ren Ning Liu Caiwen Ding Luhao Wang Xuehai Qian Massoud Pedram Yanzhi Wang BDL 21 87 0 02 Feb 2018
Adaptive Memory Networks Daniel Li Asim Kadav 21 5 0 01 Feb 2018
Alternating Multi-bit Quantization for Recurrent Neural Networks Chen Xu Jianqiang Yao Zhouchen Lin Wenwu Ou Yuanbin Cao Zhirong Wang H. Zha MQ 27 115 0 01 Feb 2018
Model compression for faster structural separation of macromolecules captured by Cellular Electron Cryo-Tomography JiaLiang Guo Bo Zhou Xiangrui Zeng Z. Freyberg Min Xu 17 10 0 31 Jan 2018
Recovering from Random Pruning: On the Plasticity of Deep Convolutional Neural Networks Deepak Mittal S. Bhardwaj Mitesh M. Khapra Balaraman Ravindran VLM 22 65 0 31 Jan 2018
On Psychoacoustically Weighted Cost Functions Towards Resource-Efficient Deep Neural Networks for Speech Denoising Kai Zhen Aswin Sivaraman Jongmo Sung Minje Kim 9 7 0 29 Jan 2018
Stacked Filters Stationary Flow For Hardware-Oriented Acceleration Of Deep Convolutional Neural Networks Yuechao Gao Nianhong Liu Shenmin Zhang 13 0 0 23 Jan 2018
Learning to Prune Filters in Convolutional Neural Networks Qiangui Huang S. Kevin Zhou Suya You Ulrich Neumann VLM 23 176 0 23 Jan 2018
Binary output layer of feedforward neural networks for solving multi-class classification problems Sibo Yang Chao Zhang Wei Wu MQ 14 8 0 22 Jan 2018
Bayesian Deep Convolutional Encoder-Decoder Networks for Surrogate Modeling and Uncertainty Quantification Yinhao Zhu N. Zabaras UQCV BDL 17 636 0 21 Jan 2018
Toward Scalable Verification for Safety-Critical Deep Networks L. Kuper Guy Katz Justin Emile Gottschlich Kyle D. Julian Clark W. Barrett Mykel Kochenderfer 29 40 0 18 Jan 2018