Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

34 / 3,434 papers shown

Title
CNNLab: a Novel Parallel Framework for Neural Networks using GPU and FPGA-a Practical Study with Trade-off Analysis Maohua Zhu L. Liu Chao Wang Yuan Xie GNN 14 20 0 20 Jun 2016
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients Shuchang Zhou Yuxin Wu Zekun Ni Xinyu Zhou He Wen Yuheng Zou MQ 21 2,071 0 20 Jun 2016
Deep Learning with Darwin: Evolutionary Synthesis of Deep Neural Networks M. Shafiee A. Mishra A. Wong 16 44 0 14 Jun 2016
Structured Convolution Matrices for Energy-efficient Deep learning R. Appuswamy T. Nayak John V. Arthur S. K. Esser P. Merolla J. McKinstry T. Melano M. Flickner D. Modha 25 11 0 08 Jun 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation Adam Paszke Abhishek Chaurasia Sangpil Kim Eugenio Culurciello SSeg 216 2,055 0 07 Jun 2016
Learning Natural Language Inference using Bidirectional LSTM model and Inner-Attention Yang Janet Liu Chengjie Sun Mehdi Alizadeh Xiaolong Wang 22 273 0 30 May 2016
An Analysis of Deep Neural Network Models for Practical Applications A. Canziani Adam Paszke Eugenio Culurciello 8 1,164 0 24 May 2016
Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations Behnam Neyshabur Yuhuai Wu Ruslan Salakhutdinov Nathan Srebro AI4CE ODL 14 30 0 23 May 2016
Learning Sensor Multiplexing Design through Back-propagation Ayan Chakrabarti SSL 18 126 0 23 May 2016
Functional Hashing for Compressing Neural Networks Lei Shi Shikun Feng Zhifan Zhu 25 4 0 20 May 2016
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks Philipp Gysel 19 127 0 20 May 2016
Reducing the Model Order of Deep Neural Networks Using Information Theory Ming Tu Visar Berisha Yu Cao Jae-sun Seo 6 23 0 16 May 2016
Ternary Weight Networks Fengfu Li Bin Liu Xiaoxing Wang Bo-Wen Zhang Junchi Yan MQ 19 520 0 16 May 2016
ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks using Angle Sensitive Pixels H. G. Chen Suren Jayasuriya Jiyue Yang J. Stephen S. Sivaramakrishnan Ashok Veeraraghavan A. Molnar 13 66 0 11 May 2016
Hardware-oriented Approximation of Convolutional Neural Networks Philipp Gysel Mohammad Motamedi S. Ghiasi 39 309 0 11 Apr 2016
Training Constrained Deconvolutional Networks for Road Scene Semantic Segmentation G. Ros Simon Stent P. Alcantarilla Tomoki Watanabe 13 55 0 06 Apr 2016
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks Mohammad Rastegari Vicente Ordonez Joseph Redmon Ali Farhadi MQ 8 4,328 0 16 Mar 2016
Convolutional Neural Networks using Logarithmic Data Representation Daisuke Miyashita Edward H. Lee B. Murmann MQ 19 425 0 03 Mar 2016
vDNN: Virtualized Deep Neural Networks for Scalable, Memory-Efficient Neural Network Design Minsoo Rhu N. Gimelshein Jason Clemons A. Zulfiqar S. Keckler GNN 6 32 0 25 Feb 2016
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size F. Iandola Song Han Matthew W. Moskewicz Khalid Ashraf W. Dally Kurt Keutzer 25 7,412 0 24 Feb 2016
Binarized Neural Networks Itay Hubara Daniel Soudry Ran El-Yaniv MQ 15 1,351 0 08 Feb 2016
EIE: Efficient Inference Engine on Compressed Deep Neural Network Song Han Xingyu Liu Huizi Mao Jing Pu A. Pedram M. Horowitz W. Dally 28 2,446 0 04 Feb 2016
Relief R-CNN : Utilizing Convolutional Features for Fast Object Detection Guiying Li Junlong Liu Chunhui Jiang Liangpeng Zhang Minlong Lin Ke Tang ObjD 21 7 0 25 Jan 2016
Structured Pruning of Deep Convolutional Neural Networks S. Anwar Kyuyeon Hwang Wonyong Sung 14 741 0 29 Dec 2015
Recent Advances in Convolutional Neural Networks Jiuxiang Gu Zhenhua Wang Jason Kuen Lianyang Ma Amir Shahroudy ... Xingxing Wang Li Wang Gang Wang Jianfei Cai Tsuhan Chen 29 5,134 0 22 Dec 2015
Quantized Convolutional Neural Networks for Mobile Devices Jiaxiang Wu Cong Leng Yuhang Wang Qinghao Hu Jian Cheng MQ 10 1,156 0 21 Dec 2015
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications Yong-Deok Kim Eunhyeok Park S. Yoo Taelim Choi Lu Yang Dongjun Shin 12 892 0 20 Nov 2015
Resiliency of Deep Neural Networks under Quantization Wonyong Sung Sungho Shin Kyuyeon Hwang MQ 12 157 0 20 Nov 2015
Blending LSTMs into CNNs Krzysztof J. Geras Abdel-rahman Mohamed R. Caruana G. Urban Shengjie Wang Ozlem Aslan Matthai Philipose Matthew Richardson Charles Sutton 11 60 0 19 Nov 2015
Fixed Point Quantization of Deep Convolutional Networks D. Lin S. Talathi V. Annapureddy MQ 14 809 0 19 Nov 2015
Adjustable Bounded Rectifiers: Towards Deep Binary Representations Zhirong Wu Dahua Lin Xiaoou Tang MQ 14 14 0 19 Nov 2015
ACDC: A Structured Efficient Linear Layer Marcin Moczulski Misha Denil J. Appleyard Nando de Freitas 16 98 0 18 Nov 2015
FireCaffe: near-linear acceleration of deep neural network training on compute clusters F. Iandola Khalid Ashraf Matthew W. Moskewicz Kurt Keutzer 11 302 0 31 Oct 2015
Learning both Weights and Connections for Efficient Neural Networks Song Han Jeff Pool J. Tran W. Dally CVBM 27 6,559 0 08 Jun 2015