Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015

Song Han

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown

Title
StressedNets: Efficient Feature Representations via Stress-induced Evolutionary Synthesis of Deep Neural Networks M. Shafiee Brendan Chwyl Francis Li Rongyan Chen Michelle Karg C. Scharfenberger A. Wong 6 7 0 16 Jan 2018
Deep Net Triage: Analyzing the Importance of Network Layers via Structural Compression Theodore S. Nowak Jason J. Corso FAtt 16 3 0 15 Jan 2018
Fix your classifier: the marginal value of training the last weight layer Elad Hoffer Itay Hubara Daniel Soudry 19 101 0 14 Jan 2018
Focus: Querying Large Video Datasets with Low Latency and Low Cost Kevin Hsieh Ganesh Ananthanarayanan P. Bodík P. Bahl Matthai Philipose Phillip B. Gibbons O. Mutlu 14 275 0 10 Jan 2018
Learning $3$ D-FilterMap for Deep Convolutional Neural Networks Yingzhen Yang Jianchao Yang N. Xu Wei Han 3DV MQ 13 1 0 05 Jan 2018
Overcoming catastrophic forgetting with hard attention to the task Joan Serra Dídac Surís M. Miron Alexandros Karatzoglou CLL 27 1,048 0 04 Jan 2018
Learning a Wavelet-like Auto-Encoder to Accelerate Deep Neural Networks Tianshui Chen Liang Lin W. Zuo Xiaonan Luo Lei Zhang 8 56 0 20 Dec 2017
DeepPicar: A Low-cost Deep Neural Network-based Autonomous Car Michael Bechtel Elise McEllhiney Minje Kim H. Yun 22 103 0 19 Dec 2017
Squeezed Convolutional Variational AutoEncoder for Unsupervised Anomaly Detection in Edge Device Industrial Internet of Things Dohyung Kim Hyochang Yang Minki Chung Sungzoon Cho DRL 9 32 0 18 Dec 2017
Automated flow for compressing convolution neural networks for efficient edge-computation with FPGA F. Shafiq Takato Yamada Antonio T. Vilchez Sakyasingha Dasgupta MQ 16 3 0 18 Dec 2017
clcNet: Improving the Efficiency of Convolutional Neural Network using Channel Local Convolutions Dong-Qing Zhang 6 10 0 17 Dec 2017
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference Benoit Jacob S. Kligys Bo Chen Menglong Zhu Matthew Tang Andrew G. Howard Hartwig Adam Dmitry Kalenichenko MQ 16 3,043 0 15 Dec 2017
BT-Nets: Simplifying Deep Neural Networks via Block Term Decomposition Guangxi Li Jinmian Ye Haiqin Yang Di Chen Shuicheng Yan Zenglin Xu 14 11 0 15 Dec 2017
FFT-Based Deep Learning Deployment in Embedded Systems Sheng Lin Ning Liu M. Nazemi Hongjia Li Caiwen Ding Yanzhi Wang Massoud Pedram 25 52 0 13 Dec 2017
NestedNet: Learning Nested Sparse Structures in Deep Neural Networks Eunwoo Kim Chanho Ahn Songhwai Oh 16 2 0 11 Dec 2017
AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training Chia-Yu Chen Jungwook Choi D. Brand A. Agrawal Wei Zhang K. Gopalakrishnan ODL 14 172 0 07 Dec 2017
Automated Pruning for Deep Neural Network Compression Franco Manessi A. Rozza Simone Bianco Paolo Napoletano Raimondo Schettini 20 56 0 05 Dec 2017
Learning Sparse Neural Networks through $L_0$ Regularization Christos Louizos Max Welling Diederik P. Kingma 14 1,128 0 04 Dec 2017
Adaptive Quantization for Deep Neural Network Yiren Zhou Seyed-Mohsen Moosavi-Dezfooli Ngai-man Cheung P. Frossard MQ 14 177 0 04 Dec 2017
Homomorphic Parameter Compression for Distributed Deep Learning Training Jaehee Jang Byunggook Na Sungroh Yoon FedML 22 1 0 28 Nov 2017
WSNet: Compact and Efficient Networks Through Weight Sampling Xiaojie Jin Yingzhen Yang N. Xu Jianchao Yang Nebojsa Jojic Jiashi Feng Shuicheng Yan 19 2 0 28 Nov 2017
Slim Embedding Layers for Recurrent Neural Language Models Zhongliang Li Raymond Kulhanek Shaojun Wang Yunxin Zhao Shuang Wu KELM 19 23 0 27 Nov 2017
SkipNet: Learning Dynamic Routing in Convolutional Networks Xin Wang F. I. F. Richard Yu Zi-Yi Dou Trevor Darrell Joseph E. Gonzalez 11 624 0 26 Nov 2017
CondenseNet: An Efficient DenseNet using Learned Group Convolutions Gao Huang Shichen Liu L. V. D. van der Maaten Kilian Q. Weinberger 28 795 0 25 Nov 2017
Deep Expander Networks: Efficient Deep Networks from Graph Theory Ameya Prabhu G. Varma A. Namboodiri GNN 21 70 0 23 Nov 2017
BlockDrop: Dynamic Inference Paths in Residual Networks Zuxuan Wu Tushar Nagarajan Abhishek Kumar Steven J. Rennie L. Davis Kristen Grauman Rogerio Feris 15 461 0 22 Nov 2017
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions Bichen Wu Alvin Wan Xiangyu Yue Peter H. Jin Sicheng Zhao Noah Golmant A. Gholaminejad Joseph E. Gonzalez Kurt Keutzer 3DPC 18 363 0 22 Nov 2017
Evaluating Robustness of Neural Networks with Mixed Integer Programming Vincent Tjeng Kai Y. Xiao Russ Tedrake AAML 41 117 0 20 Nov 2017
Interleaver Design for Deep Neural Networks Sourya Dey P. Beerel K. Chugg 13 6 0 18 Nov 2017
Training Simplification and Model Simplification for Deep Learning: A Minimal Effort Back Propagation Method Xu Sun Xuancheng Ren Shuming Ma Bingzhen Wei Wei Li Jingjing Xu Houfeng Wang Yi Zhang 23 25 0 17 Nov 2017
Improved Bayesian Compression Marco Federici Karen Ullrich Max Welling UQCV BDL 17 19 0 17 Nov 2017
Mobile Video Object Detection with Temporally-Aware Feature Maps Mason Liu Menglong Zhu ObjD 16 196 0 17 Nov 2017
NISP: Pruning Networks using Neuron Importance Score Propagation Ruichi Yu Ang Li Chun-Fu Chen Jui-Hsin Lai Vlad I. Morariu Xintong Han M. Gao Ching-Yung Lin L. Davis 24 795 0 16 Nov 2017
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy Asit K. Mishra Debbie Marr FedML 16 330 0 15 Nov 2017
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler Yu Ji Youhui Zhang Wenguang Chen Yuan Xie 19 56 0 15 Nov 2017
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning Arun Mallya Svetlana Lazebnik CLL 13 1,269 0 15 Nov 2017
Deep Rewiring: Training very sparse deep networks G. Bellec David Kappel Wolfgang Maass R. Legenstein BDL 22 273 0 14 Nov 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation Moritz B. Milde Daniel Neil Alessandro Aimar T. Delbruck Giacomo Indiveri MQ 22 9 0 13 Nov 2017
Weightless: Lossy Weight Encoding For Deep Neural Network Compression Brandon Reagen Udit Gupta Bob Adolf Michael Mitzenmacher Alexander M. Rush Gu-Yeon Wei David Brooks 19 38 0 13 Nov 2017
CT-SRCNN: Cascade Trained and Trimmed Deep Convolutional Neural Networks for Image Super Resolution Haoyu Ren Mostafa El-Khamy Jungwon Lee SupR 33 27 0 11 Nov 2017
Learning K-way D-dimensional Discrete Code For Compact Embedding Representations Ting Chen Martin Renqiang Min Yizhou Sun 11 10 0 08 Nov 2017
Revealing structure components of the retina by deep learning networks Qianyu Yan Zhaofei Yu Feng Chen Jian K. Liu FAtt 8 7 0 08 Nov 2017
Block-Sparse Recurrent Neural Networks Sharan Narang Eric Undersander G. Diamos 9 136 0 08 Nov 2017
Compression-aware Training of Deep Networks J. Álvarez Mathieu Salzmann 8 172 0 07 Nov 2017
Moonshine: Distilling with Cheap Convolutions Elliot J. Crowley Gavia Gray Amos Storkey 19 120 0 07 Nov 2017
Interpreting Convolutional Neural Networks Through Compression R. Abbasi-Asl Bin-Xia Yu FAtt 11 21 0 07 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks Sanchari Sen Shubham Jain Swagath Venkataramani A. Raghunathan 16 30 0 07 Nov 2017
Characterizing Sparse Connectivity Patterns in Neural Networks Sourya Dey Kuan-Wen Huang P. Beerel K. Chugg 22 11 0 06 Nov 2017
Neural Speed Reading via Skim-RNN Minjoon Seo Sewon Min Ali Farhadi Hannaneh Hajishirzi 29 79 0 06 Nov 2017
Accelerating Training of Deep Neural Networks via Sparse Edge Processing Sourya Dey Yinan Shao K. Chugg P. Beerel 22 16 0 03 Nov 2017