Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,434 papers shown
Title
ReBNet: Residual Binarized Neural Network
M. Ghasemzadeh
Mohammad Samragh
F. Koushanfar
MQ
22
4
0
03 Nov 2017
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting Input and Output Sparsity
Jingyang Zhu
Jingbo Jiang
Xizi Chen
Chi-Ying Tsui
18
36
0
03 Nov 2017
Compressing Word Embeddings via Deep Compositional Code Learning
Raphael Shu
Hideki Nakayama
21
129
0
03 Nov 2017
Efficient Inferencing of Compressed Deep Neural Networks
Dharma Teja Vooturi
Saurabh Goyal
Anamitra R. Choudhury
Yogish Sabharwal
Ashish Verma
16
6
0
01 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
23
231
0
01 Nov 2017
Tensorizing Generative Adversarial Nets
Xingwei Cao
Xuyang Zhao
Qibin Zhao
GAN
17
9
0
30 Oct 2017
Knowledge Projection for Deep Neural Networks
Zhi Zhang
G. Ning
Zhihai He
28
15
0
26 Oct 2017
Trace norm regularization and faster inference for embedded speech recognition RNNs
Markus Kliegl
Siddharth Goyal
Kexin Zhao
Kavya Srinet
M. Shoeybi
15
8
0
25 Oct 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
15
1,085
0
23 Oct 2017
Learning Discrete Weights Using the Local Reparameterization Trick
Oran Shayer
Dan Levi
Ethan Fetaya
11
88
0
21 Oct 2017
Data-Free Knowledge Distillation for Deep Neural Networks
Raphael Gontijo-Lopes
Stefano Fenu
Thad Starner
12
270
0
19 Oct 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
D. Loroch
Norbert Wehn
Franz-Josef Pfreundt
J. Keuper
MQ
25
23
0
13 Oct 2017
STDP Based Pruning of Connections and Weight Quantization in Spiking Neural Networks for Energy Efficient Recognition
Nitin Rathi
Priyadarshini Panda
Kaushik Roy
16
111
0
12 Oct 2017
Energy-efficient Amortized Inference with Cascaded Deep Classifiers
Jiaqi Guan
Yang Liu
Qiang Liu
Jian-wei Peng
14
33
0
10 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial
Mingzhe Chen
Ursula Challita
Walid Saad
Changchuan Yin
Mérouane Debbah
15
207
0
09 Oct 2017
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with Small Deep-Neural-Network Architectures
F. Iandola
Kurt Keutzer
15
37
0
07 Oct 2017
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu
Suyog Gupta
37
1,248
0
05 Oct 2017
Improving Efficiency in Convolutional Neural Network with Multilinear Filters
D. Tran
Alexandros Iosifidis
M. Gabbouj
16
40
0
28 Sep 2017
Connectivity Learning in Multi-Branch Networks
Karim Ahmed
Lorenzo Torresani
16
26
0
27 Sep 2017
Machine Learning Models that Remember Too Much
Congzheng Song
Thomas Ristenpart
Vitaly Shmatikov
VLM
16
502
0
22 Sep 2017
Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design
Zhourui Song
Zhenyu Liu
Dongsheng Wang
13
41
0
22 Sep 2017
Structured Probabilistic Pruning for Convolutional Neural Network Acceleration
Huan Wang
Qiming Zhang
Yuehai Wang
Roland Hu
13
11
0
20 Sep 2017
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks
Julian Faraone
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
MQ
UQCV
13
12
0
19 Sep 2017
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning
A. Ashok
Nicholas Rhinehart
Fares N. Beainy
Kris M. Kitani
16
169
0
18 Sep 2017
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage Requirement
Tianchan Guan
Xiaoyang Zeng
Mingoo Seok
MQ
20
6
0
15 Sep 2017
A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications
Yuan Du
Li Du
Yilei Li
Junjie Su
Mau-Chung Frank Chang
14
6
0
15 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory
W. Wen
Yuxiong He
Samyam Rajbhandari
Minjia Zhang
Wenhan Wang
Fang Liu
Bin Hu
Yiran Chen
H. Li
MQ
21
140
0
15 Sep 2017
Supervising Unsupervised Learning
Vikas K. Garg
Adam Kalai
SSL
FedML
16
29
0
14 Sep 2017
Binary-decomposed DCNN for accelerating computation and compressing model without retraining
Ryuji Kamiya
Takayoshi Yamashita
Mitsuru Ambai
Ikuro Sato
Yuji Yamauchi
H. Fujiyoshi
MQ
12
4
0
14 Sep 2017
Flexible Network Binarization with Layer-wise Priority
He Wang
Yi Tian Xu
Bingbing Ni
Hongteng Xu
MQ
23
10
0
13 Sep 2017
Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification
Chong-Jun Wang
Xipeng Lan
Yang Zhang
CVBM
15
26
0
09 Sep 2017
Real-time convolutional networks for sonar image classification in low-power embedded systems
Matias Valdenegro-Toro
23
10
0
07 Sep 2017
The Mating Rituals of Deep Neural Networks: Learning Compact Feature Representations through Sexual Evolutionary Synthesis
A. Chung
M. Shafiee
Paul Fieguth
A. Wong
10
4
0
07 Sep 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
11
1,109
0
06 Sep 2017
Domain-adaptive deep network compression
Marc Masana
Joost van de Weijer
Luis Herranz
Andrew D. Bagdanov
J. Álvarez
36
62
0
04 Sep 2017
Fast Image Processing with Fully-Convolutional Networks
Qifeng Chen
Jia Xu
V. Koltun
10
322
0
02 Sep 2017
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network Simulation Expansion and STDP Convergence Predictions
Toby Lightheart
S. Grainger
Tien-Fu Lu
8
0
0
30 Aug 2017
Performance Guaranteed Network Acceleration via High-Order Residual Quantization
Zefan Li
Bingbing Ni
Wenjun Zhang
Xiaokang Yang
Wen Gao
MQ
16
105
0
29 Aug 2017
CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices
Caiwen Ding
Siyu Liao
Yanzhi Wang
Zhe Li
Ning Liu
...
Yipeng Zhang
Jian Tang
Qinru Qiu
X. Lin
Bo Yuan
GNN
19
258
0
29 Aug 2017
Deep Learning Sparse Ternary Projections for Compressed Sensing of Images
Duc Minh Nguyen
Evaggelia Tsiligianni
Nikos Deligiannis
13
26
0
28 Aug 2017
The Convergence of Machine Learning and Communications
Wojciech Samek
S. Stańczak
Thomas Wiegand
AI4CE
24
29
0
28 Aug 2017
Learning Efficient Convolutional Networks through Network Slimming
Zhuang Liu
Jianguo Li
Zhiqiang Shen
Gao Huang
Shoumeng Yan
Changshui Zhang
24
2,383
0
22 Aug 2017
Neural Networks Compression for Language Modeling
Artem M. Grachev
D. Ignatov
Andrey V. Savchenko
14
30
0
20 Aug 2017
Deep Neural Network Capacity
Aosen Wang
Huan Zhou
Wenyao Xu
Xin Chen
11
4
0
16 Aug 2017
BitNet: Bit-Regularized Deep Neural Networks
Aswin Raghavan
Mohamed R. Amer
S. Chai
Graham Taylor
MQ
22
10
0
16 Aug 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices
Dawei Li
Xiaolong Wang
Deguang Kong
15
97
0
16 Aug 2017
Enabling Massive Deep Neural Networks with the GraphBLAS
J. Kepner
Manoj Kumar
José Moreira
P. Pattnaik
M. Serrano
H. Tufo
GNN
14
33
0
09 Aug 2017
Prune the Convolutional Neural Networks with Sparse Shrink
X. Li
Changsong Liu
CVBM
11
4
0
08 Aug 2017
Natural Language Processing with Small Feed-Forward Networks
Jan A. Botha
Emily Pitler
Ji Ma
A. Bakalov
Alexandru Salcianu
David J. Weiss
Ryan T. McDonald
Slav Petrov
HAI
17
38
0
01 Aug 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform
Chaim Baskin
Natan Liss
Evgenii Zheltonozhskii
A. Bronstein
A. Mendelson
GNN
MQ
28
35
0
31 Jul 2017
Previous
1
2
3
...
64
65
66
67
68
69
Next