Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,434 papers shown
Title
Wide Compression: Tensor Ring Nets
Wenqi Wang
Yifan Sun
Brian Eriksson
Wenlin Wang
Vaneet Aggarwal
13
168
0
25 Feb 2018
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
15
127
0
23 Feb 2018
Training wide residual networks for deployment using a single bit for each weight
Mark D Mcdonnell
MQ
22
71
0
23 Feb 2018
Approximation Algorithms for Cascading Prediction Models
Matthew J. Streeter
TPM
8
19
0
21 Feb 2018
Building Efficient ConvNets using Redundant Feature Pruning
B. Ayinde
J. Zurada
VLM
3DPC
16
47
0
21 Feb 2018
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning
Hyeontaek Lim
D. Andersen
M. Kaminsky
11
70
0
21 Feb 2018
The Description Length of Deep Learning Models
Léonard Blier
Yann Ollivier
24
95
0
20 Feb 2018
DeepThin: A Self-Compressing Library for Deep Neural Networks
Matthew Sotoudeh
Sara S. Baghsorkhi
16
4
0
20 Feb 2018
Layer-wise synapse optimization for implementing neural networks on general neuromorphic architectures
John Mern
Jayesh K. Gupta
Mykel Kochenderfer
30
1
0
20 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets
Fabian Schuiki
Michael Schaffner
Frank K. Gürkaynak
Luca Benini
21
70
0
19 Feb 2018
Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework
Yanzhi Wang
Caiwen Ding
Zhe Li
Geng Yuan
Siyu Liao
...
Bo Yuan
Xuehai Qian
Jian Tang
Qinru Qiu
X. Lin
18
33
0
18 Feb 2018
Efficient Sparse-Winograd Convolutional Neural Networks
Xingyu Liu
Jeff Pool
Song Han
W. Dally
11
122
0
18 Feb 2018
Towards Principled Design of Deep Convolutional Networks: Introducing SimpNet
S. H. HasanPour
Mohammad Rouhani
Mohsen Fayyaz
Mohammad Sabokrou
Ehsan Adeli
42
45
0
17 Feb 2018
Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers
Tianyun Zhang
Shaokai Ye
Yipeng Zhang
Yanzhi Wang
M. Fardad
17
21
0
15 Feb 2018
Model compression via distillation and quantization
A. Polino
Razvan Pascanu
Dan Alistarh
MQ
17
718
0
15 Feb 2018
Security Analysis and Enhancement of Model Compressed Deep Learning Systems under Adversarial Attacks
Qi Liu
Tao Liu
Zihao Liu
Yanzhi Wang
Yier Jin
Wujie Wen
AAML
27
48
0
14 Feb 2018
Paraphrasing Complex Network: Network Compression via Factor Transfer
Jangho Kim
Seonguk Park
Nojun Kwak
16
543
0
14 Feb 2018
SLAQ: Quality-Driven Scheduling for Distributed Machine Learning
Haoyu Zhang
Logan Stafman
Andrew Or
M. Freedman
16
141
0
13 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
19
389
0
13 Feb 2018
Attention-Based Guided Structured Sparsity of Deep Neural Networks
A. Torfi
Rouzbeh A. Shirvani
Sobhan Soleymani
Nasser M. Nasrabadi
21
23
0
13 Feb 2018
DCFNet: Deep Neural Network with Decomposed Convolutional Filters
Qiang Qiu
Xiuyuan Cheng
Robert Calderbank
Guillermo Sapiro
33
69
0
12 Feb 2018
ClosNets: a Priori Sparse Topologies for Faster DNN Training
Mihailo Isakov
Michel A. Kinsy
CVBM
16
0
0
12 Feb 2018
Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms
J. Ko
Taesik Na
M. Amir
Saibal Mukhopadhyay
16
148
0
11 Feb 2018
ThUnderVolt: Enabling Aggressive Voltage Underscaling and Timing Error Resilience for Energy Efficient Deep Neural Network Accelerators
Jeff Zhang
Kartheek Rangineni
Zahra Ghodsi
S. Garg
20
117
0
11 Feb 2018
Analyzing and Mitigating the Impact of Permanent Faults on a Systolic Array Based Neural Network Accelerator
Jeff Zhang
Tianyu Gu
K. Basu
S. Garg
6
134
0
11 Feb 2018
The Need for Speed of AI Applications: Performance Comparison of Native vs. Browser-based Algorithm Implementations
Bernd Malle
Nicola Giuliani
Peter Kieseberg
Andreas Holzinger
8
8
0
11 Feb 2018
On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks
Yukun Ding
Jinglan Liu
Jinjun Xiong
Yiyu Shi
MQ
21
21
0
10 Feb 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li-Jia Li
Song Han
33
1,339
0
10 Feb 2018
Nature vs. Nurture: The Role of Environmental Resources in Evolutionary Deep Intelligence
A. Chung
Paul Fieguth
A. Wong
14
1
0
09 Feb 2018
Going Deeper in Spiking Neural Networks: VGG and Residual Architectures
Abhronil Sengupta
Yuting Ye
Robert Y. Wang
Chiao Liu
Kaushik Roy
15
978
0
07 Feb 2018
Effective Quantization Approaches for Recurrent Neural Networks
Md. Zahangir Alom
A. Moody
N. Maruyama
B. Van Essen
T. Taha
MQ
8
33
0
07 Feb 2018
CryptoRec: Privacy-preserving Recommendation as a Service
Jun Wang
Afonso Arriaga
Qiang Tang
Peter Y. A. Ryan
13
3
0
07 Feb 2018
Universal Deep Neural Network Compression
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
81
85
0
07 Feb 2018
Digital Watermarking for Deep Neural Networks
Yuki Nagai
Yusuke Uchida
S. Sakazawa
Shiníchi Satoh
WIGM
23
143
0
06 Feb 2018
Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices
Ramyad Hadidi
Jiashen Cao
M. Woodward
Michael S. Ryoo
Hyesoon Kim
14
34
0
05 Feb 2018
Learning Compact Neural Networks with Regularization
Samet Oymak
MLT
25
39
0
05 Feb 2018
Recent Advances in Efficient Computation of Deep Convolutional Neural Networks
Jian Cheng
Peisong Wang
Gang Li
Qinghao Hu
Hanqing Lu
16
3
0
03 Feb 2018
Build a Compact Binary Neural Network through Bit-level Sensitivity and Data Pruning
Yixing Li
Fengbo Ren
MQ
14
12
0
03 Feb 2018
Intriguing Properties of Randomly Weighted Networks: Generalizing While Learning Next to Nothing
Amir Rosenfeld
John K. Tsotsos
MLT
24
51
0
02 Feb 2018
VIBNN: Hardware Acceleration of Bayesian Neural Networks
R. Cai
Ao Ren
Ning Liu
Caiwen Ding
Luhao Wang
Xuehai Qian
Massoud Pedram
Yanzhi Wang
BDL
21
87
0
02 Feb 2018
Adaptive Memory Networks
Daniel Li
Asim Kadav
21
5
0
01 Feb 2018
Alternating Multi-bit Quantization for Recurrent Neural Networks
Chen Xu
Jianqiang Yao
Zhouchen Lin
Wenwu Ou
Yuanbin Cao
Zhirong Wang
H. Zha
MQ
27
115
0
01 Feb 2018
Model compression for faster structural separation of macromolecules captured by Cellular Electron Cryo-Tomography
JiaLiang Guo
Bo Zhou
Xiangrui Zeng
Z. Freyberg
Min Xu
17
10
0
31 Jan 2018
Recovering from Random Pruning: On the Plasticity of Deep Convolutional Neural Networks
Deepak Mittal
S. Bhardwaj
Mitesh M. Khapra
Balaraman Ravindran
VLM
22
65
0
31 Jan 2018
On Psychoacoustically Weighted Cost Functions Towards Resource-Efficient Deep Neural Networks for Speech Denoising
Kai Zhen
Aswin Sivaraman
Jongmo Sung
Minje Kim
9
7
0
29 Jan 2018
Stacked Filters Stationary Flow For Hardware-Oriented Acceleration Of Deep Convolutional Neural Networks
Yuechao Gao
Nianhong Liu
Shenmin Zhang
13
0
0
23 Jan 2018
Learning to Prune Filters in Convolutional Neural Networks
Qiangui Huang
S. Kevin Zhou
Suya You
Ulrich Neumann
VLM
23
176
0
23 Jan 2018
Binary output layer of feedforward neural networks for solving multi-class classification problems
Sibo Yang
Chao Zhang
Wei Wu
MQ
14
8
0
22 Jan 2018
Bayesian Deep Convolutional Encoder-Decoder Networks for Surrogate Modeling and Uncertainty Quantification
Yinhao Zhu
N. Zabaras
UQCV
BDL
17
636
0
21 Jan 2018
Toward Scalable Verification for Safety-Critical Deep Networks
L. Kuper
Guy Katz
Justin Emile Gottschlich
Kyle D. Julian
Clark W. Barrett
Mykel Kochenderfer
29
40
0
18 Jan 2018
Previous
1
2
3
...
62
63
64
...
67
68
69
Next