ResearchTrend.AI

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

34 / 3,434 papers shown
CNNLab: a Novel Parallel Framework for Neural Networks using GPU and FPGA-a Practical Study with Trade-off Analysis
Maohua Zhu
L. Liu
Chao Wang
Yuan Xie
GNN
14
20
0
20 Jun 2016
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients
Shuchang Zhou
Yuxin Wu
Zekun Ni
Xinyu Zhou
He Wen
Yuheng Zou
MQ
21
2,071
0
20 Jun 2016
Deep Learning with Darwin: Evolutionary Synthesis of Deep Neural Networks
M. Shafiee
A. Mishra
A. Wong
16
44
0
14 Jun 2016
Structured Convolution Matrices for Energy-efficient Deep learning
R. Appuswamy
T. Nayak
John V. Arthur
S. K. Esser
P. Merolla
J. McKinstry
T. Melano
M. Flickner
D. Modha
25
11
0
08 Jun 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
216
2,055
0
07 Jun 2016
Learning Natural Language Inference using Bidirectional LSTM model and Inner-Attention
Yang Janet Liu
Chengjie Sun
Mehdi Alizadeh
Xiaolong Wang
22
273
0
30 May 2016
An Analysis of Deep Neural Network Models for Practical Applications
A. Canziani
Adam Paszke
Eugenio Culurciello
8
1,164
0
24 May 2016
Path-Normalized Optimization of Recurrent Neural Networks with ReLU Activations
Behnam Neyshabur
Yuhuai Wu
Ruslan Salakhutdinov
Nathan Srebro
AI4CE
ODL
14
30
0
23 May 2016
Learning Sensor Multiplexing Design through Back-propagation
Ayan Chakrabarti
SSL
18
126
0
23 May 2016
Functional Hashing for Compressing Neural Networks
Lei Shi
Shikun Feng
Zhifan Zhu
25
4
0
20 May 2016
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks
Philipp Gysel
19
127
0
20 May 2016
Reducing the Model Order of Deep Neural Networks Using Information Theory
Ming Tu
Visar Berisha
Yu Cao
Jae-sun Seo
6
23
0
16 May 2016
Ternary Weight Networks
Fengfu Li
Bin Liu
Xiaoxing Wang
Bo-Wen Zhang
Junchi Yan
MQ
19
520
0
16 May 2016
ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks using Angle Sensitive Pixels
H. G. Chen
Suren Jayasuriya
Jiyue Yang
J. Stephen
S. Sivaramakrishnan
Ashok Veeraraghavan
A. Molnar
13
66
0
11 May 2016
Hardware-oriented Approximation of Convolutional Neural Networks
Philipp Gysel
Mohammad Motamedi
S. Ghiasi
39
309
0
11 Apr 2016
Training Constrained Deconvolutional Networks for Road Scene Semantic Segmentation
G. Ros
Simon Stent
P. Alcantarilla
Tomoki Watanabe
13
55
0
06 Apr 2016
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
Mohammad Rastegari
Vicente Ordonez
Joseph Redmon
Ali Farhadi
MQ
8
4,328
0
16 Mar 2016
Convolutional Neural Networks using Logarithmic Data Representation
Daisuke Miyashita
Edward H. Lee
B. Murmann
MQ
19
425
0
03 Mar 2016
vDNN: Virtualized Deep Neural Networks for Scalable, Memory-Efficient Neural Network Design
Minsoo Rhu
N. Gimelshein
Jason Clemons
A. Zulfiqar
S. Keckler
GNN
6
32
0
25 Feb 2016
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size
F. Iandola
Song Han
Matthew W. Moskewicz
Khalid Ashraf
W. Dally
Kurt Keutzer
25
7,412
0
24 Feb 2016
Binarized Neural Networks
Itay Hubara
Daniel Soudry
Ran El-Yaniv
MQ
15
1,351
0
08 Feb 2016
EIE: Efficient Inference Engine on Compressed Deep Neural Network
Song Han
Xingyu Liu
Huizi Mao
Jing Pu
A. Pedram
M. Horowitz
W. Dally
28
2,446
0
04 Feb 2016
Relief R-CNN : Utilizing Convolutional Features for Fast Object Detection
Guiying Li
Junlong Liu
Chunhui Jiang
Liangpeng Zhang
Minlong Lin
Ke Tang
ObjD
21
7
0
25 Jan 2016
Structured Pruning of Deep Convolutional Neural Networks
S. Anwar
Kyuyeon Hwang
Wonyong Sung
14
741
0
29 Dec 2015
Recent Advances in Convolutional Neural Networks
Jiuxiang Gu
Zhenhua Wang
Jason Kuen
Lianyang Ma
Amir Shahroudy
...
Xingxing Wang
Li Wang
Gang Wang
Jianfei Cai
Tsuhan Chen
29
5,134
0
22 Dec 2015
Quantized Convolutional Neural Networks for Mobile Devices
Jiaxiang Wu
Cong Leng
Yuhang Wang
Qinghao Hu
Jian Cheng
MQ
10
1,156
0
21 Dec 2015
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications
Yong-Deok Kim
Eunhyeok Park
S. Yoo
Taelim Choi
Lu Yang
Dongjun Shin
12
892
0
20 Nov 2015
Resiliency of Deep Neural Networks under Quantization
Wonyong Sung
Sungho Shin
Kyuyeon Hwang
MQ
12
157
0
20 Nov 2015
Blending LSTMs into CNNs
Krzysztof J. Geras
Abdel-rahman Mohamed
R. Caruana
G. Urban
Shengjie Wang
Ozlem Aslan
Matthai Philipose
Matthew Richardson
Charles Sutton
11
60
0
19 Nov 2015
Fixed Point Quantization of Deep Convolutional Networks
D. Lin
S. Talathi
V. Annapureddy
MQ
14
809
0
19 Nov 2015
Adjustable Bounded Rectifiers: Towards Deep Binary Representations
Zhirong Wu
Dahua Lin
Xiaoou Tang
MQ
14
14
0
19 Nov 2015
ACDC: A Structured Efficient Linear Layer
Marcin Moczulski
Misha Denil
J. Appleyard
Nando de Freitas
16
98
0
18 Nov 2015
FireCaffe: near-linear acceleration of deep neural network training on compute clusters
F. Iandola
Khalid Ashraf
Matthew W. Moskewicz
Kurt Keutzer
11
302
0
31 Oct 2015
Learning both Weights and Connections for Efficient Neural Networks
Song Han
Jeff Pool
J. Tran
W. Dally
CVBM
27
6,559
0
08 Jun 2015