ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXivPDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown
Title
Learning towards Minimum Hyperspherical Energy
Learning towards Minimum Hyperspherical Energy
Weiyang Liu
Rongmei Lin
Z. Liu
Lixin Liu
Zhiding Yu
Bo Dai
Le Song
17
145
0
23 May 2018
AutoPruner: An End-to-End Trainable Filter Pruning Method for Efficient
  Deep Model Inference
AutoPruner: An End-to-End Trainable Filter Pruning Method for Efficient Deep Model Inference
Jian-Hao Luo
Jianxin Wu
10
207
0
23 May 2018
Approximate Random Dropout
Approximate Random Dropout
Zhuoran Song
Ru Wang
Dongyu Ru
Hongru Huang
Zhenghao Peng
Hai Zhao
Xiaoyao Liang
Li Jiang
BDL
6
9
0
23 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN
  Training
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
19
44
0
22 May 2018
CascadeCNN: Pushing the performance limits of quantisation
CascadeCNN: Pushing the performance limits of quantisation
Alexandros Kouris
Stylianos I. Venieris
C. Bouganis
MQ
17
24
0
22 May 2018
Parsimonious Bayesian deep networks
Parsimonious Bayesian deep networks
Mingyuan Zhou
BDL
9
8
0
22 May 2018
AxTrain: Hardware-Oriented Neural Network Training for Approximate
  Inference
AxTrain: Hardware-Oriented Neural Network Training for Approximate Inference
Xin He
Liu Ke
Wenyan Lu
Guihai Yan
Xuan Zhang
11
33
0
21 May 2018
Compression of Deep Convolutional Neural Networks under Joint Sparsity
  Constraints
Compression of Deep Convolutional Neural Networks under Joint Sparsity Constraints
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
9
6
0
21 May 2018
Faster Neural Network Training with Approximate Tensor Operations
Faster Neural Network Training with Approximate Tensor Operations
Menachem Adelman
Kfir Y. Levy
Ido Hakimi
M. Silberstein
21
26
0
21 May 2018
Neural Network Compression using Transform Coding and Clustering
Neural Network Compression using Transform Coding and Clustering
Thorsten Laude
Yannick Richter
Jörn Ostermann
10
4
0
18 May 2018
RotDCF: Decomposition of Convolutional Filters for Rotation-Equivariant
  Deep Networks
RotDCF: Decomposition of Convolutional Filters for Rotation-Equivariant Deep Networks
Xiuyuan Cheng
Qiang Qiu
Robert Calderbank
Guillermo Sapiro
25
43
0
17 May 2018
Object detection at 200 Frames Per Second
Object detection at 200 Frames Per Second
Rakesh Mehta
Cemalettin Öztürk
ObjD
23
61
0
16 May 2018
Lightweight Pyramid Networks for Image Deraining
Lightweight Pyramid Networks for Image Deraining
Xueyang Fu
Borong Liang
Yue Huang
Xinghao Ding
John Paisley
13
323
0
16 May 2018
Hu-Fu: Hardware and Software Collaborative Attack Framework against
  Neural Networks
Hu-Fu: Hardware and Software Collaborative Attack Framework against Neural Networks
Wenshuo Li
Jincheng Yu
Xuefei Ning
Pengjun Wang
Qi Wei
Yu Wang
Huazhong Yang
AAML
31
61
0
14 May 2018
Unifying and Merging Well-trained Deep Neural Networks for Inference
  Stage
Unifying and Merging Well-trained Deep Neural Networks for Inference Stage
Yi-Min Chou
Yi-Ming Chan
Jia-Hong Lee
Chih-Yi Chiu
Chu-Song Chen
MoMe
27
34
0
14 May 2018
ContextNet: Exploring Context and Detail for Semantic Segmentation in
  Real-time
ContextNet: Exploring Context and Detail for Semantic Segmentation in Real-time
Rudra P. K. Poudel
Ujwal D. Bonde
Stephan Liwicki
Christopher Zach
SSeg
38
227
0
11 May 2018
GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial
  Networks
GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks
Amir Yazdanbakhsh
Hajar Falahati
Philip J. Wolfe
K. Samadi
N. Kim
H. Esmaeilzadeh
20
71
0
10 May 2018
Boosting up Scene Text Detectors with Guided CNN
Boosting up Scene Text Detectors with Guided CNN
Xiaoyu Yue
Zhanghui Kuang
Zhaoyang Zhang
Zhenfang Chen
Pan He
Yu Qiao
Wayne Zhang
9
8
0
10 May 2018
Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks
Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks
Charles Eckert
Xiaowei Wang
Jingcheng Wang
Arun K. Subramaniyan
R. Iyer
D. Sylvester
D. Blaauw
R. Das
MQ
6
333
0
09 May 2018
Towards Accurate and High-Speed Spiking Neuromorphic Systems with Data
  Quantization-Aware Deep Networks
Towards Accurate and High-Speed Spiking Neuromorphic Systems with Data Quantization-Aware Deep Networks
Fuqiang Liu
Chenchen Liu
14
5
0
08 May 2018
A Hierarchical Matcher using Local Classifier Chains
A Hierarchical Matcher using Local Classifier Chains
Lingfeng Zhang
I. Kakadiaris
9
0
0
07 May 2018
Enhancing the Regularization Effect of Weight Pruning in Artificial
  Neural Networks
Enhancing the Regularization Effect of Weight Pruning in Artificial Neural Networks
Brian Bartoldson
Adrian Barbu
G. Erlebacher
12
5
0
04 May 2018
Power Law in Sparsified Deep Neural Networks
Power Law in Sparsified Deep Neural Networks
Lu Hou
James T. Kwok
16
3
0
04 May 2018
Pixel-wise Attentional Gating for Parsimonious Pixel Labeling
Pixel-wise Attentional Gating for Parsimonious Pixel Labeling
Shu Kong
Charless C. Fowlkes
45
40
0
03 May 2018
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural
  Networks
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
Chaim Baskin
Eli Schwartz
Evgenii Zheltonozhskii
Natan Liss
Raja Giryes
A. Bronstein
A. Mendelson
MQ
9
45
0
29 Apr 2018
Precise Box Score: Extract More Information from Datasets to Improve the
  Performance of Face Detection
Precise Box Score: Extract More Information from Datasets to Improve the Performance of Face Detection
Ce Qi
Xiaoping Chen
Pingyu Wang
Fei Su
CVBM
6
1
0
28 Apr 2018
Low-memory convolutional neural networks through incremental depth-first
  processing
Low-memory convolutional neural networks through incremental depth-first processing
Jonathan Binas
Yoshua Bengio
SupR
20
3
0
28 Apr 2018
Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip
Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip
Feiwen Zhu
Jeff Pool
M. Andersch
J. Appleyard
Fung Xie
10
29
0
26 Apr 2018
Profile-guided memory optimization for deep neural networks
Profile-guided memory optimization for deep neural networks
Taro Sekiyama
T. Imamichi
Haruki Imai
Raymond H. Putra
16
22
0
26 Apr 2018
Accelerator-Aware Pruning for Convolutional Neural Networks
Accelerator-Aware Pruning for Convolutional Neural Networks
Hyeong-Ju Kang
6
88
0
26 Apr 2018
Efficient Multi-objective Neural Architecture Search via Lamarckian
  Evolution
Efficient Multi-objective Neural Architecture Search via Lamarckian Evolution
T. Elsken
J. H. Metzen
Frank Hutter
128
498
0
24 Apr 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
9
395
0
24 Apr 2018
MQGrad: Reinforcement Learning of Gradient Quantization in Parameter
  Server
MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server
Guoxin Cui
Jun Xu
Wei Zeng
Yanyan Lan
J. Guo
Xueqi Cheng
MQ
6
13
0
22 Apr 2018
MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification
  on Mobile Devices
MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification on Mobile Devices
Sheng Chen
Yang Liu
Xiang Gao
Zhen Han
CVBM
3DH
16
557
0
20 Apr 2018
Minimizing Area and Energy of Deep Learning Hardware Design Using
  Collective Low Precision and Structured Compression
Minimizing Area and Energy of Deep Learning Hardware Design Using Collective Low Precision and Structured Compression
Shihui Yin
Gaurav Srivastava
S. Venkataramanaiah
C. Chakrabarti
Visar Berisha
Jae-sun Seo
9
8
0
19 Apr 2018
Pelee: A Real-Time Object Detection System on Mobile Devices
Pelee: A Real-Time Object Detection System on Mobile Devices
R. Wang
Xiang Li
Charles X. Ling
ObjD
9
454
0
18 Apr 2018
Deep Face Recognition: A Survey
Deep Face Recognition: A Survey
Mei Wang
Weihong Deng
NoLa
25
1,211
0
18 Apr 2018
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight
  Repetition
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition
Kartik Hegde
Jiyong Yu
R. Agrawal
Mengjia Yan
Michael Pellauer
Christopher W. Fletcher
14
165
0
18 Apr 2018
Training a Binary Weight Object Detector by Knowledge Transfer for
  Autonomous Driving
Training a Binary Weight Object Detector by Knowledge Transfer for Autonomous Driving
Jiaolong Xu
Peng Wang
Hengzhang Yang
Antonio M. López
MQ
24
23
0
17 Apr 2018
IGCV$2$: Interleaved Structured Sparse Convolutional Neural Networks
IGCV222: Interleaved Structured Sparse Convolutional Neural Networks
Guotian Xie
Jingdong Wang
Ting Zhang
Jianhuang Lai
Richang Hong
Guo-Jun Qi
13
104
0
17 Apr 2018
Non-Vacuous Generalization Bounds at the ImageNet Scale: A PAC-Bayesian
  Compression Approach
Non-Vacuous Generalization Bounds at the ImageNet Scale: A PAC-Bayesian Compression Approach
Wenda Zhou
Victor Veitch
Morgane Austern
Ryan P. Adams
Peter Orbanz
27
209
0
16 Apr 2018
Fast inference of deep neural networks in FPGAs for particle physics
Fast inference of deep neural networks in FPGAs for particle physics
Javier Mauricio Duarte
Song Han
Philip C. Harris
S. Jindariani
E. Kreinar
...
J. Ngadiuba
M. Pierini
R. Rivera
N. Tran
Zhenbin Wu
AI4CE
75
386
0
16 Apr 2018
Data-Dependent Coresets for Compressing Neural Networks with
  Applications to Generalization Bounds
Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds
Cenk Baykal
Lucas Liebenwein
Igor Gilitschenski
Dan Feldman
Daniela Rus
10
79
0
15 Apr 2018
Select, Attend, and Transfer: Light, Learnable Skip Connections
Select, Attend, and Transfer: Light, Learnable Skip Connections
Saeid Asgari Taghanaki
A. Bentaieb
Anmol Sharma
S. Kevin Zhou
Yefeng Zheng
...
Puneet Sharma
Sasa Grbic
Zhoubing Xu
D. Comaniciu
Ghassan Hamarneh
28
20
0
14 Apr 2018
Pieces of Eight: 8-bit Neural Machine Translation
Pieces of Eight: 8-bit Neural Machine Translation
Jerry Quinn
Miguel Ballesteros
MQ
6
25
0
13 Apr 2018
The unreasonable effectiveness of the forget gate
The unreasonable effectiveness of the forget gate
J. Westhuizen
Joan Lasenby
14
86
0
13 Apr 2018
A Compact Network Learning Model for Distribution Regression
A Compact Network Learning Model for Distribution Regression
C. Kou
H. Lee
Teck Khim Ng
18
10
0
13 Apr 2018
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Ameya Prabhu
Vishal Batchu
Rohit Gajawada
Sri Aurobindo Munagala
A. Namboodiri
MQ
25
18
0
11 Apr 2018
Crafting a Toolchain for Image Restoration by Deep Reinforcement
  Learning
Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning
K. Yu
Chao Dong
Liang Lin
Chen Change Loy
CLL
OffRL
19
174
0
10 Apr 2018
A Systematic DNN Weight Pruning Framework using Alternating Direction
  Method of Multipliers
A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers
Tianyun Zhang
Shaokai Ye
Kaiqi Zhang
Jian Tang
Wujie Wen
M. Fardad
Yanzhi Wang
13
434
0
10 Apr 2018
Previous
123...606162...676869
Next