ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXivPDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown
Title
Accurate Optical Flow via Direct Cost Volume Processing
Accurate Optical Flow via Direct Cost Volume Processing
Jia Xu
René Ranftl
V. Koltun
11
237
0
24 Apr 2017
A Review on Deep Learning Techniques Applied to Semantic Segmentation
A Review on Deep Learning Techniques Applied to Semantic Segmentation
Alberto Garcia-Garcia
Sergio Orts
Sergiu Oprea
Victor Villena-Martinez
Jose Garcia-Rodriguez
3DV
SSeg
28
1,267
0
22 Apr 2017
Exploring Sparsity in Recurrent Neural Networks
Exploring Sparsity in Recurrent Neural Networks
Sharan Narang
Erich Elsen
G. Diamos
Shubho Sengupta
8
308
0
17 Apr 2017
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision
  Applications
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
948
20,549
0
17 Apr 2017
Enabling Embedded Inference Engine with ARM Compute Library: A Case
  Study
Enabling Embedded Inference Engine with ARM Compute Library: A Case Study
Dawei Sun
Shaoshan Liu
J. Gaudiot
6
13
0
12 Apr 2017
DyVEDeep: Dynamic Variable Effort Deep Neural Networks
DyVEDeep: Dynamic Variable Effort Deep Neural Networks
Sanjay Ganapathy
Swagath Venkataramani
Balaraman Ravindran
A. Raghunathan
17
8
0
04 Apr 2017
Soft-to-Hard Vector Quantization for End-to-End Learning Compressible
  Representations
Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations
E. Agustsson
Fabian Mentzer
Michael Tschannen
Lukas Cavigelli
Radu Timofte
Luca Benini
Luc Van Gool
MQ
13
479
0
03 Apr 2017
Multi-Scale Dense Networks for Resource Efficient Image Classification
Multi-Scale Dense Networks for Resource Efficient Image Classification
Gao Huang
Danlu Chen
Tianhong Li
Felix Wu
L. V. D. van der Maaten
Kilian Q. Weinberger
VLM
14
137
0
29 Mar 2017
Coordinating Filters for Faster Deep Neural Networks
Coordinating Filters for Faster Deep Neural Networks
W. Wen
Cong Xu
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
11
138
0
28 Mar 2017
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
Vivienne Sze
Yu-hsin Chen
Tien-Ju Yang
J. Emer
AAML
3DV
19
2,988
0
27 Mar 2017
More is Less: A More Complicated Network with Less Inference Complexity
More is Less: A More Complicated Network with Less Inference Complexity
Xuanyi Dong
Junshi Huang
Yi Yang
Shuicheng Yan
8
287
0
25 Mar 2017
Quality Resilient Deep Neural Networks
Quality Resilient Deep Neural Networks
Samuel F. Dodge
Lina Karam
OOD
13
46
0
23 Mar 2017
A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification
  and Domain Adaptation
A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation
Chunpeng Wu
W. Wen
Tariq Afzal
Yongmei Zhang
Yiran Chen
Hai Helen Li
22
45
0
12 Mar 2017
Deep Convolutional Neural Network Inference with Floating-point Weights
  and Fixed-point Activations
Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations
Liangzhen Lai
Naveen Suda
Vikas Chandra
MQ
20
85
0
08 Mar 2017
NoScope: Optimizing Neural Network Queries over Video at Scale
NoScope: Optimizing Neural Network Queries over Video at Scale
Daniel Kang
John Emmons
Firas Abuzaid
Peter Bailis
Matei A. Zaharia
13
203
0
07 Mar 2017
Theoretical Properties for Neural Networks with Weight Matrices of Low
  Displacement Rank
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank
Liang Zhao
Siyu Liao
Yanzhi Wang
Zhe Li
Jian Tang
Victor Pan
Bo Yuan
23
61
0
01 Mar 2017
ShaResNet: reducing residual network parameter number by sharing weights
ShaResNet: reducing residual network parameter number by sharing weights
Alexandre Boulch
16
26
0
28 Feb 2017
Billion-scale similarity search with GPUs
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
6
3,635
0
28 Feb 2017
Memory-Efficient Global Refinement of Decision-Tree Ensembles and its
  Application to Face Alignment
Memory-Efficient Global Refinement of Decision-Tree Ensembles and its Application to Face Alignment
Nenad Markuš
Ivan Gogić
Igor S. Pandzic
Jörgen Ahlberg
CVBM
14
1
0
27 Feb 2017
Adaptive Ensemble Prediction for Deep Neural Networks based on
  Confidence Level
Adaptive Ensemble Prediction for Deep Neural Networks based on Confidence Level
H. Inoue
UQCV
FedML
11
1
0
27 Feb 2017
Low-Precision Batch-Normalized Activations
Low-Precision Batch-Normalized Activations
Benjamin Graham
MQ
11
9
0
27 Feb 2017
Fixed-point optimization of deep neural networks with adaptive step size
  retraining
Fixed-point optimization of deep neural networks with adaptive step size retraining
Sungho Shin
Yoonho Boo
Wonyong Sung
MQ
16
34
0
27 Feb 2017
Building Fast and Compact Convolutional Neural Networks for Offline
  Handwritten Chinese Character Recognition
Building Fast and Compact Convolutional Neural Networks for Offline Handwritten Chinese Character Recognition
Xuefeng Xiao
Lianwen Jin
Yafeng Yang
Weixin Yang
Jun Sun
Tianhai Chang
11
153
0
26 Feb 2017
Tuple-oriented Compression for Large-scale Mini-batch Stochastic
  Gradient Descent
Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent
Fengan Li
Lingjiao Chen
Yijing Zeng
Arun Kumar
Jeffrey F. Naughton
J. Patel
Xi Wu
21
19
0
22 Feb 2017
The Power of Sparsity in Convolutional Neural Networks
The Power of Sparsity in Convolutional Neural Networks
Soravit Changpinyo
Mark Sandler
A. Zhmoginov
6
132
0
21 Feb 2017
Soft Weight-Sharing for Neural Network Compression
Soft Weight-Sharing for Neural Network Compression
Karen Ullrich
Edward Meeds
Max Welling
18
411
0
13 Feb 2017
Incremental Network Quantization: Towards Lossless CNNs with
  Low-Precision Weights
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
311
1,047
0
10 Feb 2017
Exploiting Domain Knowledge via Grouped Weight Sharing with Application
  to Text Categorization
Exploiting Domain Knowledge via Grouped Weight Sharing with Application to Text Categorization
Ye Zhang
Matthew Lease
Byron C. Wallace
8
15
0
08 Feb 2017
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Zhaowei Cai
Xiaodong He
Jian Sun
Nuno Vasconcelos
MQ
17
502
0
03 Feb 2017
Deep Reinforcement Learning: An Overview
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
78
1,502
0
25 Jan 2017
Variational Dropout Sparsifies Deep Neural Networks
Variational Dropout Sparsifies Deep Neural Networks
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
6
819
0
19 Jan 2017
Compression of Deep Neural Networks for Image Instance Retrieval
Compression of Deep Neural Networks for Image Instance Retrieval
V. Chandrasekhar
Jie Lin
Q. Liao
Olivier Morère
D. Shapiro
Lingyu Duan
Tomaso Poggio
14
25
0
18 Jan 2017
The Incredible Shrinking Neural Network: New Perspectives on Learning
  Representations Through The Lens of Pruning
The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning
Aditya Sharma
Nikolas Wolfe
Bhiksha Raj
13
18
0
16 Jan 2017
Embedding Watermarks into Deep Neural Networks
Embedding Watermarks into Deep Neural Networks
Yusuke Uchida
Yuki Nagai
S. Sakazawa
Shiníchi Satoh
35
597
0
15 Jan 2017
Scaling Binarized Neural Networks on Reconfigurable Logic
Scaling Binarized Neural Networks on Reconfigurable Logic
Nicholas J. Fraser
Yaman Umuroglu
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
Magnus Jahre
K. Vissers
MQ
8
57
0
12 Jan 2017
QuickNet: Maximizing Efficiency and Efficacy in Deep Architectures
QuickNet: Maximizing Efficiency and Efficacy in Deep Architectures
Tapabrata Ghosh
16
6
0
09 Jan 2017
Hardware for Machine Learning: Challenges and Opportunities
Hardware for Machine Learning: Challenges and Opportunities
Vivienne Sze
Yu-hsin Chen
Joel S. Einer
Amr Suleiman
Zhengdong Zhang
14
77
0
22 Dec 2016
Wide-Slice Residual Networks for Food Recognition
Wide-Slice Residual Networks for Food Recognition
N. Martinel
G. Foresti
C. Micheloni
20
200
0
20 Dec 2016
Exploring the Design Space of Deep Convolutional Neural Networks at
  Large Scale
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
16
18
0
20 Dec 2016
Quantization and Training of Low Bit-Width Convolutional Neural Networks
  for Object Detection
Quantization and Training of Low Bit-Width Convolutional Neural Networks for Object Detection
Penghang Yin
Shuai Zhang
Y. Qi
Jack Xin
MQ
49
41
0
19 Dec 2016
Delta Networks for Optimized Recurrent Network Computation
Delta Networks for Optimized Recurrent Network Computation
Daniel Neil
Junhaeng Lee
T. Delbruck
Shih-Chii Liu
31
66
0
16 Dec 2016
FastText.zip: Compressing text classification models
FastText.zip: Compressing text classification models
Armand Joulin
Edouard Grave
Piotr Bojanowski
Matthijs Douze
Hervé Jégou
Tomáš Mikolov
MQ
11
1,187
0
12 Dec 2016
Learning in the Machine: Random Backpropagation and the Deep Learning
  Channel
Learning in the Machine: Random Backpropagation and the Deep Learning Channel
Pierre Baldi
Peter Sadowski
Zhiqin Lu
AAML
10
16
0
08 Dec 2016
Filter sharing: Efficient learning of parameters for volumetric
  convolutions
Filter sharing: Efficient learning of parameters for volumetric convolutions
Rahul Venkataramani
S. Thiruvenkadam
Prasad Sudhakar
Hariharan Ravishankar
V. Vaidya
3DPC
MedIm
19
0
0
08 Dec 2016
Spatially Adaptive Computation Time for Residual Networks
Spatially Adaptive Computation Time for Residual Networks
Michael Figurnov
Maxwell D. Collins
Yukun Zhu
Li Zhang
Jonathan Huang
Dmitry Vetrov
Ruslan Salakhutdinov
19
346
0
07 Dec 2016
Towards the Limit of Network Quantization
Towards the Limit of Network Quantization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
14
191
0
05 Dec 2016
Trained Ternary Quantization
Trained Ternary Quantization
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
MQ
19
1,035
0
04 Dec 2016
FINN: A Framework for Fast, Scalable Binarized Neural Network Inference
FINN: A Framework for Fast, Scalable Binarized Neural Network Inference
Yaman Umuroglu
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
Magnus Jahre
K. Vissers
MQ
19
975
0
01 Dec 2016
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Song Han
Junlong Kang
Huizi Mao
Yiming Hu
Xin Li
...
Hong Luo
Song Yao
Yu Wang
Huazhong Yang
W. Dally
23
627
0
01 Dec 2016
Effective Quantization Methods for Recurrent Neural Networks
Effective Quantization Methods for Recurrent Neural Networks
Qinyao He
He Wen
Shuchang Zhou
Yuxin Wu
Cong Yao
Xinyu Zhou
Yuheng Zou
MQ
16
75
0
30 Nov 2016
Previous
123...66676869
Next