ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXivPDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown
Title
StressedNets: Efficient Feature Representations via Stress-induced
  Evolutionary Synthesis of Deep Neural Networks
StressedNets: Efficient Feature Representations via Stress-induced Evolutionary Synthesis of Deep Neural Networks
M. Shafiee
Brendan Chwyl
Francis Li
Rongyan Chen
Michelle Karg
C. Scharfenberger
A. Wong
6
7
0
16 Jan 2018
Deep Net Triage: Analyzing the Importance of Network Layers via
  Structural Compression
Deep Net Triage: Analyzing the Importance of Network Layers via Structural Compression
Theodore S. Nowak
Jason J. Corso
FAtt
16
3
0
15 Jan 2018
Fix your classifier: the marginal value of training the last weight
  layer
Fix your classifier: the marginal value of training the last weight layer
Elad Hoffer
Itay Hubara
Daniel Soudry
19
101
0
14 Jan 2018
Focus: Querying Large Video Datasets with Low Latency and Low Cost
Focus: Querying Large Video Datasets with Low Latency and Low Cost
Kevin Hsieh
Ganesh Ananthanarayanan
P. Bodík
P. Bahl
Matthai Philipose
Phillip B. Gibbons
O. Mutlu
14
275
0
10 Jan 2018
Learning $3$D-FilterMap for Deep Convolutional Neural Networks
Learning 333D-FilterMap for Deep Convolutional Neural Networks
Yingzhen Yang
Jianchao Yang
N. Xu
Wei Han
3DV
MQ
13
1
0
05 Jan 2018
Overcoming catastrophic forgetting with hard attention to the task
Overcoming catastrophic forgetting with hard attention to the task
Joan Serra
Dídac Surís
M. Miron
Alexandros Karatzoglou
CLL
27
1,048
0
04 Jan 2018
Learning a Wavelet-like Auto-Encoder to Accelerate Deep Neural Networks
Learning a Wavelet-like Auto-Encoder to Accelerate Deep Neural Networks
Tianshui Chen
Liang Lin
W. Zuo
Xiaonan Luo
Lei Zhang
8
56
0
20 Dec 2017
DeepPicar: A Low-cost Deep Neural Network-based Autonomous Car
DeepPicar: A Low-cost Deep Neural Network-based Autonomous Car
Michael Bechtel
Elise McEllhiney
Minje Kim
H. Yun
22
103
0
19 Dec 2017
Squeezed Convolutional Variational AutoEncoder for Unsupervised Anomaly
  Detection in Edge Device Industrial Internet of Things
Squeezed Convolutional Variational AutoEncoder for Unsupervised Anomaly Detection in Edge Device Industrial Internet of Things
Dohyung Kim
Hyochang Yang
Minki Chung
Sungzoon Cho
DRL
9
32
0
18 Dec 2017
Automated flow for compressing convolution neural networks for efficient
  edge-computation with FPGA
Automated flow for compressing convolution neural networks for efficient edge-computation with FPGA
F. Shafiq
Takato Yamada
Antonio T. Vilchez
Sakyasingha Dasgupta
MQ
16
3
0
18 Dec 2017
clcNet: Improving the Efficiency of Convolutional Neural Network using
  Channel Local Convolutions
clcNet: Improving the Efficiency of Convolutional Neural Network using Channel Local Convolutions
Dong-Qing Zhang
6
10
0
17 Dec 2017
Quantization and Training of Neural Networks for Efficient
  Integer-Arithmetic-Only Inference
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
16
3,043
0
15 Dec 2017
BT-Nets: Simplifying Deep Neural Networks via Block Term Decomposition
BT-Nets: Simplifying Deep Neural Networks via Block Term Decomposition
Guangxi Li
Jinmian Ye
Haiqin Yang
Di Chen
Shuicheng Yan
Zenglin Xu
14
11
0
15 Dec 2017
FFT-Based Deep Learning Deployment in Embedded Systems
FFT-Based Deep Learning Deployment in Embedded Systems
Sheng Lin
Ning Liu
M. Nazemi
Hongjia Li
Caiwen Ding
Yanzhi Wang
Massoud Pedram
25
52
0
13 Dec 2017
NestedNet: Learning Nested Sparse Structures in Deep Neural Networks
NestedNet: Learning Nested Sparse Structures in Deep Neural Networks
Eunwoo Kim
Chanho Ahn
Songhwai Oh
16
2
0
11 Dec 2017
AdaComp : Adaptive Residual Gradient Compression for Data-Parallel
  Distributed Training
AdaComp : Adaptive Residual Gradient Compression for Data-Parallel Distributed Training
Chia-Yu Chen
Jungwook Choi
D. Brand
A. Agrawal
Wei Zhang
K. Gopalakrishnan
ODL
14
172
0
07 Dec 2017
Automated Pruning for Deep Neural Network Compression
Automated Pruning for Deep Neural Network Compression
Franco Manessi
A. Rozza
Simone Bianco
Paolo Napoletano
Raimondo Schettini
20
56
0
05 Dec 2017
Learning Sparse Neural Networks through $L_0$ Regularization
Learning Sparse Neural Networks through L0L_0L0​ Regularization
Christos Louizos
Max Welling
Diederik P. Kingma
14
1,128
0
04 Dec 2017
Adaptive Quantization for Deep Neural Network
Adaptive Quantization for Deep Neural Network
Yiren Zhou
Seyed-Mohsen Moosavi-Dezfooli
Ngai-man Cheung
P. Frossard
MQ
14
177
0
04 Dec 2017
Homomorphic Parameter Compression for Distributed Deep Learning Training
Homomorphic Parameter Compression for Distributed Deep Learning Training
Jaehee Jang
Byunggook Na
Sungroh Yoon
FedML
22
1
0
28 Nov 2017
WSNet: Compact and Efficient Networks Through Weight Sampling
WSNet: Compact and Efficient Networks Through Weight Sampling
Xiaojie Jin
Yingzhen Yang
N. Xu
Jianchao Yang
Nebojsa Jojic
Jiashi Feng
Shuicheng Yan
19
2
0
28 Nov 2017
Slim Embedding Layers for Recurrent Neural Language Models
Slim Embedding Layers for Recurrent Neural Language Models
Zhongliang Li
Raymond Kulhanek
Shaojun Wang
Yunxin Zhao
Shuang Wu
KELM
19
23
0
27 Nov 2017
SkipNet: Learning Dynamic Routing in Convolutional Networks
SkipNet: Learning Dynamic Routing in Convolutional Networks
Xin Wang
F. I. F. Richard Yu
Zi-Yi Dou
Trevor Darrell
Joseph E. Gonzalez
11
624
0
26 Nov 2017
CondenseNet: An Efficient DenseNet using Learned Group Convolutions
CondenseNet: An Efficient DenseNet using Learned Group Convolutions
Gao Huang
Shichen Liu
L. V. D. van der Maaten
Kilian Q. Weinberger
28
795
0
25 Nov 2017
Deep Expander Networks: Efficient Deep Networks from Graph Theory
Deep Expander Networks: Efficient Deep Networks from Graph Theory
Ameya Prabhu
G. Varma
A. Namboodiri
GNN
21
70
0
23 Nov 2017
BlockDrop: Dynamic Inference Paths in Residual Networks
BlockDrop: Dynamic Inference Paths in Residual Networks
Zuxuan Wu
Tushar Nagarajan
Abhishek Kumar
Steven J. Rennie
L. Davis
Kristen Grauman
Rogerio Feris
15
461
0
22 Nov 2017
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions
Bichen Wu
Alvin Wan
Xiangyu Yue
Peter H. Jin
Sicheng Zhao
Noah Golmant
A. Gholaminejad
Joseph E. Gonzalez
Kurt Keutzer
3DPC
18
363
0
22 Nov 2017
Evaluating Robustness of Neural Networks with Mixed Integer Programming
Evaluating Robustness of Neural Networks with Mixed Integer Programming
Vincent Tjeng
Kai Y. Xiao
Russ Tedrake
AAML
41
117
0
20 Nov 2017
Interleaver Design for Deep Neural Networks
Interleaver Design for Deep Neural Networks
Sourya Dey
P. Beerel
K. Chugg
13
6
0
18 Nov 2017
Training Simplification and Model Simplification for Deep Learning: A
  Minimal Effort Back Propagation Method
Training Simplification and Model Simplification for Deep Learning: A Minimal Effort Back Propagation Method
Xu Sun
Xuancheng Ren
Shuming Ma
Bingzhen Wei
Wei Li
Jingjing Xu
Houfeng Wang
Yi Zhang
23
25
0
17 Nov 2017
Improved Bayesian Compression
Improved Bayesian Compression
Marco Federici
Karen Ullrich
Max Welling
UQCV
BDL
17
19
0
17 Nov 2017
Mobile Video Object Detection with Temporally-Aware Feature Maps
Mobile Video Object Detection with Temporally-Aware Feature Maps
Mason Liu
Menglong Zhu
ObjD
16
196
0
17 Nov 2017
NISP: Pruning Networks using Neuron Importance Score Propagation
NISP: Pruning Networks using Neuron Importance Score Propagation
Ruichi Yu
Ang Li
Chun-Fu Chen
Jui-Hsin Lai
Vlad I. Morariu
Xintong Han
M. Gao
Ching-Yung Lin
L. Davis
24
795
0
16 Nov 2017
Apprentice: Using Knowledge Distillation Techniques To Improve
  Low-Precision Network Accuracy
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy
Asit K. Mishra
Debbie Marr
FedML
16
330
0
15 Nov 2017
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with
  A Neural Network Compiler
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler
Yu Ji
Youhui Zhang
Wenguang Chen
Yuan Xie
19
56
0
15 Nov 2017
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning
Arun Mallya
Svetlana Lazebnik
CLL
13
1,269
0
15 Nov 2017
Deep Rewiring: Training very sparse deep networks
Deep Rewiring: Training very sparse deep networks
G. Bellec
David Kappel
Wolfgang Maass
R. Legenstein
BDL
22
273
0
14 Nov 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural
  Networks with Reduced Numerical Precision Weights and Activation
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation
Moritz B. Milde
Daniel Neil
Alessandro Aimar
T. Delbruck
Giacomo Indiveri
MQ
22
9
0
13 Nov 2017
Weightless: Lossy Weight Encoding For Deep Neural Network Compression
Weightless: Lossy Weight Encoding For Deep Neural Network Compression
Brandon Reagen
Udit Gupta
Bob Adolf
Michael Mitzenmacher
Alexander M. Rush
Gu-Yeon Wei
David Brooks
19
38
0
13 Nov 2017
CT-SRCNN: Cascade Trained and Trimmed Deep Convolutional Neural Networks
  for Image Super Resolution
CT-SRCNN: Cascade Trained and Trimmed Deep Convolutional Neural Networks for Image Super Resolution
Haoyu Ren
Mostafa El-Khamy
Jungwon Lee
SupR
33
27
0
11 Nov 2017
Learning K-way D-dimensional Discrete Code For Compact Embedding
  Representations
Learning K-way D-dimensional Discrete Code For Compact Embedding Representations
Ting Chen
Martin Renqiang Min
Yizhou Sun
11
10
0
08 Nov 2017
Revealing structure components of the retina by deep learning networks
Revealing structure components of the retina by deep learning networks
Qianyu Yan
Zhaofei Yu
Feng Chen
Jian K. Liu
FAtt
8
7
0
08 Nov 2017
Block-Sparse Recurrent Neural Networks
Block-Sparse Recurrent Neural Networks
Sharan Narang
Eric Undersander
G. Diamos
9
136
0
08 Nov 2017
Compression-aware Training of Deep Networks
Compression-aware Training of Deep Networks
J. Álvarez
Mathieu Salzmann
8
172
0
07 Nov 2017
Moonshine: Distilling with Cheap Convolutions
Moonshine: Distilling with Cheap Convolutions
Elliot J. Crowley
Gavia Gray
Amos Storkey
19
120
0
07 Nov 2017
Interpreting Convolutional Neural Networks Through Compression
Interpreting Convolutional Neural Networks Through Compression
R. Abbasi-Asl
Bin-Xia Yu
FAtt
11
21
0
07 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate
  Deep Neural Networks
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks
Sanchari Sen
Shubham Jain
Swagath Venkataramani
A. Raghunathan
16
30
0
07 Nov 2017
Characterizing Sparse Connectivity Patterns in Neural Networks
Characterizing Sparse Connectivity Patterns in Neural Networks
Sourya Dey
Kuan-Wen Huang
P. Beerel
K. Chugg
22
11
0
06 Nov 2017
Neural Speed Reading via Skim-RNN
Neural Speed Reading via Skim-RNN
Minjoon Seo
Sewon Min
Ali Farhadi
Hannaneh Hajishirzi
29
79
0
06 Nov 2017
Accelerating Training of Deep Neural Networks via Sparse Edge Processing
Accelerating Training of Deep Neural Networks via Sparse Edge Processing
Sourya Dey
Yinan Shao
K. Chugg
P. Beerel
22
16
0
03 Nov 2017
Previous
123...636465...676869
Next