ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXivPDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,434 papers shown
Title
ReBNet: Residual Binarized Neural Network
ReBNet: Residual Binarized Neural Network
M. Ghasemzadeh
Mohammad Samragh
F. Koushanfar
MQ
22
4
0
03 Nov 2017
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting
  Input and Output Sparsity
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting Input and Output Sparsity
Jingyang Zhu
Jingbo Jiang
Xizi Chen
Chi-Ying Tsui
18
36
0
03 Nov 2017
Compressing Word Embeddings via Deep Compositional Code Learning
Compressing Word Embeddings via Deep Compositional Code Learning
Raphael Shu
Hideki Nakayama
21
129
0
03 Nov 2017
Efficient Inferencing of Compressed Deep Neural Networks
Efficient Inferencing of Compressed Deep Neural Networks
Dharma Teja Vooturi
Saurabh Goyal
Anamitra R. Choudhury
Yogish Sabharwal
Ashish Verma
16
6
0
01 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks
Towards Effective Low-bitwidth Convolutional Neural Networks
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
23
231
0
01 Nov 2017
Tensorizing Generative Adversarial Nets
Tensorizing Generative Adversarial Nets
Xingwei Cao
Xuyang Zhao
Qibin Zhao
GAN
17
9
0
30 Oct 2017
Knowledge Projection for Deep Neural Networks
Knowledge Projection for Deep Neural Networks
Zhi Zhang
G. Ning
Zhihai He
28
15
0
26 Oct 2017
Trace norm regularization and faster inference for embedded speech
  recognition RNNs
Trace norm regularization and faster inference for embedded speech recognition RNNs
Markus Kliegl
Siddharth Goyal
Kexin Zhao
Kavya Srinet
M. Shoeybi
15
8
0
25 Oct 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks
A Survey of Model Compression and Acceleration for Deep Neural Networks
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
15
1,085
0
23 Oct 2017
Learning Discrete Weights Using the Local Reparameterization Trick
Learning Discrete Weights Using the Local Reparameterization Trick
Oran Shayer
Dan Levi
Ethan Fetaya
11
88
0
21 Oct 2017
Data-Free Knowledge Distillation for Deep Neural Networks
Data-Free Knowledge Distillation for Deep Neural Networks
Raphael Gontijo-Lopes
Stefano Fenu
Thad Starner
12
270
0
19 Oct 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
D. Loroch
Norbert Wehn
Franz-Josef Pfreundt
J. Keuper
MQ
25
23
0
13 Oct 2017
STDP Based Pruning of Connections and Weight Quantization in Spiking
  Neural Networks for Energy Efficient Recognition
STDP Based Pruning of Connections and Weight Quantization in Spiking Neural Networks for Energy Efficient Recognition
Nitin Rathi
Priyadarshini Panda
Kaushik Roy
16
111
0
12 Oct 2017
Energy-efficient Amortized Inference with Cascaded Deep Classifiers
Energy-efficient Amortized Inference with Cascaded Deep Classifiers
Jiaqi Guan
Yang Liu
Qiang Liu
Jian-wei Peng
14
33
0
10 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks:
  A Tutorial
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial
Mingzhe Chen
Ursula Challita
Walid Saad
Changchuan Yin
Mérouane Debbah
15
207
0
09 Oct 2017
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with
  Small Deep-Neural-Network Architectures
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with Small Deep-Neural-Network Architectures
F. Iandola
Kurt Keutzer
15
37
0
07 Oct 2017
To prune, or not to prune: exploring the efficacy of pruning for model
  compression
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu
Suyog Gupta
37
1,248
0
05 Oct 2017
Improving Efficiency in Convolutional Neural Network with Multilinear
  Filters
Improving Efficiency in Convolutional Neural Network with Multilinear Filters
D. Tran
Alexandros Iosifidis
M. Gabbouj
16
40
0
28 Sep 2017
Connectivity Learning in Multi-Branch Networks
Connectivity Learning in Multi-Branch Networks
Karim Ahmed
Lorenzo Torresani
16
26
0
27 Sep 2017
Machine Learning Models that Remember Too Much
Machine Learning Models that Remember Too Much
Congzheng Song
Thomas Ristenpart
Vitaly Shmatikov
VLM
16
502
0
22 Sep 2017
Computation Error Analysis of Block Floating Point Arithmetic Oriented
  Convolution Neural Network Accelerator Design
Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design
Zhourui Song
Zhenyu Liu
Dongsheng Wang
13
41
0
22 Sep 2017
Structured Probabilistic Pruning for Convolutional Neural Network
  Acceleration
Structured Probabilistic Pruning for Convolutional Neural Network Acceleration
Huan Wang
Qiming Zhang
Yuehai Wang
Roland Hu
13
11
0
20 Sep 2017
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced
  Regularization in Ternary Networks
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks
Julian Faraone
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
MQ
UQCV
13
12
0
19 Sep 2017
N2N Learning: Network to Network Compression via Policy Gradient
  Reinforcement Learning
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning
A. Ashok
Nicholas Rhinehart
Fares N. Beainy
Kris M. Kitani
16
169
0
18 Sep 2017
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage
  Requirement
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage Requirement
Tianchan Guan
Xiaoyang Zeng
Mingoo Seok
MQ
20
6
0
15 Sep 2017
A Streaming Accelerator for Deep Convolutional Neural Networks with
  Image and Feature Decomposition for Resource-limited System Applications
A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications
Yuan Du
Li Du
Yilei Li
Junjie Su
Mau-Chung Frank Chang
14
6
0
15 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory
Learning Intrinsic Sparse Structures within Long Short-Term Memory
W. Wen
Yuxiong He
Samyam Rajbhandari
Minjia Zhang
Wenhan Wang
Fang Liu
Bin Hu
Yiran Chen
H. Li
MQ
21
140
0
15 Sep 2017
Supervising Unsupervised Learning
Supervising Unsupervised Learning
Vikas K. Garg
Adam Kalai
SSL
FedML
16
29
0
14 Sep 2017
Binary-decomposed DCNN for accelerating computation and compressing
  model without retraining
Binary-decomposed DCNN for accelerating computation and compressing model without retraining
Ryuji Kamiya
Takayoshi Yamashita
Mitsuru Ambai
Ikuro Sato
Yuji Yamauchi
H. Fujiyoshi
MQ
12
4
0
14 Sep 2017
Flexible Network Binarization with Layer-wise Priority
He Wang
Yi Tian Xu
Bingbing Ni
Hongteng Xu
MQ
23
10
0
13 Sep 2017
Model Distillation with Knowledge Transfer from Face Classification to
  Alignment and Verification
Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification
Chong-Jun Wang
Xipeng Lan
Yang Zhang
CVBM
15
26
0
09 Sep 2017
Real-time convolutional networks for sonar image classification in
  low-power embedded systems
Real-time convolutional networks for sonar image classification in low-power embedded systems
Matias Valdenegro-Toro
23
10
0
07 Sep 2017
The Mating Rituals of Deep Neural Networks: Learning Compact Feature
  Representations through Sexual Evolutionary Synthesis
The Mating Rituals of Deep Neural Networks: Learning Compact Feature Representations through Sexual Evolutionary Synthesis
A. Chung
M. Shafiee
Paul Fieguth
A. Wong
10
4
0
07 Sep 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
11
1,109
0
06 Sep 2017
Domain-adaptive deep network compression
Domain-adaptive deep network compression
Marc Masana
Joost van de Weijer
Luis Herranz
Andrew D. Bagdanov
J. Álvarez
36
62
0
04 Sep 2017
Fast Image Processing with Fully-Convolutional Networks
Fast Image Processing with Fully-Convolutional Networks
Qifeng Chen
Jia Xu
V. Koltun
10
322
0
02 Sep 2017
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network
  Simulation Expansion and STDP Convergence Predictions
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network Simulation Expansion and STDP Convergence Predictions
Toby Lightheart
S. Grainger
Tien-Fu Lu
8
0
0
30 Aug 2017
Performance Guaranteed Network Acceleration via High-Order Residual
  Quantization
Performance Guaranteed Network Acceleration via High-Order Residual Quantization
Zefan Li
Bingbing Ni
Wenjun Zhang
Xiaokang Yang
Wen Gao
MQ
16
105
0
29 Aug 2017
CirCNN: Accelerating and Compressing Deep Neural Networks Using
  Block-CirculantWeight Matrices
CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices
Caiwen Ding
Siyu Liao
Yanzhi Wang
Zhe Li
Ning Liu
...
Yipeng Zhang
Jian Tang
Qinru Qiu
X. Lin
Bo Yuan
GNN
19
258
0
29 Aug 2017
Deep Learning Sparse Ternary Projections for Compressed Sensing of
  Images
Deep Learning Sparse Ternary Projections for Compressed Sensing of Images
Duc Minh Nguyen
Evaggelia Tsiligianni
Nikos Deligiannis
13
26
0
28 Aug 2017
The Convergence of Machine Learning and Communications
The Convergence of Machine Learning and Communications
Wojciech Samek
S. Stańczak
Thomas Wiegand
AI4CE
24
29
0
28 Aug 2017
Learning Efficient Convolutional Networks through Network Slimming
Learning Efficient Convolutional Networks through Network Slimming
Zhuang Liu
Jianguo Li
Zhiqiang Shen
Gao Huang
Shoumeng Yan
Changshui Zhang
24
2,383
0
22 Aug 2017
Neural Networks Compression for Language Modeling
Neural Networks Compression for Language Modeling
Artem M. Grachev
D. Ignatov
Andrey V. Savchenko
14
30
0
20 Aug 2017
Deep Neural Network Capacity
Aosen Wang
Huan Zhou
Wenyao Xu
Xin Chen
11
4
0
16 Aug 2017
BitNet: Bit-Regularized Deep Neural Networks
BitNet: Bit-Regularized Deep Neural Networks
Aswin Raghavan
Mohamed R. Amer
S. Chai
Graham Taylor
MQ
22
10
0
16 Aug 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile
  Devices
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices
Dawei Li
Xiaolong Wang
Deguang Kong
15
97
0
16 Aug 2017
Enabling Massive Deep Neural Networks with the GraphBLAS
Enabling Massive Deep Neural Networks with the GraphBLAS
J. Kepner
Manoj Kumar
José Moreira
P. Pattnaik
M. Serrano
H. Tufo
GNN
14
33
0
09 Aug 2017
Prune the Convolutional Neural Networks with Sparse Shrink
Prune the Convolutional Neural Networks with Sparse Shrink
X. Li
Changsong Liu
CVBM
11
4
0
08 Aug 2017
Natural Language Processing with Small Feed-Forward Networks
Natural Language Processing with Small Feed-Forward Networks
Jan A. Botha
Emily Pitler
Ji Ma
A. Bakalov
Alexandru Salcianu
David J. Weiss
Ryan T. McDonald
Slav Petrov
HAI
17
38
0
01 Aug 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an
  FPGA-Based Dataflow Platform
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform
Chaim Baskin
Natan Liss
Evgenii Zheltonozhskii
A. Bronstein
A. Mendelson
GNN
MQ
28
35
0
31 Jul 2017
Previous
123...646566676869
Next