ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,629 papers shown
Title
SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML
  Acceleration
SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML AccelerationIEEE Journal of Solid-State Circuits (JSSC), 2023
I. Miro-Panadès
Benoît Tain
J. Christmann
David Coriat
R. Lemaire
...
Jean-Marc Philippe
Y. Thonnart
A. Valentian
Frédéric Heitzmann
F. Clermidy
83
20
0
11 Apr 2023
Conditional Adapters: Parameter-efficient Transfer Learning with Fast
  Inference
Conditional Adapters: Parameter-efficient Transfer Learning with Fast InferenceNeural Information Processing Systems (NeurIPS), 2023
Tao Lei
Junwen Bai
Siddhartha Brahma
Joshua Ainslie
Kenton Lee
...
Vincent Zhao
Yuexin Wu
Yue Liu
Yu Zhang
Ming-Wei Chang
BDLAI4CE
209
80
0
11 Apr 2023
Model Sparsity Can Simplify Machine Unlearning
Model Sparsity Can Simplify Machine UnlearningNeural Information Processing Systems (NeurIPS), 2023
Jinghan Jia
Jiancheng Liu
Parikshit Ram
Yuguang Yao
Gaowen Liu
Yang Liu
Pranay Sharma
Sijia Liu
MU
632
189
0
11 Apr 2023
Graph Enabled Cross-Domain Knowledge Transfer
Graph Enabled Cross-Domain Knowledge Transfer
S. Yao
132
0
0
07 Apr 2023
Tensor Slicing and Optimization for Multicore NPUs
Tensor Slicing and Optimization for Multicore NPUs
R. Sousa
M. Pereira
Yongin Kwon
Taeho Kim
Namsoon Jung
Chang Soo Kim
Michael Frank
Guido Araujo
192
8
0
06 Apr 2023
Learning to Learn with Indispensable Connections
Learning to Learn with Indispensable Connections
Sambhavi Tiwari
Manas Gogoi
Shekhar Verma
Krishna Pratap Singh
CLL
139
1
0
06 Apr 2023
HNeRV: A Hybrid Neural Representation for Videos
HNeRV: A Hybrid Neural Representation for VideosComputer Vision and Pattern Recognition (CVPR), 2023
Hao Chen
M. Gwilliam
Ser-Nam Lim
Abhinav Shrivastava
141
107
1
05 Apr 2023
Efficient human-in-loop deep learning model training with iterative
  refinement and statistical result validation
Efficient human-in-loop deep learning model training with iterative refinement and statistical result validation
Manuel Zahn
Douglas P. Perrin
132
1
0
03 Apr 2023
Optimizing data-flow in Binary Neural Networks
Optimizing data-flow in Binary Neural NetworksItalian National Conference on Sensors (INS), 2023
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
192
6
0
03 Apr 2023
SEENN: Towards Temporal Spiking Early-Exit Neural Networks
SEENN: Towards Temporal Spiking Early-Exit Neural NetworksNeural Information Processing Systems (NeurIPS), 2023
Yuhang Li
Tamar Geller
Youngeun Kim
Priyadarshini Panda
274
59
0
02 Apr 2023
A Generative Framework for Low-Cost Result Validation of Machine
  Learning-as-a-Service Inference
A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service InferenceACM Asia Conference on Computer and Communications Security (AsiaCCS), 2023
Abhinav Kumar
Miguel A. Guirao Aguilera
R. Tourani
Satyajayant Misra
AAML
385
1
0
31 Mar 2023
BOLT: An Automated Deep Learning Framework for Training and Deploying
  Large-Scale Search and Recommendation Models on Commodity CPU Hardware
BOLT: An Automated Deep Learning Framework for Training and Deploying Large-Scale Search and Recommendation Models on Commodity CPU HardwareInternational Conference on Information and Knowledge Management (CIKM), 2023
Nicholas Meisburger
V. Lakshman
Benito Geordie
Joshua Engels
David Torres Ramos
...
Benjamin Meisburger
Shubh Gupta
Yashwanth Adunukota
Tharun Medini
Anshumali Shrivastava
220
3
0
30 Mar 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution
  Vision Transformer
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision TransformerComputer Vision and Pattern Recognition (CVPR), 2023
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
327
72
0
30 Mar 2023
Distributed Neural Representation for Reactive in situ Visualization
Distributed Neural Representation for Reactive in situ VisualizationIEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Qi Wu
J. Insley
V. Mateevitsi
S. Rizzi
M. Papka
Kwan-Liu Ma
160
5
0
28 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based
  Real-time Mobile Vision Applications
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision ApplicationsIEEE International Conference on Computer Vision (ICCV), 2023
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
ViT
317
166
0
27 Mar 2023
Vision Models Can Be Efficiently Specialized via Few-Shot Task-Aware
  Compression
Vision Models Can Be Efficiently Specialized via Few-Shot Task-Aware Compression
Denis Kuznedelev
Soroush Tabesh
Kimia Noorbakhsh
Elias Frantar
Sara Beery
Eldar Kurtic
Dan Alistarh
MQVLM
193
3
0
25 Mar 2023
PowerPruning: Selecting Weights and Activations for Power-Efficient
  Neural Network Acceleration
PowerPruning: Selecting Weights and Activations for Power-Efficient Neural Network AccelerationDesign Automation Conference (DAC), 2023
Richard Petri
Grace Li Zhang
Yiran Chen
Ulf Schlichtmann
Bing Li
75
10
0
24 Mar 2023
Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with
  Bridge Block Reconstruction for IoT Systems
Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT SystemsIEEE Internet of Things Journal (IEEE IoT J.), 2023
Jemin Lee
Yongin Kwon
Sihyeong Park
Misun Yu
Jeman Park
Hwanjun Song
ViTMQ
222
12
0
22 Mar 2023
Low Rank Optimization for Efficient Deep Learning: Making A Balance
  between Compact Architecture and Fast Training
Low Rank Optimization for Efficient Deep Learning: Making A Balance between Compact Architecture and Fast TrainingJournal of Systems Engineering and Electronics (JSEE), 2023
Xinwei Ou
Zhangxin Chen
Ce Zhu
Yipeng Liu
196
9
0
22 Mar 2023
Performance-aware Approximation of Global Channel Pruning for Multitask
  CNNs
Performance-aware Approximation of Global Channel Pruning for Multitask CNNsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Hancheng Ye
Bo Zhang
Tao Chen
Jiayuan Fan
Sijin Yu
136
35
0
21 Mar 2023
Solving Oscillation Problem in Post-Training Quantization Through a
  Theoretical Perspective
Solving Oscillation Problem in Post-Training Quantization Through a Theoretical PerspectiveComputer Vision and Pattern Recognition (CVPR), 2023
Yuexiao Ma
Huixia Li
Xiawu Zheng
Xuefeng Xiao
Rui Wang
Shilei Wen
Xin Pan
Jiayi Ji
Rongrong Ji
MQ
228
15
0
21 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training
  Efficiency
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training EfficiencyInternational Conference on Machine Learning (ICML), 2023
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
420
7
0
21 Mar 2023
Greedy Pruning with Group Lasso Provably Generalizes for Matrix Sensing
Greedy Pruning with Group Lasso Provably Generalizes for Matrix SensingNeural Information Processing Systems (NeurIPS), 2023
Nived Rajaraman
Devvrit
Aryan Mokhtari
Kannan Ramchandran
236
3
0
20 Mar 2023
ExplainFix: Explainable Spatially Fixed Deep Networks
ExplainFix: Explainable Spatially Fixed Deep Networks
Alex Gaudio
Christos Faloutsos
A. Smailagic
P. Costa
A. Campilho
FAtt
143
3
0
18 Mar 2023
DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision
  Models
DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision Models
Yucheng Ding
Chaoyue Niu
Fan Wu
Shaojie Tang
Chengfei Lyu
Guihai Chen
160
10
0
18 Mar 2023
Unleashing the Potential of Spiking Neural Networks by Dynamic
  Confidence
Unleashing the Potential of Spiking Neural Networks by Dynamic ConfidenceIEEE International Conference on Computer Vision (ICCV), 2023
Chen Li
Edward Jones
Steve Furber
286
22
0
17 Mar 2023
Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
Iterative Soft Shrinkage Learning for Efficient Image Super-ResolutionIEEE International Conference on Computer Vision (ICCV), 2023
Jiamian Wang
Huan Wang
Yulun Zhang
Yun Fu
Zhiqiang Tao
SupR
152
4
0
16 Mar 2023
A High-Performance Accelerator for Super-Resolution Processing on
  Embedded GPU
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
W. Zhao
Qi Sun
Yang Bai
Wenbo Li
Haisheng Zheng
Bei Yu
Martin D. F. Wong
SupR
126
12
0
16 Mar 2023
Gated Compression Layers for Efficient Always-On Models
Gated Compression Layers for Efficient Always-On Models
Haiguang Li
T. Thormundsson
I. Poupyrev
N. Gillian
166
3
0
15 Mar 2023
R2 Loss: Range Restriction Loss for Model Compression and Quantization
R2 Loss: Range Restriction Loss for Model Compression and Quantization
Arnav Kundu
Chungkuk Yoo
Srijan Mishra
Minsik Cho
Saurabh N. Adya
MQ
137
2
0
14 Mar 2023
MetaMixer: A Regularization Strategy for Online Knowledge Distillation
MetaMixer: A Regularization Strategy for Online Knowledge Distillation
Maorong Wang
L. Xiao
T. Yamasaki
KELMMoE
115
1
0
14 Mar 2023
FPUS23: An Ultrasound Fetus Phantom Dataset with Deep Neural Network
  Evaluations for Fetus Orientations, Fetal Planes, and Anatomical Features
FPUS23: An Ultrasound Fetus Phantom Dataset with Deep Neural Network Evaluations for Fetus Orientations, Fetal Planes, and Anatomical FeaturesIEEE Access (IEEE Access), 2023
B. Prabakaran
Paul Hamelmann
Erik Ostrowski
Mohamed Bennai
196
24
0
14 Mar 2023
Automatic Attention Pruning: Improving and Automating Model Pruning
  using Attentions
Automatic Attention Pruning: Improving and Automating Model Pruning using AttentionsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Kaiqi Zhao
Animesh Jain
Ming Zhao
181
19
0
14 Mar 2023
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse
  Edge Environments
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge EnvironmentsACM/IEEE International Conference on Mobile Computing and Networking (MobiCom), 2023
Hao Wen
Yuanchun Li
Zunshuai Zhang
Shiqi Jiang
Xiaozhou Ye
Ouyang Ye
Yaqin Zhang
Yunxin Liu
231
54
0
13 Mar 2023
Three Guidelines You Should Know for Universally Slimmable
  Self-Supervised Learning
Three Guidelines You Should Know for Universally Slimmable Self-Supervised LearningComputer Vision and Pattern Recognition (CVPR), 2023
Yunhao Cao
Peiqin Sun
Shuchang Zhou
107
5
0
13 Mar 2023
OTOV2: Automatic, Generic, User-Friendly
OTOV2: Automatic, Generic, User-FriendlyInternational Conference on Learning Representations (ICLR), 2023
Tianyi Chen
Luming Liang
Tian Ding
Zhihui Zhu
Ilya Zharkov
VLMMQ
217
48
0
13 Mar 2023
Complement Sparsification: Low-Overhead Model Pruning for Federated
  Learning
Complement Sparsification: Low-Overhead Model Pruning for Federated LearningAAAI Conference on Artificial Intelligence (AAAI), 2023
Xiaopeng Jiang
Cristian Borcea
FedML
165
30
0
10 Mar 2023
Sparse and Local Networks for Hypergraph Reasoning
Sparse and Local Networks for Hypergraph ReasoningLOG IN (LOG IN), 2023
Guangxuan Xiao
L. Kaelbling
Jiajun Wu
Jiayuan Mao
NAIReLMLRM
182
1
0
09 Mar 2023
A Privacy Preserving System for Movie Recommendations Using Federated
  Learning
A Privacy Preserving System for Movie Recommendations Using Federated Learning
David Neumann
Andreas Lutz
Karsten Müller
Wojciech Samek
333
16
0
07 Mar 2023
An Edge-based WiFi Fingerprinting Indoor Localization Using
  Convolutional Neural Network and Convolutional Auto-Encoder
An Edge-based WiFi Fingerprinting Indoor Localization Using Convolutional Neural Network and Convolutional Auto-EncoderIEEE Access (IEEE Access), 2023
Amin Kargar-Barzi
Ebrahim Farahmand
Nooshin Taheri Chatrudi
A. Mahani
M. Shafique
140
18
0
07 Mar 2023
Training-Free Acceleration of ViTs with Delayed Spatial Merging
Training-Free Acceleration of ViTs with Delayed Spatial Merging
J. Heo
Seyedarmin Azizi
A. Fayyazi
Massoud Pedram
254
4
0
04 Mar 2023
Adversarial Attacks on Machine Learning in Embedded and IoT Platforms
Adversarial Attacks on Machine Learning in Embedded and IoT Platforms
Christian Westbrook
S. Pasricha
AAML
129
3
0
03 Mar 2023
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!International Conference on Learning Representations (ICLR), 2023
Shiwei Liu
Tianlong Chen
Zhenyu Zhang
Xuxi Chen
Tianjin Huang
Ajay Jaiswal
Zinan Lin
182
31
0
03 Mar 2023
Rotation Invariant Quantization for Model Compression
Rotation Invariant Quantization for Model Compression
Dor-Joseph Kampeas
Yury Nahshan
Hanoch Kremer
Gil Lederman
Shira Zaloshinski
Zheng Li
E. Haleva
MQ
244
1
0
03 Mar 2023
TopSpark: A Timestep Optimization Methodology for Energy-Efficient
  Spiking Neural Networks on Autonomous Mobile Agents
TopSpark: A Timestep Optimization Methodology for Energy-Efficient Spiking Neural Networks on Autonomous Mobile AgentsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Rachmad Vidya Wicaksana Putra
Mohamed Bennai
172
18
0
03 Mar 2023
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker
  Verification
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
258
6
0
02 Mar 2023
Learning to Grow Pretrained Models for Efficient Transformer Training
Learning to Grow Pretrained Models for Efficient Transformer TrainingInternational Conference on Learning Representations (ICLR), 2023
Peihao Wang
Yikang Shen
Lucas Torroba Hennigen
P. Greengard
Leonid Karlinsky
Rogerio Feris
David D. Cox
Zinan Lin
Yoon Kim
183
70
0
02 Mar 2023
EdgeServe: A Streaming System for Decentralized Model Serving
EdgeServe: A Streaming System for Decentralized Model Serving
Ted Shaowang
Sanjay Krishnan
193
2
0
02 Mar 2023
Structured Pruning for Deep Convolutional Neural Networks: A survey
Structured Pruning for Deep Convolutional Neural Networks: A surveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yang He
Lingao Xiao
3DPC
348
255
0
01 Mar 2023
GRAN: Ghost Residual Attention Network for Single Image Super Resolution
GRAN: Ghost Residual Attention Network for Single Image Super Resolution
Axi Niu
Pei Wang
Yu Zhu
Jinqiu Sun
Qingsen Yan
Yanning Zhang
SupR
141
9
0
28 Feb 2023
Previous
123...181920...717273
Next