Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,628 papers shown
TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object Detection Network for Low Power Microcontrollers
International Conference on Artificial Intelligence Circuits and Systems (ICAICS), 2023
Julian Moosmann
Marco Giordano
Christian Vogt
Michele Magno
MQ
ObjD
205
30
0
22 May 2023
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity
Yannan Nellie Wu
Po-An Tsai
Saurav Muralidharan
A. Parashar
Vivienne Sze
J. Emer
165
41
0
22 May 2023
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
IEEE International Conference on Multimedia and Expo (ICME), 2023
Yijia Zhang
Lingran Zhao
Shijie Cao
Wenqiang Wang
Ting Cao
Fan Yang
Mao Yang
Shanghang Zhang
Ningyi Xu
MQ
139
24
0
21 May 2023
Self-Distillation with Meta Learning for Knowledge Graph Completion
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yunshui Li
Junhao Liu
Chengming Li
Min Yang
184
8
0
20 May 2023
Efficient Prompting via Dynamic In-Context Learning
Wangchunshu Zhou
Yuchen Eleanor Jiang
Robert Bamler
Mrinmaya Sachan
157
25
0
18 May 2023
PDP: Parameter-free Differentiable Pruning is All You Need
Neural Information Processing Systems (NeurIPS), 2023
Minsik Cho
Saurabh N. Adya
Devang Naik
VLM
187
15
0
18 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
International Journal of Computer Vision (IJCV), 2023
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGen
DiffM
295
334
0
17 May 2023
Analyzing Compression Techniques for Computer Vision
Maniratnam Mandal
Imran Khan
154
1
0
14 May 2023
TIPS: Topologically Important Path Sampling for Anytime Neural Networks
International Conference on Machine Learning (ICML), 2023
Guihong Li
Kartikeya Bhardwaj
Yuedong Yang
R. Marculescu
AAML
279
0
0
13 May 2023
Efficient Asynchronize Stochastic Gradient Algorithm with Structured Data
Zhao Song
Mingquan Ye
199
4
0
13 May 2023
Accelerator-Aware Training for Transducer-Based Speech Recognition
Spoken Language Technology Workshop (SLT), 2023
Suhaila M. Shakiah
Rupak Vignesh Swaminathan
Hieu Duy Nguyen
Raviteja Chinta
Tariq Afzal
Nathan Susanj
Athanasios Mouchtaris
Grant P. Strimel
Ariya Rastrow
133
1
0
12 May 2023
Divide-and-Conquer the NAS puzzle in Resource Constrained Federated Learning Systems
Neural Networks (Neural Netw.), 2023
Yeshwanth Venkatesha
Youngeun Kim
Hyoungseob Park
Priyadarshini Panda
FedML
114
6
0
11 May 2023
Post-training Model Quantization Using GANs for Synthetic Data Generation
Athanasios Masouris
Mansi Sharma
Adrian Boguszewski
Alexander Kozlov
Zhuo Wu
Raymond Lo
MQ
146
0
0
10 May 2023
VEDLIoT -- Next generation accelerated AIoT systems and applications
ACM International Conference on Computing Frontiers (CF), 2023
Kevin Mika
R. Griessl
N. Kucza
F. Porrmann
M. Kaiser
...
Mario Porrmann
Hans-Martin Heyn
E. Knauss
Yufei Mao
Franz Meierhofer
114
6
0
09 May 2023
DietCNN: Multiplication-free Inference for Quantized CNNs
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Swarnava Dey
P. Dasgupta
P. Chakrabarti
MQ
237
1
0
09 May 2023
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Lingjiao Chen
Matei A. Zaharia
James Zou
LLMAG
346
378
0
09 May 2023
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation
J. Heo
S. Azizi
A. Fayyazi
Massoud Pedram
205
1
0
08 May 2023
Compressing audio CNNs with graph centrality based filter pruning
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
James A. King
Ashutosh Kumar Singh
Mark D. Plumbley
GNN
122
2
0
05 May 2023
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
International Symposium on High-Performance Computer Architecture (HPCA), 2023
Sai Qian Zhang
Thierry Tambe
Nestor Cuevas
Gu-Yeon Wei
David Brooks
206
9
0
04 May 2023
Input Layer Binarization with Bit-Plane Encoding
International Conference on Artificial Neural Networks (ICANN), 2023
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
162
8
0
04 May 2023
A Constrained BA Algorithm for Rate-Distortion and Distortion-Rate Functions
CSIAM Transactions on Applied Mathematics (TCAM), 2023
Lin Chen
Shitong Wu
Wen-Long Ye
Huihui Wu
Wen-Ying Zhang
Hao Wu
Bo Bai
59
9
0
04 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning
Conference on Machine Learning and Systems (MLSys), 2023
Hongyi Wang
Saurabh Agarwal
Pongsakorn U-chupala
Yoshiki Tanaka
Eric P. Xing
Dimitris Papailiopoulos
OffRL
270
26
0
04 May 2023
Dynamic Sparse Training with Structured Sparsity
International Conference on Learning Representations (ICLR), 2023
Mike Lasby
A. Golubeva
Utku Evci
Mihai Nica
Yani Andrew Ioannou
566
33
0
03 May 2023
A Digital Twin Empowered Lightweight Model Sharing Scheme for Multi-Robot Systems
IEEE Internet of Things Journal (IEEE IoT J.), 2023
Kai Xiong
Zhihong Wang
S. Leng
Jianhua He
109
14
0
03 May 2023
BCEdge: SLO-Aware DNN Inference Services with Adaptive Batching on Edge Platforms
Ziyang Zhang
Huan Li
Yang Zhao
Changyao Lin
Jie Liu
145
5
0
01 May 2023
CORSD: Class-Oriented Relational Self Distillation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muzhou Yu
S. Tan
Kailu Wu
Runpei Dong
Linfeng Zhang
Kaisheng Ma
102
1
0
28 Apr 2023
Sparsified Model Zoo Twins: Investigating Populations of Sparsified Neural Network Models
D. Honegger
Konstantin Schurholt
Damian Borth
236
5
0
26 Apr 2023
Optimizing Deep Learning Models For Raspberry Pi
Sa Ameen
Kangaranmulle Siriwardana
Theodoros Theodoridis
VLM
90
11
0
25 Apr 2023
Multiplierless In-filter Computing for tinyML Platforms
International Conference on VLSI Design (VLSID), 2023
Abhishek Ramdas Nair
P. Nath
S. Chakrabartty
Chetan Singh Thakur
95
1
0
24 Apr 2023
The Case for Hierarchical Deep Learning Inference at the Network Edge
Ghina Al-Atat
Andrea Fresa
Adarsh Prasad Behera
Vishnu Narayanan Moothedath
James Gross
J. Champati
149
12
0
23 Apr 2023
Deep Convolutional Tables: Deep Learning without Convolutions
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
S. Dekel
Y. Keller
Aharon Bar-Hillel
3DV
250
0
0
23 Apr 2023
QuMoS: A Framework for Preserving Security of Quantum Machine Learning Model
International Conference on Quantum Computing and Engineering (QCE), 2023
Zhepeng Wang
Jinyang Li
Zhirui Hu
Blake Gage
Elizabeth Iwasawa
Weiwen Jiang
261
16
0
23 Apr 2023
Identifying Appropriate Intellectual Property Protection Mechanisms for Machine Learning Models: A Systematization of Watermarking, Fingerprinting, Model Access, and Attacks
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Isabell Lederer
Rudolf Mayer
Andreas Rauber
228
29
0
22 Apr 2023
Securing Neural Networks with Knapsack Optimization
Yakir Gorski
Amir Jevnisek
S. Avidan
AAML
104
1
0
20 Apr 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
244
171
0
20 Apr 2023
Knowledge Distillation Under Ideal Joint Classifier Assumption
Neural Networks (Neural Netw.), 2023
Huayu Li
Xiwen Chen
G. Ditzler
Janet Roveda
Ao Li
140
2
0
19 Apr 2023
Adaptive Scheduling for Edge-Assisted DNN Serving
IEEE International Conference on Mobile Adhoc and Sensor Systems (MASS), 2023
Jian He
Chen-Shun Yang
Zhaoyuan He
Ghufran Baig
L. Qiu
104
1
0
19 Apr 2023
Model Pruning Enables Localized and Efficient Federated Learning for Yield Forecasting and Data Sharing
Expert Systems with Applications (ESWA), 2023
An-dong Li
Milan Markovic
P. Edwards
Georgios Leontidis
FedML
131
24
0
19 Apr 2023
Neural Network Quantisation for Faster Homomorphic Encryption
IEEE International Symposium on On-Line Testing and Robust System Design (IOLTS), 2023
Wouter Legiest
Jan-Pieter D'Anvers
Furkan Turan
Michiel Van Beirendonck
Ingrid Verbauwhede
MQ
148
6
0
19 Apr 2023
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiuying Wei
Yunchen Zhang
Yuhang Li
Xiangguo Zhang
Yazhe Niu
Jian Ren
Zhengang Li
MQ
203
57
0
18 Apr 2023
Frequency Regularization: Restricting Information Redundancy of Convolutional Neural Networks
IEEE Access (IEEE Access), 2023
Chenqiu Zhao
Guanfang Dong
Shupei Zhang
Zijie Tan
Anup Basu
311
4
0
17 Apr 2023
Evil from Within: Machine Learning Backdoors through Hardware Trojans
Alexander Warnecke
Julian Speith
Janka Möller
Konrad Rieck
C. Paar
AAML
478
3
0
17 Apr 2023
SalientGrads: Sparse Models for Communication Efficient and Data Aware Distributed Federated Training
Riyasat Ohib
Bishal Thapaliya
Pratyush Gaggenapalli
Qingbin Liu
Vince D. Calhoun
Sergey Plis
FedML
132
2
0
15 Apr 2023
Generating Adversarial Examples with Better Transferability via Masking Unimportant Parameters of Surrogate Model
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Dingcheng Yang
Wenjian Yu
Zihao Xiao
Jiaqi Luo
AAML
DiffM
161
6
0
14 Apr 2023
A Survey on Approximate Edge AI for Energy Efficient Autonomous Driving Services
IEEE Communications Surveys and Tutorials (COMST), 2023
Dewant Katare
Diego Perino
J. Nurmi
M. Warnier
Marijn Janssen
Aaron Yi Ding
262
61
0
13 Apr 2023
Learning Accurate Performance Predictors for Ultrafast Automated Model Compression
International Journal of Computer Vision (IJCV), 2023
Ziwei Wang
Jiwen Lu
Han Xiao
Shengyu Liu
Jie Zhou
OffRL
150
1
0
13 Apr 2023
Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Z. Su
Jiehua Zhang
Tianpeng Liu
Zhen Liu
Shuanghui Zhang
M. Pietikäinen
Li Liu
154
5
0
13 Apr 2023
EcoFed: Efficient Communication for DNN Partitioning-based Federated Learning
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2023
Di Wu
R. Ullah
Philip Rodgers
Peter Kilpatrick
I. Spence
Blesson Varghese
FedML
255
8
0
11 Apr 2023
Scale-Space Hypernetworks for Efficient Biomedical Imaging
Jose Javier Gonzalez Ortiz
John Guttag
Adrian Dalca
210
0
0
11 Apr 2023
SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML Acceleration
IEEE Journal of Solid-State Circuits (JSSC), 2023
I. Miro-Panadès
Benoît Tain
J. Christmann
David Coriat
R. Lemaire
...
Jean-Marc Philippe
Y. Thonnart
A. Valentian
Frédéric Heitzmann
F. Clermidy
83
19
0
11 Apr 2023