ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,622 papers shown
Title
Quantized Neural Networks for Microcontrollers: A Comprehensive Review of Methods, Platforms, and Applications
Quantized Neural Networks for Microcontrollers: A Comprehensive Review of Methods, Platforms, and Applications
Hamza A. Abushahla
Dara Varam
Ariel J. N. Panopio
Mohamed I. AlHajri
MQ
275
0
0
20 Aug 2025
Enhancing Robustness of Implicit Neural Representations Against Weight Perturbations
Enhancing Robustness of Implicit Neural Representations Against Weight PerturbationsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Wenyong Zhou
Yuxin Cheng
Zhengwu Liu
Taiqiang Wu
Chen Zhang
Ngai Wong
AAML
100
2
0
19 Aug 2025
One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression
One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression
Mikołaj Janusz
Tomasz Wojnar
Yawei Li
Luca Benini
Kamil Adamczewski
VLM
94
2
0
19 Aug 2025
Z-Pruner: Post-Training Pruning of Large Language Models for Efficiency without Retraining
Z-Pruner: Post-Training Pruning of Large Language Models for Efficiency without Retraining
Samiul Basir Bhuiyan
Md. Sazzad Hossain Adib
Mohammed Aman Bhuiyan
Muhammad Rafsan Kabir
Moshiur Farazi
Shafin Rahman
Nabeel Mohammed
132
1
0
18 Aug 2025
TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform
TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles PlatformIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2025
Jun Liu
Zhenglun Kong
Pu Zhao
Weihao Zeng
Hao Tang
...
Wenbin Zhang
Geng Yuan
Wei Niu
Xue Lin
Yanzhi Wang
145
7
0
17 Aug 2025
Quantization through Piecewise-Affine Regularization: Optimization and Statistical Guarantees
Quantization through Piecewise-Affine Regularization: Optimization and Statistical Guarantees
Jianhao Ma
Lin Xiao
56
0
0
14 Aug 2025
Quantization vs Pruning: Insights from the Strong Lottery Ticket Hypothesis
Quantization vs Pruning: Insights from the Strong Lottery Ticket Hypothesis
Aakash Kumar
Emanuele Natale
MQ
61
0
0
14 Aug 2025
Harnessing Input-Adaptive Inference for Efficient VLN
Harnessing Input-Adaptive Inference for Efficient VLN
Dongwoo Kang
Akhil Perincherry
Zachary Coalson
Aiden Gabriel
Stefan Lee
Sanghyun Hong
LM&Ro
98
0
0
12 Aug 2025
Pruning Large Language Models by Identifying and Preserving Functional Networks
Pruning Large Language Models by Identifying and Preserving Functional Networks
Yiheng Liu
Junhao Ning
Sichen Xia
Xiaohui Gao
Ning Qiang
Bao Ge
Junwei Han
Xiaoyan Cai
100
0
0
07 Aug 2025
HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models
HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models
Young D. Kwon
Rui Li
Sijia Li
Da Li
S. Bhattacharya
Stylianos I. Venieris
VLM
104
2
0
06 Aug 2025
WSS-CL: Weight Saliency Soft-Guided Contrastive Learning for Efficient Machine Unlearning Image Classification
WSS-CL: Weight Saliency Soft-Guided Contrastive Learning for Efficient Machine Unlearning Image Classification
Thang Duc Tran
Thai Hoang Le
MU
93
0
0
06 Aug 2025
InfoQ: Mixed-Precision Quantization via Global Information Flow
InfoQ: Mixed-Precision Quantization via Global Information Flow
Mehmet Emre Akbulut
Hazem Hesham Yousef Shalby
Fabrizio Pittorino
Manuel Roveri
MQ
40
0
0
06 Aug 2025
4D-PreNet: A Unified Preprocessing Framework for 4D-STEM Data Analysis
4D-PreNet: A Unified Preprocessing Framework for 4D-STEM Data Analysis
Mingyu Liu
Zian Mao
Zhu Liu
Haoran Zhang
Jintao Guo
...
Xi Huang
Shufen Chu
Chun Cheng
Jun Ding
Yujun Xie
85
0
0
05 Aug 2025
EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language Models
EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Yuanteng Chen
Yuantian Shao
Peisong Wang
Jian Cheng
MoE
103
2
0
03 Aug 2025
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
Zishan Shao
Yixiao Wang
Qinsi Wang
Ting Jiang
Zhixu Du
Hancheng Ye
Danyang Zhuo
Yiran Chen
Xue Yang
67
3
0
02 Aug 2025
Fusion Sampling Validation in Data Partitioning for Machine Learning
Fusion Sampling Validation in Data Partitioning for Machine Learning
Christopher Godwin Udomboso
Caston Sigauke
Ini Adinya
88
0
0
02 Aug 2025
Compression-Induced Communication-Efficient Large Model Training and Inferencing
Compression-Induced Communication-Efficient Large Model Training and Inferencing
Sudip K. Seal
Maksudul Alam
Jorge Ramirez
Sajal Dash
Hao Lu
AI4CE
78
0
0
01 Aug 2025
Anomaly detection with spiking neural networks for LHC physics
Anomaly detection with spiking neural networks for LHC physics
Barry M. Dillon
Jim Harkin
Aqib Javed
100
0
0
31 Jul 2025
Improved Robustness and Functional Localization in Topographic CNNs Through Weight Similarity
Improved Robustness and Functional Localization in Topographic CNNs Through Weight Similarity
Nhut Truong
Uri Hasson
64
0
0
31 Jul 2025
An Architecture for Spatial Networking
An Architecture for Spatial Networking
Josh Millar
Ryan Gibb
Roy Ang
Hamed Haddadi
Hamed Haddadi
164
0
0
30 Jul 2025
Improving Neural Network Training using Dynamic Learning Rate Schedule for PINNs and Image Classification
Improving Neural Network Training using Dynamic Learning Rate Schedule for PINNs and Image Classification
D. Veerababu
Ashwin A. Raikar
Prasanta K. Ghosh
87
2
0
29 Jul 2025
From Waveforms to Pixels: A Survey on Audio-Visual Segmentation
From Waveforms to Pixels: A Survey on Audio-Visual Segmentation
Jia Li
Yapeng Tian
VOS
186
1
0
29 Jul 2025
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
Kele Shao
Keda Tao
Kejia Zhang
Sicheng Feng
Mu Cai
Yuzhang Shang
Haoxuan You
Can Qin
Yang Sui
Huan Wang
397
9
0
27 Jul 2025
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
Chen Zhu
Wangbo Zhao
Huiwen Zhang
Samir Khaki
Yuhao Zhou
...
Zhihang Yuan
Yuzhang Shang
Xiaojiang Peng
Kai Wang
Dawei Yang
149
0
0
25 Jul 2025
Knowledge Grafting: A Mechanism for Optimizing AI Model Deployment in Resource-Constrained Environments
Knowledge Grafting: A Mechanism for Optimizing AI Model Deployment in Resource-Constrained Environments
Osama Almurshed
Ashish Kaushal
Asmail Muftah
Nitin Auluck
Omer Rana
164
0
0
25 Jul 2025
The Right to be Forgotten in Pruning: Unveil Machine Unlearning on Sparse Models
The Right to be Forgotten in Pruning: Unveil Machine Unlearning on Sparse Models
Yang Xiao
Gen Li
Jie Ji
Ruimeng Ye
Xiaolong Ma
Bo Hui
MU
218
1
0
24 Jul 2025
FedVLM: Scalable Personalized Vision-Language Models through Federated Learning
FedVLM: Scalable Personalized Vision-Language Models through Federated Learning
Arkajyoti Mitra
Afia Anjum
Paul Agbaje
Mert D. Pesé
Habeeb Olufowobi
VLM
130
1
0
23 Jul 2025
Understanding Generalization, Robustness, and Interpretability in Low-Capacity Neural Networks
Understanding Generalization, Robustness, and Interpretability in Low-Capacity Neural Networks
Yash Kumar
48
0
0
22 Jul 2025
CompLeak: Deep Learning Model Compression Exacerbates Privacy Leakage
CompLeak: Deep Learning Model Compression Exacerbates Privacy Leakage
Na Li
Yansong Gao
Hongsheng Hu
Boyu Kuang
Anmin Fu
148
0
0
22 Jul 2025
A Survey on Efficiency Optimization Techniques for DNN-based Video Analytics: Process Systems, Algorithms, and Applications
A Survey on Efficiency Optimization Techniques for DNN-based Video Analytics: Process Systems, Algorithms, and Applications
Shanjiang Tang
Rui Huang
Hsinyu Luo
C. Wang
Ce Yu
Yusen Li
Hao Fu
Chao Sun
and Jian Xiao
110
0
0
21 Jul 2025
Search-Optimized Quantization in Biomedical Ontology Alignment
Search-Optimized Quantization in Biomedical Ontology AlignmentFrontiers in Artificial Intelligence (Front. Artif. Intell.), 2025
Oussama Bouaggad
Natalia Grabar
MQ
129
0
0
18 Jul 2025
EEG Foundation Models: A Critical Review of Current Progress and Future Directions
EEG Foundation Models: A Critical Review of Current Progress and Future Directions
Gayal Kuruppu
Neeraj Wagh
Y. Varatharajah
209
0
0
15 Jul 2025
IPPRO: Importance-based Pruning with PRojective Offset for Magnitude-indifferent Structural Pruning
IPPRO: Importance-based Pruning with PRojective Offset for Magnitude-indifferent Structural Pruning
Jaeheun Jung
Jaehyuk Lee
Yeajin Lee
Donghun Lee
107
0
0
10 Jul 2025
Pay Attention to Small Weights
Pay Attention to Small Weights
Chao Zhou
Tom Jacobs
Advait Gadhikar
R. Burkholz
106
0
0
26 Jun 2025
Advances in Intelligent Hearing Aids: Deep Learning Approaches to Selective Noise Cancellation
Advances in Intelligent Hearing Aids: Deep Learning Approaches to Selective Noise Cancellation
Haris Khan
Shumaila Asif
Hassan Nasir
Kamran Aziz Bhatti
Shahzad Amin Sheikh
102
1
0
25 Jun 2025
Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps
Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding HelpsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Jiashun Cheng
Chenyi Zi
Polydoros Giannouris
Ziqi Gao
Yuhan Li
Jia Li
Fugee Tsung
176
0
0
20 Jun 2025
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
Samir Khaki
Xiuyu Li
Junxian Guo
Ligeng Zhu
Chenfeng Xu
Konstantinos N. Plataniotis
Amir Yazdanbakhsh
Kurt Keutzer
Song Han
Zhijian Liu
174
2
0
19 Jun 2025
Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs
Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs
Xun Wang
Jing Xu
Franziska Boenisch
Michael Backes
Christopher A. Choquette-Choo
Adam Dziedzic
AAML
170
0
0
19 Jun 2025
A Real-time Endoscopic Image Denoising System
A Real-time Endoscopic Image Denoising System
Yu Xing
Shishi Huang
Meng Lv
Guo Chen
Huailiang Wang
Lingzhi Sui
105
0
0
18 Jun 2025
A Survey on World Models Grounded in Acoustic Physical Information
A Survey on World Models Grounded in Acoustic Physical Information
Xiaoliang Chen
Le Chang
Xin Yu
Yunhe Huang
Xianling Tu
SyDaAI4CE
149
0
0
16 Jun 2025
MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models
MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models
Yan Sun
Qixin Zhang
Zhiyuan Yu
Xikun Zhang
Li Shen
Dacheng Tao
143
1
0
15 Jun 2025
ReFrame: Layer Caching for Accelerated Inference in Real-Time Rendering
ReFrame: Layer Caching for Accelerated Inference in Real-Time Rendering
Lufei Liu
Tor M. Aamodt
118
0
0
14 Jun 2025
Compression Aware Certified Training
Compression Aware Certified Training
Changming Xu
Gagandeep Singh
130
0
0
13 Jun 2025
Auto-Compressing Networks
Auto-Compressing Networks
Vaggelis Dorovatas
Georgios Paraskevopoulos
Alexandros Potamianos
292
2
0
11 Jun 2025
SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving
SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving
Xiangchen Li
Dimitrios Spatharakis
Saeid Ghafouri
Jiakun Fan
Dimitrios Nikolopoulos
Deepu John
Bo Ji
Dimitrios S. Nikolopoulos
228
4
0
11 Jun 2025
A Topological Improvement of the Overall Performance of Sparse Evolutionary Training: Motif-Based Structural Optimization of Sparse MLPs Project
A Topological Improvement of the Overall Performance of Sparse Evolutionary Training: Motif-Based Structural Optimization of Sparse MLPs Project
Xiaotian Chen
Hongyun Liu
Seyed Sahand Mohammadi Ziabari
130
0
0
10 Jun 2025
Hyperpruning: Efficient Search through Pruned Variants of Recurrent Neural Networks Leveraging Lyapunov Spectrum
Hyperpruning: Efficient Search through Pruned Variants of Recurrent Neural Networks Leveraging Lyapunov Spectrum
Caleb Zheng
Eli Shlizerman
110
0
0
09 Jun 2025
Modified K-means Algorithm with Local Optimality Guarantees
Modified K-means Algorithm with Local Optimality Guarantees
Mingyi Li
Michael R. Metel
Akiko Takeda
DRL
137
0
0
08 Jun 2025
Event Classification of Accelerometer Data for Industrial Package Monitoring with Embedded Deep Learning
Event Classification of Accelerometer Data for Industrial Package Monitoring with Embedded Deep LearningAnnual International Computer Software and Applications Conference (COMPSAC), 2025
Manon Renault
Hamoud Younes
Hugo Tessier
Ronan Le Roy
Bastien Pasdeloup
Mathieu Léonardon
91
0
0
05 Jun 2025
Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order
Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order
Egor Petrov
Grigoriy Evseev
Aleksey Antonov
Andrey Veprikov
Nikolay Bushkov
Nikolay Bushkov
Stanislav Moiseev
345
2
0
04 Jun 2025
Previous
123456...717273
Next