arXiv:1510.00149
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,628 papers shown
Title
TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object Detection Network for Low Power Microcontrollers
International Conference on Artificial Intelligence Circuits and Systems (ICAICS), 2023
Julian Moosmann
Marco Giordano
Christian Vogt
Michele Magno
MQ
ObjD
205
30
0
22 May 2023
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity
Yannan Nellie Wu
Po-An Tsai
Saurav Muralidharan
A. Parashar
Vivienne Sze
J. Emer
165
41
0
22 May 2023
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
IEEE International Conference on Multimedia and Expo (ICME), 2023
Yijia Zhang
Lingran Zhao
Shijie Cao
Wenqiang Wang
Ting Cao
Fan Yang
Mao Yang
Shanghang Zhang
Ningyi Xu
MQ
139
24
0
21 May 2023
Self-Distillation with Meta Learning for Knowledge Graph Completion
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yunshui Li
Junhao Liu
Chengming Li
Min Yang
184
8
0
20 May 2023
Efficient Prompting via Dynamic In-Context Learning
Wangchunshu Zhou
Yuchen Eleanor Jiang
Robert Bamler
Mrinmaya Sachan
157
25
0
18 May 2023
PDP: Parameter-free Differentiable Pruning is All You Need
Neural Information Processing Systems (NeurIPS), 2023
Minsik Cho
Saurabh N. Adya
Devang Naik
VLM
187
15
0
18 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
International Journal of Computer Vision (IJCV), 2023
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGen
DiffM
295
334
0
17 May 2023
Analyzing Compression Techniques for Computer Vision
Maniratnam Mandal
Imran Khan
154
1
0
14 May 2023
TIPS: Topologically Important Path Sampling for Anytime Neural Networks
International Conference on Machine Learning (ICML), 2023
Guihong Li
Kartikeya Bhardwaj
Yuedong Yang
R. Marculescu
AAML
279
0
0
13 May 2023
Efficient Asynchronize Stochastic Gradient Algorithm with Structured Data
Zhao Song
Mingquan Ye
199
4
0
13 May 2023
Accelerator-Aware Training for Transducer-Based Speech Recognition
Spoken Language Technology Workshop (SLT), 2023
Suhaila M. Shakiah
Rupak Vignesh Swaminathan
Hieu Duy Nguyen
Raviteja Chinta
Tariq Afzal
Nathan Susanj
Athanasios Mouchtaris
Grant P. Strimel
Ariya Rastrow
133
1
0
12 May 2023
Divide-and-Conquer the NAS puzzle in Resource Constrained Federated Learning Systems
Neural Networks (Neural Netw.), 2023
Yeshwanth Venkatesha
Youngeun Kim
Hyoungseob Park
Priyadarshini Panda
FedML
114
6
0
11 May 2023
Post-training Model Quantization Using GANs for Synthetic Data Generation
Athanasios Masouris
Mansi Sharma
Adrian Boguszewski
Alexander Kozlov
Zhuo Wu
Raymond Lo
MQ
146
0
0
10 May 2023
VEDLIoT -- Next generation accelerated AIoT systems and applications
ACM International Conference on Computing Frontiers (CF), 2023
Kevin Mika
R. Griessl
N. Kucza
F. Porrmann
M. Kaiser
...
Mario Porrmann
Hans-Martin Heyn
E. Knauss
Yufei Mao
Franz Meierhofer
114
6
0
09 May 2023
DietCNN: Multiplication-free Inference for Quantized CNNs
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Swarnava Dey
P. Dasgupta
P. Chakrabarti
MQ
237
1
0
09 May 2023
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Lingjiao Chen
Matei A. Zaharia
James Zou
LLMAG
346
378
0
09 May 2023
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation
J. Heo
S. Azizi
A. Fayyazi
Massoud Pedram
205
1
0
08 May 2023
Compressing audio CNNs with graph centrality based filter pruning
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023
James A. King
Ashutosh Kumar Singh
Mark D. Plumbley
GNN
122
2
0
05 May 2023
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
International Symposium on High-Performance Computer Architecture (HPCA), 2023
Sai Qian Zhang
Thierry Tambe
Nestor Cuevas
Gu-Yeon Wei
David Brooks
206
9
0
04 May 2023
Input Layer Binarization with Bit-Plane Encoding
International Conference on Artificial Neural Networks (ICANN), 2023
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
162
8
0
04 May 2023
A Constrained BA Algorithm for Rate-Distortion and Distortion-Rate Functions
CSIAM Transactions on Applied Mathematics (TCAM), 2023
Lin Chen
Shitong Wu
Wen-Long Ye
Huihui Wu
Wen-Ying Zhang
Hao Wu
Bo Bai
59
9
0
04 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning
Conference on Machine Learning and Systems (MLSys), 2023
Hongyi Wang
Saurabh Agarwal
Pongsakorn U-chupala
Yoshiki Tanaka
Eric P. Xing
Dimitris Papailiopoulos
OffRL
270
26
0
04 May 2023
Dynamic Sparse Training with Structured Sparsity
International Conference on Learning Representations (ICLR), 2023
Mike Lasby
A. Golubeva
Utku Evci
Mihai Nica
Yani Andrew Ioannou
566
33
0
03 May 2023
A Digital Twin Empowered Lightweight Model Sharing Scheme for Multi-Robot Systems
IEEE Internet of Things Journal (IEEE IoT J.), 2023
Kai Xiong
Zhihong Wang
S. Leng
Jianhua He
109
14
0
03 May 2023
BCEdge: SLO-Aware DNN Inference Services with Adaptive Batching on Edge Platforms
Ziyang Zhang
Huan Li
Yang Zhao
Changyao Lin
Jie Liu
145
5
0
01 May 2023
CORSD: Class-Oriented Relational Self Distillation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Muzhou Yu
S. Tan
Kailu Wu
Runpei Dong
Linfeng Zhang
Kaisheng Ma
102
1
0
28 Apr 2023
Sparsified Model Zoo Twins: Investigating Populations of Sparsified Neural Network Models
D. Honegger
Konstantin Schürholt
Damian Borth
236
5
0
26 Apr 2023
Optimizing Deep Learning Models For Raspberry Pi
Sa Ameen
Kangaranmulle Siriwardana
Theodoros Theodoridis
VLM
90
11
0
25 Apr 2023
Multiplierless In-filter Computing for tinyML Platforms
International Conference on VLSI Design (VLSID), 2023
Abhishek Ramdas Nair
P. Nath
S. Chakrabartty
Chetan Singh Thakur
95
1
0
24 Apr 2023
The Case for Hierarchical Deep Learning Inference at the Network Edge
Ghina Al-Atat
Andrea Fresa
Adarsh Prasad Behera
Vishnu Narayanan Moothedath
James Gross
J. Champati
149
12
0
23 Apr 2023
Deep Convolutional Tables: Deep Learning without Convolutions
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
S. Dekel
Y. Keller
Aharon Bar-Hillel
3DV
250
0
0
23 Apr 2023
QuMoS: A Framework for Preserving Security of Quantum Machine Learning Model
International Conference on Quantum Computing and Engineering (QCE), 2023
Zhepeng Wang
Jinyang Li
Zhirui Hu
Blake Gage
Elizabeth Iwasawa
Weiwen Jiang
261
16
0
23 Apr 2023
Identifying Appropriate Intellectual Property Protection Mechanisms for Machine Learning Models: A Systematization of Watermarking, Fingerprinting, Model Access, and Attacks
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Isabell Lederer
Rudolf Mayer
Andreas Rauber
228
29
0
22 Apr 2023
Securing Neural Networks with Knapsack Optimization
Yakir Gorski
Amir Jevnisek
S. Avidan
AAML
104
1
0
20 Apr 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
244
171
0
20 Apr 2023
Knowledge Distillation Under Ideal Joint Classifier Assumption
Neural Networks (Neural Netw.), 2023
Huayu Li
Xiwen Chen
G. Ditzler
Janet Roveda
Ao Li
140
2
0
19 Apr 2023
Adaptive Scheduling for Edge-Assisted DNN Serving
IEEE International Conference on Mobile Adhoc and Sensor Systems (MASS), 2023
Jian He
Chen-Shun Yang
Zhaoyuan He
Ghufran Baig
L. Qiu
104
1
0
19 Apr 2023
Model Pruning Enables Localized and Efficient Federated Learning for Yield Forecasting and Data Sharing
Expert systems with applications (ESWA), 2023
An-dong Li
Milan Markovic
P. Edwards
Georgios Leontidis
FedML
131
24
0
19 Apr 2023
Neural Network Quantisation for Faster Homomorphic Encryption
IEEE International Symposium on On-Line Testing and Robust System Design (IOLTS), 2023
Wouter Legiest
Jan-Pieter D'Anvers
Furkan Turan
Michiel Van Beirendonck
Ingrid Verbauwhede
MQ
148
6
0
19 Apr 2023
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xiuying Wei
Yunchen Zhang
Yuhang Li
Xiangguo Zhang
Yazhe Niu
Jian Ren
Zhengang Li
MQ
203
57
0
18 Apr 2023
Frequency Regularization: Restricting Information Redundancy of Convolutional Neural Networks
IEEE Access (IEEE Access), 2023
Chenqiu Zhao
Guanfang Dong
Shupei Zhang
Zijie Tan
Anup Basu
311
4
0
17 Apr 2023
Evil from Within: Machine Learning Backdoors through Hardware Trojans
Alexander Warnecke
Julian Speith
Janka Möller
Konrad Rieck
C. Paar
AAML
478
3
0
17 Apr 2023
SalientGrads: Sparse Models for Communication Efficient and Data Aware Distributed Federated Training
Riyasat Ohib
Bishal Thapaliya
Pratyush Gaggenapalli
Qingbin Liu
Vince D. Calhoun
Sergey Plis
FedML
132
2
0
15 Apr 2023
Generating Adversarial Examples with Better Transferability via Masking Unimportant Parameters of Surrogate Model
IEEE International Joint Conference on Neural Network (IJCNN), 2023
Dingcheng Yang
Wenjian Yu
Zihao Xiao
Jiaqi Luo
AAML
DiffM
161
6
0
14 Apr 2023
A Survey on Approximate Edge AI for Energy Efficient Autonomous Driving Services
IEEE Communications Surveys and Tutorials (COMST), 2023
Dewant Katare
Diego Perino
J. Nurmi
M. Warnier
Marijn Janssen
Aaron Yi Ding
262
61
0
13 Apr 2023
Learning Accurate Performance Predictors for Ultrafast Automated Model Compression
International Journal of Computer Vision (IJCV), 2023
Ziwei Wang
Jiwen Lu
Han Xiao
Shengyu Liu
Jie Zhou
OffRL
150
1
0
13 Apr 2023
Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Z. Su
Jiehua Zhang
Tianpeng Liu
Zhen Liu
Shuanghui Zhang
M. Pietikäinen
Tianpeng Liu
154
5
0
13 Apr 2023
EcoFed: Efficient Communication for DNN Partitioning-based Federated Learning
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2023
Di Wu
R. Ullah
Philip Rodgers
Peter Kilpatrick
I. Spence
Blesson Varghese
FedML
255
8
0
11 Apr 2023
Scale-Space Hypernetworks for Efficient Biomedical Imaging
Jose Javier Gonzalez Ortiz
John Guttag
Adrian Dalca
210
0
0
11 Apr 2023
SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML Acceleration
IEEE Journal of Solid-State Circuits (JSSC), 2023
I. Miro-Panadès
Benoît Tain
J. Christmann
David Coriat
R. Lemaire
...
Jean-Marc Philippe
Y. Thonnart
A. Valentian
Frédéric Heitzmann
F. Clermidy
83
19
0
11 Apr 2023