Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,629 papers shown
Title
SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML Acceleration
IEEE Journal of Solid-State Circuits (JSSC), 2023
I. Miro-Panadès
Benoît Tain
J. Christmann
David Coriat
R. Lemaire
...
Jean-Marc Philippe
Y. Thonnart
A. Valentian
Frédéric Heitzmann
F. Clermidy
83
20
0
11 Apr 2023
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Neural Information Processing Systems (NeurIPS), 2023
Tao Lei
Junwen Bai
Siddhartha Brahma
Joshua Ainslie
Kenton Lee
...
Vincent Zhao
Yuexin Wu
Yue Liu
Yu Zhang
Ming-Wei Chang
BDL
AI4CE
209
80
0
11 Apr 2023
Model Sparsity Can Simplify Machine Unlearning
Neural Information Processing Systems (NeurIPS), 2023
Jinghan Jia
Jiancheng Liu
Parikshit Ram
Yuguang Yao
Gaowen Liu
Yang Liu
Pranay Sharma
Sijia Liu
MU
632
189
0
11 Apr 2023
Graph Enabled Cross-Domain Knowledge Transfer
S. Yao
132
0
0
07 Apr 2023
Tensor Slicing and Optimization for Multicore NPUs
R. Sousa
M. Pereira
Yongin Kwon
Taeho Kim
Namsoon Jung
Chang Soo Kim
Michael Frank
Guido Araujo
192
8
0
06 Apr 2023
Learning to Learn with Indispensable Connections
Sambhavi Tiwari
Manas Gogoi
Shekhar Verma
Krishna Pratap Singh
CLL
139
1
0
06 Apr 2023
HNeRV: A Hybrid Neural Representation for Videos
Computer Vision and Pattern Recognition (CVPR), 2023
Hao Chen
M. Gwilliam
Ser-Nam Lim
Abhinav Shrivastava
141
107
1
05 Apr 2023
Efficient human-in-loop deep learning model training with iterative refinement and statistical result validation
Manuel Zahn
Douglas P. Perrin
132
1
0
03 Apr 2023
Optimizing data-flow in Binary Neural Networks
Italian National Conference on Sensors (INS), 2023
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
192
6
0
03 Apr 2023
SEENN: Towards Temporal Spiking Early-Exit Neural Networks
Neural Information Processing Systems (NeurIPS), 2023
Yuhang Li
Tamar Geller
Youngeun Kim
Priyadarshini Panda
274
59
0
02 Apr 2023
A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service Inference
ACM Asia Conference on Computer and Communications Security (AsiaCCS), 2023
Abhinav Kumar
Miguel A. Guirao Aguilera
R. Tourani
Satyajayant Misra
AAML
385
1
0
31 Mar 2023
BOLT: An Automated Deep Learning Framework for Training and Deploying Large-Scale Search and Recommendation Models on Commodity CPU Hardware
International Conference on Information and Knowledge Management (CIKM), 2023
Nicholas Meisburger
V. Lakshman
Benito Geordie
Joshua Engels
David Torres Ramos
...
Benjamin Meisburger
Shubh Gupta
Yashwanth Adunukota
Tharun Medini
Anshumali Shrivastava
220
3
0
30 Mar 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Computer Vision and Pattern Recognition (CVPR), 2023
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
327
72
0
30 Mar 2023
Distributed Neural Representation for Reactive in situ Visualization
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2023
Qi Wu
J. Insley
V. Mateevitsi
S. Rizzi
M. Papka
Kwan-Liu Ma
160
5
0
28 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
IEEE International Conference on Computer Vision (ICCV), 2023
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
ViT
317
166
0
27 Mar 2023
Vision Models Can Be Efficiently Specialized via Few-Shot Task-Aware Compression
Denis Kuznedelev
Soroush Tabesh
Kimia Noorbakhsh
Elias Frantar
Sara Beery
Eldar Kurtic
Dan Alistarh
MQ
VLM
193
3
0
25 Mar 2023
PowerPruning: Selecting Weights and Activations for Power-Efficient Neural Network Acceleration
Design Automation Conference (DAC), 2023
Richard Petri
Grace Li Zhang
Yiran Chen
Ulf Schlichtmann
Bing Li
75
10
0
24 Mar 2023
Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT Systems
IEEE Internet of Things Journal (IEEE IoT J.), 2023
Jemin Lee
Yongin Kwon
Sihyeong Park
Misun Yu
Jeman Park
Hwanjun Song
ViT
MQ
222
12
0
22 Mar 2023
Low Rank Optimization for Efficient Deep Learning: Making A Balance between Compact Architecture and Fast Training
Journal of Systems Engineering and Electronics (JSEE), 2023
Xinwei Ou
Zhangxin Chen
Ce Zhu
Yipeng Liu
196
9
0
22 Mar 2023
Performance-aware Approximation of Global Channel Pruning for Multitask CNNs
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Hancheng Ye
Bo Zhang
Tao Chen
Jiayuan Fan
Sijin Yu
136
35
0
21 Mar 2023
Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective
Computer Vision and Pattern Recognition (CVPR), 2023
Yuexiao Ma
Huixia Li
Xiawu Zheng
Xuefeng Xiao
Rui Wang
Shilei Wen
Xin Pan
Jiayi Ji
Rongrong Ji
MQ
228
15
0
21 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
International Conference on Machine Learning (ICML), 2023
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
420
7
0
21 Mar 2023
Greedy Pruning with Group Lasso Provably Generalizes for Matrix Sensing
Neural Information Processing Systems (NeurIPS), 2023
Nived Rajaraman
Devvrit
Aryan Mokhtari
Kannan Ramchandran
236
3
0
20 Mar 2023
ExplainFix: Explainable Spatially Fixed Deep Networks
Alex Gaudio
Christos Faloutsos
A. Smailagic
P. Costa
A. Campilho
FAtt
143
3
0
18 Mar 2023
DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision Models
Yucheng Ding
Chaoyue Niu
Fan Wu
Shaojie Tang
Chengfei Lyu
Guihai Chen
160
10
0
18 Mar 2023
Unleashing the Potential of Spiking Neural Networks by Dynamic Confidence
IEEE International Conference on Computer Vision (ICCV), 2023
Chen Li
Edward Jones
Steve Furber
286
22
0
17 Mar 2023
Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
IEEE International Conference on Computer Vision (ICCV), 2023
Jiamian Wang
Huan Wang
Yulun Zhang
Yun Fu
Zhiqiang Tao
SupR
152
4
0
16 Mar 2023
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
W. Zhao
Qi Sun
Yang Bai
Wenbo Li
Haisheng Zheng
Bei Yu
Martin D. F. Wong
SupR
126
12
0
16 Mar 2023
Gated Compression Layers for Efficient Always-On Models
Haiguang Li
T. Thormundsson
I. Poupyrev
N. Gillian
166
3
0
15 Mar 2023
R2 Loss: Range Restriction Loss for Model Compression and Quantization
Arnav Kundu
Chungkuk Yoo
Srijan Mishra
Minsik Cho
Saurabh N. Adya
MQ
137
2
0
14 Mar 2023
MetaMixer: A Regularization Strategy for Online Knowledge Distillation
Maorong Wang
L. Xiao
T. Yamasaki
KELM
MoE
115
1
0
14 Mar 2023
FPUS23: An Ultrasound Fetus Phantom Dataset with Deep Neural Network Evaluations for Fetus Orientations, Fetal Planes, and Anatomical Features
IEEE Access (IEEE Access), 2023
B. Prabakaran
Paul Hamelmann
Erik Ostrowski
Mohamed Bennai
196
24
0
14 Mar 2023
Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Kaiqi Zhao
Animesh Jain
Ming Zhao
181
19
0
14 Mar 2023
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments
ACM/IEEE International Conference on Mobile Computing and Networking (MobiCom), 2023
Hao Wen
Yuanchun Li
Zunshuai Zhang
Shiqi Jiang
Xiaozhou Ye
Ouyang Ye
Yaqin Zhang
Yunxin Liu
231
54
0
13 Mar 2023
Three Guidelines You Should Know for Universally Slimmable Self-Supervised Learning
Computer Vision and Pattern Recognition (CVPR), 2023
Yunhao Cao
Peiqin Sun
Shuchang Zhou
107
5
0
13 Mar 2023
OTOV2: Automatic, Generic, User-Friendly
International Conference on Learning Representations (ICLR), 2023
Tianyi Chen
Luming Liang
Tian Ding
Zhihui Zhu
Ilya Zharkov
VLM
MQ
217
48
0
13 Mar 2023
Complement Sparsification: Low-Overhead Model Pruning for Federated Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xiaopeng Jiang
Cristian Borcea
FedML
165
30
0
10 Mar 2023
Sparse and Local Networks for Hypergraph Reasoning
LOG IN (LOG IN), 2023
Guangxuan Xiao
L. Kaelbling
Jiajun Wu
Jiayuan Mao
NAI
ReLM
LRM
182
1
0
09 Mar 2023
A Privacy Preserving System for Movie Recommendations Using Federated Learning
David Neumann
Andreas Lutz
Karsten Müller
Wojciech Samek
333
16
0
07 Mar 2023
An Edge-based WiFi Fingerprinting Indoor Localization Using Convolutional Neural Network and Convolutional Auto-Encoder
IEEE Access (IEEE Access), 2023
Amin Kargar-Barzi
Ebrahim Farahmand
Nooshin Taheri Chatrudi
A. Mahani
M. Shafique
140
18
0
07 Mar 2023
Training-Free Acceleration of ViTs with Delayed Spatial Merging
J. Heo
Seyedarmin Azizi
A. Fayyazi
Massoud Pedram
254
4
0
04 Mar 2023
Adversarial Attacks on Machine Learning in Embedded and IoT Platforms
Christian Westbrook
S. Pasricha
AAML
129
3
0
03 Mar 2023
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
International Conference on Learning Representations (ICLR), 2023
Shiwei Liu
Tianlong Chen
Zhenyu Zhang
Xuxi Chen
Tianjin Huang
Ajay Jaiswal
Zinan Lin
182
31
0
03 Mar 2023
Rotation Invariant Quantization for Model Compression
Dor-Joseph Kampeas
Yury Nahshan
Hanoch Kremer
Gil Lederman
Shira Zaloshinski
Zheng Li
E. Haleva
MQ
244
1
0
03 Mar 2023
TopSpark: A Timestep Optimization Methodology for Energy-Efficient Spiking Neural Networks on Autonomous Mobile Agents
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Rachmad Vidya Wicaksana Putra
Mohamed Bennai
172
18
0
03 Mar 2023
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
258
6
0
02 Mar 2023
Learning to Grow Pretrained Models for Efficient Transformer Training
International Conference on Learning Representations (ICLR), 2023
Peihao Wang
Yikang Shen
Lucas Torroba Hennigen
P. Greengard
Leonid Karlinsky
Rogerio Feris
David D. Cox
Zinan Lin
Yoon Kim
183
70
0
02 Mar 2023
EdgeServe: A Streaming System for Decentralized Model Serving
Ted Shaowang
Sanjay Krishnan
193
2
0
02 Mar 2023
Structured Pruning for Deep Convolutional Neural Networks: A survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yang He
Lingao Xiao
3DPC
348
255
0
01 Mar 2023
GRAN: Ghost Residual Attention Network for Single Image Super Resolution
Axi Niu
Pei Wang
Yu Zhu
Jinqiu Sun
Qingsen Yan
Yanning Zhang
SupR
141
9
0
28 Feb 2023
Previous
1
2
3
...
18
19
20
...
71
72
73
Next