Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,631 papers shown
Revisiting Structured Dropout
Asian Conference on Machine Learning (ACML), 2022
Yiren Zhao
Oluwatomisin Dada
Xitong Gao
Robert D. Mullins
BDL
182
5
0
05 Oct 2022
Streaming Video Analytics On The Edge With Asynchronous Cloud Support
Anurag Ghosh
Srinivasan Iyengar
Stephen Lee
Anuj Rathore
Venkat N. Padmanabhan
193
2
0
04 Oct 2022
spred: Solving
L
1
L_1
L
1
Penalty with SGD
International Conference on Machine Learning (ICML), 2022
Liu Ziyin
Zihao Wang
539
20
0
03 Oct 2022
Limitations of neural network training due to numerical instability of backpropagation
Advances in Computational Mathematics (ACM), 2022
Clemens Karner
V. Kazeev
P. Petersen
259
5
0
03 Oct 2022
EAPruning: Evolutionary Pruning for Vision Transformers and CNNs
British Machine Vision Conference (BMVC), 2022
Qingyuan Li
Bo Zhang
Xiangxiang Chu
ViT
VLM
132
3
0
01 Oct 2022
Diving into Unified Data-Model Sparsity for Class-Imbalanced Graph Representation Learning
Chunhui Zhang
Chao Huang
Yijun Tian
Qianlong Wen
Z. Ouyang
Youhuan Li
Yanfang Ye
Chuxu Zhang
139
8
0
01 Oct 2022
Compressed Gastric Image Generation Based on Soft-Label Dataset Distillation for Medical Data Sharing
Guang Li
Ren Togo
Takahiro Ogawa
Miki Haseyama
DD
233
54
0
29 Sep 2022
Physics-aware Differentiable Discrete Codesign for Diffractive Optical Neural Networks
Yingjie Li
Ruiyang Chen
Weilu Gao
Cunxi Yu
197
13
0
28 Sep 2022
Adaptive Sparse ViT: Towards Learnable Adaptive Token Pruning by Fully Exploiting Self-Attention
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Xiangcheng Liu
Tianyi Wu
Guodong Guo
ViT
205
45
0
28 Sep 2022
Sauron U-Net: Simple automated redundancy elimination in medical image segmentation via filter pruning
Neurocomputing (Neurocomputing), 2022
Juan Miguel Valverde
Artem Shatillo
Jussi Tohka
AAML
208
12
0
27 Sep 2022
Neural Network Panning: Screening the Optimal Sparse Network Before Training
Asian Conference on Computer Vision (ACCV), 2022
Xiatao Kang
P. Li
Jiayi Yao
Chengxi Li
VLM
123
1
0
27 Sep 2022
Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models
Neural Information Processing Systems (NeurIPS), 2022
Xiuying Wei
Yunchen Zhang
Xiangguo Zhang
Yazhe Niu
Shanghang Zhang
Tao Gui
F. Yu
Xianglong Liu
MQ
378
194
0
27 Sep 2022
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training
Neural Information Processing Systems (NeurIPS), 2022
Geng Yuan
Yanyu Li
Sheng Li
Zhenglun Kong
Sergey Tulyakov
Xulong Tang
Yanzhi Wang
Jian Ren
290
20
0
22 Sep 2022
Deep Learning on Home Drone: Searching for the Optimal Architecture
IEEE International Conference on Robotics and Automation (ICRA), 2022
Alaa Maalouf
Yotam Gurfinkel
Barak Diker
O. Gal
Daniela Rus
Dan Feldman
125
6
0
21 Sep 2022
State-driven Implicit Modeling for Sparsity and Robustness in Neural Networks
Alicia Y. Tsai
Juliette Decugis
L. Ghaoui
Alper Atamtürk
207
3
0
19 Sep 2022
Tree-based Text-Vision BERT for Video Search in Baidu Video Advertising
Tan Yu
Jie Liu
Yi Yang
Yi Li
Hongliang Fei
Ping Li
136
2
0
19 Sep 2022
Enabling Conversational Interaction with Mobile UI using Large Language Models
International Conference on Human Factors in Computing Systems (CHI), 2022
Bryan Wang
Gang Li
Yang Li
402
174
0
18 Sep 2022
Improving the Performance of DNN-based Software Services using Automated Layer Caching
M. Abedi
Yanni Iouannou
Pooyan Jamshidi
Hadi Hemmati
129
0
0
18 Sep 2022
Pruning Neural Networks via Coresets and Convex Geometry: Towards No Assumptions
Neural Information Processing Systems (NeurIPS), 2022
M. Tukan
Loay Mualem
Alaa Maalouf
3DPC
187
26
0
18 Sep 2022
Learning to Weight Samples for Dynamic Early-exiting Networks
European Conference on Computer Vision (ECCV), 2022
Yizeng Han
Yifan Pu
Zihang Lai
Chaofei Wang
Qing Xiao
Junfen Cao
Wenhui Huang
Chao Deng
Gao Huang
269
65
0
17 Sep 2022
PPT: token-Pruned Pose Transformer for monocular and multi-view human pose estimation
European Conference on Computer Vision (ECCV), 2022
Haoyu Ma
Zhe Wang
Yifei Chen
Deying Kong
Liangjian Chen
Xingwei Liu
Xiangyi Yan
Hao Tang
Xiaohui Xie
ViT
180
70
0
16 Sep 2022
Self-Attentive Pooling for Efficient Deep Learning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Fang Chen
Gourav Datta
Souvik Kundu
Peter A. Beerel
259
13
0
16 Sep 2022
Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Sheng-Chun Kao
Amir Yazdanbakhsh
Suvinay Subramanian
Shivani Agrawal
Utku Evci
T. Krishna
322
15
0
15 Sep 2022
MSREP: A Fast yet Light Sparse Matrix Framework for Multi-GPU Systems
Jieyang Chen
Chenhao Xie
J. Firoz
Jiajia Li
Shuaiwen Leon Song
Kevin J. Barker
Mark Raugas
Ang Li
149
3
0
15 Sep 2022
Neural Networks Reduction via Lumping
International Conference of the Italian Association for Artificial Intelligence (AIxIA), 2022
Dalila Ressi
Riccardo Romanello
S. Rossi
Carla Piazza
226
5
0
15 Sep 2022
Efficient Quantized Sparse Matrix Operations on Tensor Cores
International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2022
Shigang Li
Kazuki Osawa
Torsten Hoefler
415
45
0
14 Sep 2022
Federated Pruning: Improving Neural Network Efficiency with Federated Learning
Interspeech (Interspeech), 2022
Rongmei Lin
Yonghui Xiao
Tien-Ju Yang
Ding Zhao
Li Xiong
Giovanni Motta
Franccoise Beaufays
FedML
146
14
0
14 Sep 2022
Sparsity-guided Network Design for Frame Interpolation
Tian Ding
Luming Liang
Zhihui Zhu
Tianyi Chen
Ilya Zharkov
232
7
0
09 Sep 2022
ApproxTrain: Fast Simulation of Approximate Multipliers for DNN Training and Inference
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD), 2022
Jing Gong
Hassaan Saadat
Hasindu Gamaarachchi
Haris Javaid
X. Hu
S. Parameswaran
249
21
0
09 Sep 2022
Seeking Interpretability and Explainability in Binary Activated Neural Networks
Benjamin Leblanc
Pascal Germain
FAtt
449
1
0
07 Sep 2022
Improving the Cross-Lingual Generalisation in Visual Question Answering
AAAI Conference on Artificial Intelligence (AAAI), 2022
Farhad Nooralahzadeh
Rico Sennrich
245
8
0
07 Sep 2022
Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU
Jian-He Liao
Mingzhen Li
Qingxiao Sun
Jiwei Hao
F. Yu
...
Ye Tao
Zicheng Zhang
Hailong Yang
Zhongzhi Luan
D. Qian
152
4
0
06 Sep 2022
Low-Power Hardware-Based Deep-Learning Diagnostics Support Case Study
Biomedical Circuits and Systems Conference (BioCAS), 2018
Khushal Sethi
V. Parmar
Manan Suri
81
15
0
03 Sep 2022
Incremental Online Learning Algorithms Comparison for Gesture and Visual Smart Sensors
IEEE International Joint Conference on Neural Network (IJCNN), 2022
Alessandro Avi
Andrea Albanese
Davide Brunelli
235
11
0
01 Sep 2022
On Quantizing Implicit Neural Representations
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Cameron Gordon
Shin-Fang Chng
L. MacDonald
Simon Lucey
MQ
309
22
0
01 Sep 2022
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization
Micro (MICRO), 2022
Cong Guo
Chen Zhang
Jingwen Leng
Zihan Liu
Fan Yang
Yun-Bo Liu
Minyi Guo
Yuhao Zhu
MQ
179
96
0
30 Aug 2022
Symmetric Pruning in Quantum Neural Networks
International Conference on Learning Representations (ICLR), 2022
Xinbiao Wang
Sergii Strelchuk
Tongliang Liu
Yong Luo
Yuxuan Du
Dacheng Tao
261
28
0
30 Aug 2022
A Deep Neural Networks ensemble workflow from hyperparameter search to inference leveraging GPU clusters
International Conference on High Performance Computing in Asia-Pacific Region (HPCAsia), 2022
Pierrick Pochelu
S. Petiton
B. Conche
AI4CE
216
6
0
30 Aug 2022
Reducing Computational Complexity of Neural Networks in Optical Channel Equalization: From Concepts to Implementation
Journal of Lightwave Technology (JLT), 2022
Pedro J. Freire
A. Napoli
D. A. Ron
B. Spinnler
M. Anderson
W. Schairer
T. Bex
N. Costa
S. Turitsyn
Jaroslaw E. Prilepsky
288
40
0
26 Aug 2022
Complexity-Driven CNN Compression for Resource-constrained Edge AI
IEEE Transactions on Artificial Intelligence (IEEE TAI), 2022
Muhammad Zawish
Steven Davy
L. Abraham
210
33
0
26 Aug 2022
Anytime-Lidar: Deadline-aware 3D Object Detection
IEEE International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA), 2022
Ahmet Soyyigit
Shuochao Yao
H. Yun
3DPC
125
10
0
25 Aug 2022
Adaptation of MobileNetV2 for Face Detection on Ultra-Low Power Platform
Swiss Conference on Data Science (SDS), 2022
Simon Narduzzi
Engin Turetken
Jean-Philippe Thiran
L. A. Dunbar
3DH
CVBM
80
1
0
23 Aug 2022
Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference Cost
AAAI Conference on Artificial Intelligence (AAAI), 2022
Lu Yin
Shiwei Liu
Fang Meng
Tianjin Huang
Vlado Menkovski
Mykola Pechenizkiy
122
14
0
23 Aug 2022
RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN
European Conference on Computer Vision (ECCV), 2022
Huy Phan
Cong Shi
Yi Xie
Tian-Di Zhang
Zhuohang Li
Tianming Zhao
Jian-Dong Liu
Yan Wang
Ying-Cong Chen
Bo Yuan
AAML
219
8
0
22 Aug 2022
Mixed-Precision Neural Networks: A Survey
M. Rakka
M. Fouda
Pramod P. Khargonekar
Fadi J. Kurdahi
MQ
296
19
0
11 Aug 2022
An Accelerated Doubly Stochastic Gradient Method with Faster Explicit Model Identification
International Conference on Information and Knowledge Management (CIKM), 2022
Runxue Bao
Bin Gu
Heng-Chiao Huang
266
18
0
11 Aug 2022
Self-Knowledge Distillation via Dropout
Computer Vision and Image Understanding (CVIU), 2022
Hyoje Lee
Yeachan Park
Hyun Seo
Myung-joo Kang
FedML
149
23
0
11 Aug 2022
Safety and Performance, Why not Both? Bi-Objective Optimized Model Compression toward AI Software Deployment
International Conference on Automated Software Engineering (ASE), 2022
Jie Zhu
Leye Wang
Xiao Han
243
14
0
11 Aug 2022
Customized Watermarking for Deep Neural Networks via Label Distribution Perturbation
Tzu-Yun Chien
Chih-Ya Shen
AAML
101
2
0
10 Aug 2022
Fast Heterogeneous Federated Learning with Hybrid Client Selection
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Guangyuan Shen
D. Gao
Duanxiao Song
Libin Yang
Xukai Zhou
Shirui Pan
W. Lou
Fang Zhou
FedML
369
16
0
10 Aug 2022
Previous
1
2
3
...
22
23
24
...
71
72
73
Next