Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.03294
Cited By
A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers
10 April 2018
Tianyun Zhang
Shaokai Ye
Kaiqi Zhang
Jian Tang
Wujie Wen
M. Fardad
Yanzhi Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers"
50 / 60 papers shown
Title
Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification
Arshia Kermani
Ehsan Zeraatkar
Habib Irani
41
2
0
23 Feb 2025
Uncertainty-Guided Appearance-Motion Association Network for Out-of-Distribution Action Detection
Xiang Fang
Arvind Easwaran
B. Genest
36
4
0
16 Sep 2024
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
Mingcan Xiang
Steven Jiaxun Tang
Qizheng Yang
Hui Guan
Tongping Liu
VLM
39
0
0
07 Aug 2024
A Generic Layer Pruning Method for Signal Modulation Recognition Deep Learning Models
Yao Lu
Yutao Zhu
Yuqi Li
Dongwei Xu
Yun Lin
Qi Xuan
Xiaoniu Yang
39
5
0
12 Jun 2024
Combining Relevance and Magnitude for Resource-Aware DNN Pruning
C. Chiasserini
F. Malandrino
Nuria Molner
Zhiqiang Zhao
32
0
0
21 May 2024
SCoTTi: Save Computation at Training Time with an adaptive framework
Ziyu Li
Enzo Tartaglione
Van-Tam Nguyen
31
0
0
19 Dec 2023
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment
Jing Wu
Lin Wang
Qirui Jin
Fangming Liu
31
11
0
17 Dec 2023
FedDIP: Federated Learning with Extreme Dynamic Pruning and Incremental Regularization
Qianyu Long
Christos Anagnostopoulos
S. P. Parambath
Daning Bi
AI4CE
FedML
23
2
0
13 Sep 2023
Training Acceleration of Low-Rank Decomposed Networks using Sequential Freezing and Rank Quantization
Habib Hajimolahoseini
Walid Ahmed
Yang Liu
OffRL
MQ
19
6
0
07 Sep 2023
Learning Kernel-Modulated Neural Representation for Efficient Light Field Compression
Jinglei Shi
Yihong Xu
C. Guillemot
19
6
0
12 Jul 2023
Learning-based Spatial and Angular Information Separation for Light Field Compression
Jinglei Shi
Yihong Xu
C. Guillemot
21
0
0
13 Apr 2023
Surrogate Lagrangian Relaxation: A Path To Retrain-free Deep Neural Network Pruning
Shangli Zhou
Mikhail A. Bragin
Lynn Pepin
Deniz Gurevin
Fei Miao
Caiwen Ding
16
3
0
08 Apr 2023
Physics-aware Roughness Optimization for Diffractive Optical Neural Networks
Shangli Zhou
Yingjie Li
Minhan Lou
Weilu Gao
Zhijie Shi
Cunxi Yu
Caiwen Ding
27
2
0
04 Apr 2023
On Model Compression for Neural Networks: Framework, Algorithm, and Convergence Guarantee
Chenyang Li
Jihoon Chung
Mengnan Du
Haimin Wang
Xianlian Zhou
Bohao Shen
33
1
0
13 Mar 2023
Less is More: The Influence of Pruning on the Explainability of CNNs
David Weber
F. Merkle
Pascal Schöttle
Stephan Schlögl
Martin Nocker
FAtt
34
1
0
17 Feb 2023
Dynamic Sparse Training via Balancing the Exploration-Exploitation Trade-off
Shaoyi Huang
Bowen Lei
Dongkuan Xu
Hongwu Peng
Yue Sun
Mimi Xie
Caiwen Ding
21
19
0
30 Nov 2022
Towards Real-Time Temporal Graph Learning
Deniz Gurevin
Mohsin Shan
Tong Geng
Weiwen Jiang
Caiwen Ding
O. Khan
AI4TS
AI4CE
37
0
0
08 Oct 2022
PIM-QAT: Neural Network Quantization for Processing-In-Memory (PIM) Systems
Qing Jin
Zhiyu Chen
J. Ren
Yanyu Li
Yanzhi Wang
Kai-Min Yang
MQ
13
2
0
18 Sep 2022
Towards Sparsification of Graph Neural Networks
Hongwu Peng
Deniz Gurevin
Shaoyi Huang
Tong Geng
Weiwen Jiang
O. Khan
Caiwen Ding
GNN
30
24
0
11 Sep 2022
Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis
Long Zhuo
Guangcong Wang
Shikai Li
Wayne Wu
Ziwei Liu
VGen
53
20
0
11 Jul 2022
Quantum Neural Network Compression
Zhirui Hu
Peiyan Dong
Zhepeng Wang
Youzuo Lin
Yanzhi Wang
Weiwen Jiang
GNN
25
28
0
04 Jul 2022
Pruning has a disparate impact on model accuracy
Cuong Tran
Ferdinando Fioretto
Jung-Eun Kim
Rakshit Naidu
39
38
0
26 May 2022
PointDistiller: Structured Knowledge Distillation Towards Efficient and Compact 3D Detection
Linfeng Zhang
Runpei Dong
Hung-Shuo Tai
Kaisheng Ma
3DPC
72
47
0
23 May 2022
E^2TAD: An Energy-Efficient Tracking-based Action Detector
Xin Hu
Zhenyu Wu
Haoyuan Miao
Siqi Fan
Taiyu Long
...
Pengcheng Pi
Yi Wu
Zhou Ren
Zhangyang Wang
G. Hua
24
2
0
09 Apr 2022
Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning
Guyue Huang
Haoran Li
Minghai Qin
Fei Sun
Yufei Din
Yuan Xie
25
18
0
09 Mar 2022
Enabling On-Device Smartphone GPU based Training: Lessons Learned
Anish Das
Young D. Kwon
Jagmohan Chauhan
Cecilia Mascolo
3DH
30
10
0
21 Feb 2022
ICSML: Industrial Control Systems ML Framework for native inference using IEC 61131-3 code
Constantine Doumanidis
Prashant Hari Narayan Rajput
Michail Maniatakos
14
2
0
21 Feb 2022
Deadwooding: Robust Global Pruning for Deep Neural Networks
Sawinder Kaur
Ferdinando Fioretto
Asif Salekin
19
4
0
10 Feb 2022
Mixture-of-Rookies: Saving DNN Computations by Predicting ReLU Outputs
D. Pinto
J. Arnau
Antonio González
31
1
0
10 Feb 2022
Robust Binary Models by Pruning Randomly-initialized Networks
Chen Liu
Ziqi Zhao
Sabine Süsstrunk
Mathieu Salzmann
TPM
AAML
MQ
26
4
0
03 Feb 2022
Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting
Minghai Qin
Tianyun Zhang
Fei Sun
Yen-kuang Chen
M. Fardad
Yanzhi Wang
Yuan Xie
49
0
0
21 Dec 2021
Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration
Yifan Gong
Geng Yuan
Zheng Zhan
Wei Niu
Zhengang Li
...
Sijia Liu
Bin Ren
Xue Lin
Xulong Tang
Yanzhi Wang
20
10
0
22 Nov 2021
RGP: Neural Network Pruning through Its Regular Graph Structure
Zhuangzhi Chen
Jingyang Xiang
Yao Lu
Qi Xuan
Xiaoniu Yang
25
1
0
28 Oct 2021
Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks
Hassan Dbouk
Naresh R Shanbhag
AAML
21
7
0
28 Oct 2021
CHIP: CHannel Independence-based Pruning for Compact Neural Networks
Yang Sui
Miao Yin
Yi Xie
Huy Phan
S. Zonouz
Bo Yuan
VLM
33
128
0
26 Oct 2021
Weight Evolution: Improving Deep Neural Networks Training through Evolving Inferior Weight Values
Zhenquan Lin
K. Guo
Xiaofen Xing
Xiangmin Xu
ODL
24
1
0
09 Oct 2021
FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator
Geng Yuan
Payman Behnam
Zhengang Li
Ali Shafiee
Sheng Lin
...
Hang Liu
Xuehai Qian
M. N. Bojnordi
Yanzhi Wang
Caiwen Ding
19
68
0
16 Jun 2021
Deep Neural Networks Based Weight Approximation and Computation Reuse for 2-D Image Classification
M. Tolba
H. Tesfai
H. Saleh
B. Mohammad
Mahmoud Al-Qutayri
18
4
0
28 Apr 2021
Content-Aware GAN Compression
Yuchen Liu
Zhixin Shu
Yijun Li
Zhe-nan Lin
Federico Perazzi
S. Kung
GAN
35
58
0
06 Apr 2021
AttentionLite: Towards Efficient Self-Attention Models for Vision
Souvik Kundu
Sairam Sundaresan
16
22
0
21 Dec 2020
Bringing AI To Edge: From Deep Learning's Perspective
Di Liu
Hao Kong
Xiangzhong Luo
Weichen Liu
Ravi Subramaniam
52
116
0
25 Nov 2020
Transform Quantization for CNN (Convolutional Neural Network) Compression
Sean I. Young
Wang Zhe
David S. Taubman
B. Girod
MQ
29
69
0
02 Sep 2020
HMQ: Hardware Friendly Mixed Precision Quantization Block for CNNs
H. Habi
Roy H. Jennings
Arnon Netzer
MQ
21
65
0
20 Jul 2020
Computation on Sparse Neural Networks: an Inspiration for Future Hardware
Fei Sun
Minghai Qin
Tianyun Zhang
Liu Liu
Yen-kuang Chen
Yuan Xie
29
7
0
24 Apr 2020
Learning Low-rank Deep Neural Networks via Singular Vector Orthogonality Regularization and Singular Value Sparsification
Huanrui Yang
Minxue Tang
W. Wen
Feng Yan
Daniel Hu
Ang Li
H. Li
Yiran Chen
31
63
0
20 Apr 2020
Filter Sketch for Network Pruning
Mingbao Lin
Liujuan Cao
Shaojie Li
QiXiang Ye
Yonghong Tian
Jianzhuang Liu
Q. Tian
Rongrong Ji
CLIP
3DPC
25
82
0
23 Jan 2020
BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method
Xiaolong Ma
ZeLin Li
Yifan Gong
Tianyun Zhang
Wei Niu
...
Pu Zhao
Jian Tang
X. Lin
Bin Ren
Yanzhi Wang
12
14
0
23 Jan 2020
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Wei Niu
Xiaolong Ma
Sheng Lin
Shihao Wang
Xuehai Qian
X. Lin
Yanzhi Wang
Bin Ren
MQ
11
226
0
01 Jan 2020
Adaptive Loss-aware Quantization for Multi-bit Networks
Zhongnan Qu
Zimu Zhou
Yun Cheng
Lothar Thiele
MQ
30
53
0
18 Dec 2019
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices
Xiaolong Ma
Fu-Ming Guo
Wei Niu
Xue Lin
Jian Tang
Kaisheng Ma
Bin Ren
Yanzhi Wang
CVBM
22
173
0
06 Sep 2019
1
2
Next