Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2001.00138
Cited By
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
1 January 2020
Wei Niu
Xiaolong Ma
Sheng Lin
Shihao Wang
Xuehai Qian
X. Lin
Yanzhi Wang
Bin Ren
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning"
50 / 82 papers shown
Title
Dynamic Gradient Sparse Update for Edge Training
I-Hsuan Li
Tian-Sheuan Chang
66
1
0
23 Mar 2025
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Xubin Wang
Zhiqing Tang
Jianxiong Guo
Tianhui Meng
Chenhao Wang
Tian-sheng Wang
Weijia Jia
52
1
0
08 Mar 2025
Low-Rank Compression for IMC Arrays
Kang Eun Jeon
Johnny Rhe
J. Ko
40
0
0
10 Feb 2025
UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles
Abhishek Balasubramaniam
Febin P. Sunny
S. Pasricha
3DPC
41
0
0
08 Jan 2025
AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning
Lixian Jing
Jianpeng Qi
Junyu Dong
Yanwei Yu
3DPC
AI4CE
44
0
0
24 Dec 2024
BlabberSeg: Real-Time Embedded Open-Vocabulary Aerial Segmentation
Haechan Mark Bong
Ricardo de Azambuja
Giovanni Beltrame
VLM
33
0
0
16 Oct 2024
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model
Mingcan Xiang
Steven Jiaxun Tang
Qizheng Yang
Hui Guan
Tongping Liu
VLM
39
0
0
07 Aug 2024
Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices
Hayun Lee
Dongkun Shin
MQ
28
0
0
29 Jul 2024
AyE-Edge: Automated Deployment Space Search Empowering Accuracy yet Efficient Real-Time Object Detection on the Edge
Chao Wu
Yifan Gong
Liangkai Liu
Mengquan Li
Yushu Wu
Xuan Shen
Zhimin Li
Geng Yuan
Weisong Shi
Yanzhi Wang
23
1
0
25 Jul 2024
SoD
2
^2
2
: Statically Optimizing Dynamic Deep Neural Network
Wei Niu
Gagan Agrawal
Bin Ren
33
4
0
29 Feb 2024
REPrune: Channel Pruning via Kernel Representative Selection
Mincheol Park
Dongjin Kim
Cheonjun Park
Yuna Park
Gyeong Eun Gong
Won Woo Ro
Suhyun Kim
VLM
46
1
0
27 Feb 2024
Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons
Zhenyu Liu
Garrett Gagnon
Swagath Venkataramani
Liu Liu
AAML
28
0
0
06 Feb 2024
SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing
Sheng Li
Geng Yuan
Yuezhen Dai
Youtao Zhang
Yanzhi Wang
Xulong Tang
31
18
0
30 Jan 2024
DTMM: Deploying TinyML Models on Extremely Weak IoT Devices with Pruning
Lixiang Han
Zhen Xiao
Zhenjiang Li
41
5
0
17 Jan 2024
Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization
K. Balaskas
Andreas Karatzas
Christos Sad
K. Siozios
Iraklis Anagnostopoulos
Georgios Zervakis
Jörg Henkel
MQ
33
10
0
23 Dec 2023
Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI
Kai Huang
Wei Gao
15
35
0
21 Dec 2023
EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge
Bufang Yang
Lixing He
Neiwen Ling
Zhenyu Yan
Guoliang Xing
Xian Shuai
Xiaozhe Ren
Xin Jiang
43
20
0
18 Nov 2023
SparseByteNN: A Novel Mobile Inference Acceleration Framework Based on Fine-Grained Group Sparsity
Haitao Xu
Songwei Liu
Yuyang Xu
Shuai Wang
Jiashi Li
Chenqian Yan
Liangqiang Li
Lean Fu
Xin Pan
Fangmin Chen
MQ
17
0
0
30 Oct 2023
Edge-InversionNet: Enabling Efficient Inference of InversionNet on Edge Devices
Zhepeng Wang
Isaacshubhanand Putla
Weiwen Jiang
Youzuo Lin
24
2
0
14 Oct 2023
Enabling Resource-efficient AIoT System with Cross-level Optimization: A survey
Sicong Liu
Bin Guo
Cheng Fang
Ziqi Wang
Shiyan Luo
Zimu Zhou
Zhiwen Yu
AI4CE
34
22
0
27 Sep 2023
Efficient N:M Sparse DNN Training Using Algorithm, Architecture, and Dataflow Co-Design
Chao Fang
Wei Sun
Aojun Zhou
Zhongfeng Wang
11
10
0
22 Sep 2023
Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges
Fei Dou
Jin Ye
Geng Yuan
Qin Lu
Wei Niu
...
Hongyue Sun
Yunli Shao
Changying Li
Tianming Liu
Wenzhan Song
AI4CE
37
29
0
14 Sep 2023
LLMCad: Fast and Scalable On-device Large Language Model Inference
Daliang Xu
Wangsong Yin
Xin Jin
Yuhang Zhang
Shiyun Wei
Mengwei Xu
Xuanzhe Liu
17
43
0
08 Sep 2023
FPGA Resource-aware Structured Pruning for Real-Time Neural Networks
Benjamin Ramhorst
Vladimir Loncar
G. Constantinides
25
4
0
09 Aug 2023
Towards Machine Learning and Inference for Resource-constrained MCUs
Yu-Shan Huang
Hamed Haddadi
24
1
0
30 May 2023
Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study
Muzhou Yu
Linfeng Zhang
Kaisheng Ma
23
2
0
22 May 2023
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity
Yannan Nellie Wu
Po-An Tsai
Saurav Muralidharan
A. Parashar
Vivienne Sze
J. Emer
29
23
0
22 May 2023
Surrogate Lagrangian Relaxation: A Path To Retrain-free Deep Neural Network Pruning
Shangli Zhou
Mikhail A. Bragin
Lynn Pepin
Deniz Gurevin
Fei Miao
Caiwen Ding
16
3
0
08 Apr 2023
Mobiprox: Supporting Dynamic Approximate Computing on Mobiles
Matevz Fabjancic
O. Machidon
Hashim Sharif
Yifan Zhao
Sasa Misailovic
V. Pejović
24
2
0
16 Mar 2023
R-TOSS: A Framework for Real-Time Object Detection using Semi-Structured Pruning
Abhishek Balasubramaniam
Febin P. Sunny
S. Pasricha
VLM
38
12
0
03 Mar 2023
When Layers Play the Lottery, all Tickets Win at Initialization
Artur Jordão
George Correa de Araujo
H. Maia
Hélio Pedrini
13
3
0
25 Jan 2023
SGCN: Exploiting Compressed-Sparse Features in Deep Graph Convolutional Network Accelerators
Mingi Yoo
Jaeyong Song
Jounghoo Lee
Namhyung Kim
Youngsok Kim
Jinho Lee
GNN
35
17
0
25 Jan 2023
Slice-and-Forge: Making Better Use of Caches for Graph Convolutional Network Accelerators
Min-hee Yoo
Jaeyong Song
Hyeyoon Lee
Jounghoo Lee
Namhyung Kim
Youngsok Kim
Jinho Lee
GNN
36
5
0
24 Jan 2023
Reaching the Edge of the Edge: Image Analysis in Space
R. Bayer
Julian Priest
Pınar Tözün
27
5
0
12 Jan 2023
All-in-One: A Highly Representative DNN Pruning Framework for Edge Devices with Dynamic Power Management
Yifan Gong
Zheng Zhan
Pu Zhao
Yushu Wu
Chaoan Wu
Caiwen Ding
Weiwen Jiang
Minghai Qin
Yanzhi Wang
23
7
0
09 Dec 2022
Data-Model-Circuit Tri-Design for Ultra-Light Video Intelligence on Edge Devices
Yimeng Zhang
A. Kamath
Qiucheng Wu
Zhiwen Fan
Wuyang Chen
Zhangyang Wang
Shiyu Chang
Sijia Liu
Cong Hao
21
6
0
16 Oct 2022
Advancing Model Pruning via Bi-level Optimization
Yihua Zhang
Yuguang Yao
Parikshit Ram
Pu Zhao
Tianlong Chen
Min-Fong Hong
Yanzhi Wang
Sijia Liu
49
68
0
08 Oct 2022
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training
Geng Yuan
Yanyu Li
Sheng Li
Zhenglun Kong
Sergey Tulyakov
Xulong Tang
Yanzhi Wang
Jian Ren
33
15
0
22 Sep 2022
SparCL: Sparse Continual Learning on the Edge
Zifeng Wang
Zheng Zhan
Yifan Gong
Geng Yuan
Wei Niu
T. Jian
Bin Ren
Stratis Ioannidis
Yanzhi Wang
Jennifer Dy
CLL
60
58
0
20 Sep 2022
Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution
Yushu Wu
Yifan Gong
Pu Zhao
Yanyu Li
Zheng Zhan
Wei Niu
Hao Tang
Minghai Qin
Bin Ren
Yanzhi Wang
SupR
MQ
32
23
0
25 Jul 2022
EVE: Environmental Adaptive Neural Network Models for Low-power Energy Harvesting System
Sahidul Islam
Shangli Zhou
Ran Ran
Yufang Jin
Wu-Shao Wen
Caiwen Ding
Mimi Xie
26
9
0
14 Jul 2022
Sparse Periodic Systolic Dataflow for Lowering Latency and Power Dissipation of Convolutional Neural Network Accelerators
J. Heo
A. Fayyazi
Amirhossein Esmaili
Massoud Pedram
11
3
0
30 Jun 2022
Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding
Connor Holmes
Minjia Zhang
Yuxiong He
Bo Wu
17
3
0
30 Jun 2022
CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing Framework
Xiaofeng Li
Bin Ren
Xipeng Shen
Yanzhi Wang
GNN
20
0
0
21 Jun 2022
Boosting DNN Cold Inference on Edge Devices
Rongjie Yi
Ting Cao
Ao Zhou
Xiao Ma
Shangguang Wang
Mengwei Xu
74
6
0
15 Jun 2022
Slim-neck by GSConv: A lightweight-design for real-time detector architectures
Hulin Li
Jun Li
Hanbing Wei
Zheng Liu
Zhenfei Zhan
Qiliang Ren
18
151
0
06 Jun 2022
Compilation and Optimizations for Efficient Machine Learning on Embedded Systems
Xiaofan Zhang
Yao Chen
Cong Hao
Sitao Huang
Yuhong Li
Deming Chen
39
1
0
06 Jun 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Yujun Lin
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
24
107
0
25 Apr 2022
LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
Sharath Girish
Kamal Gupta
Saurabh Singh
Abhinav Shrivastava
33
11
0
06 Apr 2022
Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets
Tianlong Chen
Xuxi Chen
Xiaolong Ma
Yanzhi Wang
Zhangyang Wang
16
34
0
09 Feb 2022
1
2
Next