Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.15929
Cited By
E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity
24 October 2023
Yun Li
Lin Niu
Xipeng Zhang
Kai Liu
Jianchen Zhu
Zhanhui Kang
MoE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity"
5 / 5 papers shown
Title
Efficient Reasoning Models: A Survey
Sicheng Feng
Gongfan Fang
Xinyin Ma
Xinchao Wang
ReLM
LRM
107
0
0
15 Apr 2025
Symmetric Pruning of Large Language Models
Kai Yi
Peter Richtárik
AAML
VLM
57
0
0
31 Jan 2025
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
57
15
0
06 Oct 2024
Boosting Mobile CNN Inference through Semantic Memory
Yun Li
Chen Zhang
S. Han
Li Lyna Zhang
B. Yin
Yunxin Liu
Mengwei Xu
ObjD
39
16
0
05 Dec 2021
What is the State of Neural Network Pruning?
Davis W. Blalock
Jose Javier Gonzalez Ortiz
Jonathan Frankle
John Guttag
178
1,027
0
06 Mar 2020
1