2407.07304
Cited By
Inference Performance Optimization for Large Language Models on CPUs
10 July 2024
Pujiang He, Shan Zhou, Wenhuan Huang, Changqing Li, Duyi Wang, Bin Guo, Chen Meng, Sheng Gui, Weifei Yu, Yi Xie
Papers citing "Inference Performance Optimization for Large Language Models on CPUs"
1 / 1 papers shown
Title
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li, Jiaming Xu, Shan Huang, Yonghua Chen, Wen Li, ..., Jiayi Pan, Li Ding, Hao Zhou, Yu Wang, Guohao Dai
57
15
0
06 Oct 2024