Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.00029
Cited By
Distributed Inference Performance Optimization for LLMs on CPUs
16 May 2024
Pujiang He
Shan Zhou
Changqing Li
Wenhuan Huang
Weifei Yu
Duyi Wang
Chen Meng
Sheng Gui
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Distributed Inference Performance Optimization for LLMs on CPUs"
1 / 1 papers shown
Title
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
54
13
0
06 Oct 2024
1