Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2401.05391
Cited By
Efficient LLM inference solution on Intel GPU
19 December 2023
Hui Wu
Yi Gan
Feng Yuan
Jing Ma
Wei Zhu
Yutao Xu
Hong Zhu
Yuhua Zhu
Xiaoli Liu
Jinghui Gu
Peng Zhao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient LLM inference solution on Intel GPU"
1 / 1 papers shown
Title
Taming the Titans: A Survey of Efficient LLM Inference Serving
Ranran Zhen
J. Li
Yixin Ji
Z. Yang
Tong Liu
Qingrong Xia
Xinyu Duan
Z. Wang
Baoxing Huai
M. Zhang
LLMAG
77
0
0
28 Apr 2025
1