ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.07304
  4. Cited By
Inference Performance Optimization for Large Language Models on CPUs

Inference Performance Optimization for Large Language Models on CPUs

10 July 2024
Pujiang He
Shan Zhou
Wenhuan Huang
Changqing Li
Duyi Wang
Bin Guo
Chen Meng
Sheng Gui
Weifei Yu
Yi Xie
ArXivPDFHTML

Papers citing "Inference Performance Optimization for Large Language Models on CPUs"

1 / 1 papers shown
Title
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
57
15
0
06 Oct 2024
1