ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.00029
  4. Cited By
Distributed Inference Performance Optimization for LLMs on CPUs

Distributed Inference Performance Optimization for LLMs on CPUs

16 May 2024
Pujiang He
Shan Zhou
Changqing Li
Wenhuan Huang
Weifei Yu
Duyi Wang
Chen Meng
Sheng Gui
ArXivPDFHTML

Papers citing "Distributed Inference Performance Optimization for LLMs on CPUs"

1 / 1 papers shown
Title
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
54
13
0
06 Oct 2024
1