ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.02425
  4. Cited By
LLM-Pilot: Characterize and Optimize Performance of your LLM Inference
  Services

LLM-Pilot: Characterize and Optimize Performance of your LLM Inference Services

International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024
3 October 2024
Małgorzata Łazuka
Andreea Anghel
Thomas Parnell
ArXiv (abs)PDFHTML

Papers citing "LLM-Pilot: Characterize and Optimize Performance of your LLM Inference Services"

8 / 8 papers shown
Reasoning Language Model Inference Serving Unveiled: An Empirical Study
Reasoning Language Model Inference Serving Unveiled: An Empirical Study
Qi Li
Junpan Wu
Xiang Liu
Yuxin Wang
Z. Li
Zhenheng Tang
Yuhan Chen
Shaohuai Shi
Xiaowen Chu
ReLMLRM
256
1
0
21 Oct 2025
Systematic Characterization of LLM Quantization: A Performance, Energy, and Quality Perspective
Systematic Characterization of LLM Quantization: A Performance, Energy, and Quality Perspective
Tianyao Shi
Yi Ding
MQ
130
3
0
22 Aug 2025
The Hitchhikers Guide to Production-ready Trustworthy Foundation Model powered Software (FMware)
The Hitchhikers Guide to Production-ready Trustworthy Foundation Model powered Software (FMware)
Kirill Vasilevski
Benjamin Rombaut
Gopi Krishnan Rajbahadur
G. Oliva
Keheliya Gallaba
...
Haoxiang Zhang
Bouyan Chen
Kishanthan Thangarajah
Ahmed E. Hassan
Zhen Ming
325
0
0
15 May 2025
Unveiling the Landscape of LLM Deployment in the Wild: An Empirical Study
Unveiling the Landscape of LLM Deployment in the Wild: An Empirical Study
Xinyi Hou
Jiahao Han
Yanjie Zhao
Haoyu Wang
273
5
0
05 May 2025
Taming the Titans: A Survey of Efficient LLM Inference Serving
Taming the Titans: A Survey of Efficient LLM Inference Serving
Ranran Zhen
Junlin Li
Yixin Ji
Zhiyong Yang
Tong Liu
Qingrong Xia
Xinyu Duan
Zehao Wang
Baoxing Huai
Hao Fei
LLMAG
413
7
0
28 Apr 2025
ScreenLLM: Stateful Screen Schema for Efficient Action Understanding and Prediction
ScreenLLM: Stateful Screen Schema for Efficient Action Understanding and PredictionThe Web Conference (WWW), 2025
Yiqiao Jin
Stefano Petrangeli
Yu Shen
Gang Wu
LLMAGLM&Ro
918
2
0
26 Mar 2025
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
Gopi Krishnan Rajbahadur
G. Oliva
Dayi Lin
Ahmed E. Hassan
312
3
0
28 Jan 2025
Software Performance Engineering for Foundation Model-Powered Software
  (FMware)
Software Performance Engineering for Foundation Model-Powered Software (FMware)
Haoxiang Zhang
Shi Chang
Arthur Leung
Kishanthan Thangarajah
Boyuan Chen
Hanan Lutfiyya
Ahmed E. Hassan
572
3
0
14 Nov 2024
1