Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.06856
Cited By
Aladdin: Joint Placement and Scaling for SLO-Aware LLM Serving
11 May 2024
Chengyi Nie
Rodrigo Fonseca
Zhenhua Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Aladdin: Joint Placement and Scaling for SLO-Aware LLM Serving"
1 / 1 papers shown
Title
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction
Haoran Qiu
Weichao Mao
Archit Patke
Shengkun Cui
Saurabh Jha
Chen Wang
Hubertus Franke
Zbigniew T. Kalbarczyk
Tamer Basar
Ravishankar K. Iyer
14
23
0
12 Apr 2024
1