ResearchTrend.AI

Aladdin: Joint Placement and Scaling for SLO-Aware LLM Serving

11 May 2024
Chengyi Nie
Rodrigo Fonseca
Zhenhua Liu
arXiv: 2405.06856

Papers citing "Aladdin: Joint Placement and Scaling for SLO-Aware LLM Serving"

1 / 1 papers shown
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction
Haoran Qiu
Weichao Mao
Archit Patke
Shengkun Cui
Saurabh Jha
Chen Wang
Hubertus Franke
Zbigniew T. Kalbarczyk
Tamer Basar
Ravishankar K. Iyer
12 Apr 2024