ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.00704
  4. Cited By
Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical
  Scaling
v1v2 (latest)

Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical Scaling

31 March 2024
Kamran Razavi
Saeid Ghafouri
Max Mühlhäuser
Pooyan Jamshidi
Lin Wang
ArXiv (abs)PDFHTML

Papers citing "Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical Scaling"

1 / 1 papers shown
Title
A Study of Skews, Imbalances, and Pathological Conditions in LLM Inference Deployment on GPU Clusters detectable from DPU
A Study of Skews, Imbalances, and Pathological Conditions in LLM Inference Deployment on GPU Clusters detectable from DPU
Javed I. Khan an Henry Uwabor Moye
60
0
0
09 Sep 2025
1