2404.00704
Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical Scaling
31 March 2024
Kamran Razavi
Saeid Ghafouri
Max Mühlhäuser
Pooyan Jamshidi
Lin Wang
Papers citing "Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical Scaling"
1 / 1 papers shown
Title
A Study of Skews, Imbalances, and Pathological Conditions in LLM Inference Deployment on GPU Clusters detectable from DPU
Javed I. Khan and Henry Uwabor Moye
09 Sep 2025