Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.04021
Cited By
Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving
6 May 2025
Shan Yu
Jiarong Xing
Yifan Qiao
Mingyuan Ma
Y. Li
Yang Wang
Shuo Yang
Zhiqiang Xie
Shiyi Cao
Ke Bao
Ion Stoica
Harry Xu
Ying Sheng
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving"
Title
No papers