2303.13803
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters
24 March 2023
Yihao Zhao, Xin Liu, Shufan Liu, Xiang Li, Yibo Zhu, Gang Huang, Xuanzhe Liu, Xin Jin
Papers citing
"MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters"
5 / 5 papers shown
Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey
Feng Liang, Zhen Zhang, Haifeng Lu, Chengming Li, Victor C. M. Leung, Yanyi Guo, Xiping Hu
38
3
0
12 Jun 2024
MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving
Jiangfei Duan, Runyu Lu, Haojie Duanmu, Xiuhong Li, Xingcheng Zhang, Dahua Lin, Ion Stoica, Hao Zhang
35
1
0
02 Apr 2024
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning
Xupeng Miao, Gabriele Oliaro, Xinhao Cheng, Vineeth Kada, Ruohan Gao, ..., April Yang, Yingcheng Wang, Mengdi Wu, Colin Unger, Zhihao Jia
MoE
88
8
0
29 Feb 2024
PolyThrottle: Energy-efficient Neural Network Inference on Edge Devices
Minghao Yan, Hongyi Wang, Shivaram Venkataraman
6
0
0
30 Oct 2023
Densely Connected Convolutional Networks
Gao Huang, Zhuang Liu, L. V. D. van der Maaten, Kilian Q. Weinberger
PINN
3DV
244
35,884
0
25 Aug 2016