2406.08115
Cited By
Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey
12 June 2024
Feng Liang
Zhen Zhang
Haifeng Lu
Chengming Li
Victor C. M. Leung
Yanyi Guo
Xiping Hu
Papers citing "Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey"
3 / 3 papers shown
Title
Secure Resource Allocation via Constrained Deep Reinforcement Learning
Jianfei Sun
Qiang Gao
Cong Wu
Yuxian Li
Jiacheng Wang
Dusit Niyato
52
0
0
20 Jan 2025
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters
Yihao Zhao
Xin Liu
Shufan Liu
Xiang Li
Yibo Zhu
Gang Huang
Xuanzhe Liu
Xin Jin
21
11
0
24 Mar 2023
Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines
Shigang Li
Torsten Hoefler
GNN
AI4CE
LRM
77
94
0
14 Jul 2021