ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.08115
  4. Cited By
Resource Allocation and Workload Scheduling for Large-Scale Distributed
  Deep Learning: A Survey

Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey

12 June 2024
Feng Liang
Zhen Zhang
Haifeng Lu
Chengming Li
Victor C. M. Leung
Yanyi Guo
Xiping Hu
ArXivPDFHTML

Papers citing "Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey"

3 / 3 papers shown
Title
Secure Resource Allocation via Constrained Deep Reinforcement Learning
Secure Resource Allocation via Constrained Deep Reinforcement Learning
Jianfei Sun
Qiang Gao
Cong Wu
Yuxian Li
Jiacheng Wang
Dusit Niyato
52
0
0
20 Jan 2025
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep
  Learning Clusters
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters
Yihao Zhao
Xin Liu
Shufan Liu
Xiang Li
Yibo Zhu
Gang Huang
Xuanzhe Liu
Xin Jin
21
11
0
24 Mar 2023
Chimera: Efficiently Training Large-Scale Neural Networks with
  Bidirectional Pipelines
Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines
Shigang Li
Torsten Hoefler
GNN
AI4CE
LRM
77
94
0
14 Jul 2021
1