Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.08586
Cited By
Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster Scheduling
16 August 2024
Xinyi Zhang
Hanyu Zhao
Wencong Xiao
Xianyan Jia
Fei Xu
Yong Li
Wei Lin
Fangming Liu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster Scheduling"
1 / 1 papers shown
Title
Learning in Chaos: Efficient Autoscaling and Self-healing for Distributed Training at the Edge
Wenjiao Feng
Rongxing Xiao
Zonghang Li
Hongfang Yu
Gang Sun
Long Luo
Mohsen Guizani
Qirong Ho
56
0
0
19 May 2025
1