
Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
Yongji Wu
Wenjie Qu
Tianyang Tao
Zhuang Wang
Wei Bai
Zhuohao Li
Yuan Tian
Jiaheng Zhang
Matthew Lentz
Danyang Zhuo