Efficient Strong Scaling Through Burst Parallel Training

19 December 2021
S. Park
Joshua Fried
Sunghyun Kim
Mohammad Alizadeh
Adam Belay
    GNN
    LRM

Papers citing "Efficient Strong Scaling Through Burst Parallel Training"

2 / 2 papers shown
MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters
Yihao Zhao
Xin Liu
Shufan Liu
Xiang Li
Yibo Zhu
Gang Huang
Xuanzhe Liu
Xin Jin
27
11
0
24 Mar 2023
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
245
1,817
0
17 Sep 2019