Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.07585
Cited By
Communication Optimization for Distributed Training: Architecture, Advances, and Opportunities
12 March 2024
Yunze Wei
Tianshuo Hu
Cong Liang
Yong Cui
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Communication Optimization for Distributed Training: Architecture, Advances, and Opportunities"
1 / 1 papers shown
Title
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
243
1,815
0
17 Sep 2019
1