Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.21124
Cited By
Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism
30 December 2024
Tim Tsz-Kit Lau
Weijian Li
Chenwei Xu
Han Liu
Mladen Kolar
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism"
Title
No papers