ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.21124
  4. Cited By
Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism

Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism

30 December 2024
Tim Tsz-Kit Lau
Weijian Li
Chenwei Xu
Han Liu
Mladen Kolar
ArXivPDFHTML

Papers citing "Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism"

Title
No papers