ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.03047
  4. Cited By
FlashRecovery: Fast and Low-Cost Recovery from Failures for Large-Scale Training of LLMs

FlashRecovery: Fast and Low-Cost Recovery from Failures for Large-Scale Training of LLMs

3 September 2025
H. Zhang
Jinxiang Wang
Zhenhua Yu
Y. Zhang
Xuejie Ji
Kaining Mao
Jun Zhang
Y. Zhang
Ting Wu
F. Jie
X. Y. Huang
Zhifang Cai
Junhua Cheng
S. Wang
W. Li
Xiaoming Bao
H. Xu
Shixiong Zhao
Jun Yu Li
Hongwei Sun
Z. Zhang
Yi Xiong
Chunsheng Li
    VLM
ArXiv (abs)PDFHTML

Papers citing "FlashRecovery: Fast and Low-Cost Recovery from Failures for Large-Scale Training of LLMs"

1 / 1 papers shown
xLLM Technical Report
xLLM Technical Report
T. Liu
Tao Peng
Peijun Yang
X. Zhao
Xiusheng Lu
...
Tong Yang
Hailong Yang
Jing-Jing Li
Guiguang Ding
Ke Zhang
156
2
0
16 Oct 2025
1