Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2507.09846
Cited By
v1
v2
v3
v4 (latest)
Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training
14 July 2025
Minhak Song
Beomhan Baek
Kwangjun Ahn
Chulhee Yun
CLL
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Github (110★)
Papers citing
"Through the River: Understanding the Benefit of Schedule-Free Methods for Language Model Training"
2 / 2 papers shown
Scaling with Collapse: Efficient and Predictable Training of LLM Families
Shane Bergsma
Bin Claire Zhang
Nolan Dey
Shaheer Muhammad
Gurpreet Gosal
Joel Hestness
146
2
0
29 Sep 2025
WSM: Decay-Free Learning Rate Schedule via Checkpoint Merging for LLM Pre-training
Changxin Tian
Jiapeng Wang
Qian Zhao
Kunlong Chen
Jia-Ling Liu
Ziqi Liu
Jiaxin Mao
Wayne Xin Zhao
Zhiqiang Zhang
Jun Zhou
MoMe
CLL
264
7
0
23 Jul 2025
1
Page 1 of 1