2602.01058
Cited By
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
1 February 2026
Dylan Zhang
Yufeng Xu
Haojin Wang
Qingzhi Chen
Hao Peng
OffRL
ReLM
LRM
Papers citing "Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning"
No papers found