ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.12235
  4. Cited By
RL Fine-Tuning Heals OOD Forgetting in SFT

RL Fine-Tuning Heals OOD Forgetting in SFT

8 September 2025
Hangzhan Jin
Sitao Luan
Sicheng Lyu
Guillaume Rabusseau
Reihaneh Rabbany
Doina Precup
Mohammad Hamdaqa
    CLLLRM
ArXiv (abs)PDFHTMLGithub (6★)

Papers citing "RL Fine-Tuning Heals OOD Forgetting in SFT"

2 / 2 papers shown
Title
Debunk the Myth of SFT Generalization
Debunk the Myth of SFT Generalization
Xiaofeng Lin
Hejian Sang
Zhipeng Wang
Xuezhou Zhang
OffRLLRM
21
0
0
30 Sep 2025
How LLMs Learn to Reason: A Complex Network Perspective
How LLMs Learn to Reason: A Complex Network Perspective
Sihan Hu
X-D Cai
Yuan Huang
Zhiyuan Yao
Linfeng Zhang
Pan Zhang
Youjin Deng
Kun Chen
LRM
77
0
0
28 Sep 2025
1