Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2507.05386
Cited By
v1
v2
v3
v4
v5 (latest)
Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training
7 July 2025
Song Lai
Haohan Zhao
Rong Feng
Changyi Ma
Wenzhuo Liu
Hongbo Zhao
Xi Lin
Dong Yi
Min Xie
Gang Qu
Hongbin Liu
Gaofeng Meng
CLL
KELM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1★)
Papers citing
"Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training"
9 / 9 papers shown
Learning to Refuse: Refusal-Aware Reinforcement Fine-Tuning for Hard-Irrelevant Queries in Video Temporal Grounding
Jin-Seop Lee
SungJoon Lee
SeongJun Jung
Boyang Li
Jee-Hyong Lee
OOD
179
0
0
28 Nov 2025
Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting
Howard Chen
Noam Razin
Karthik Narasimhan
Danqi Chen
CLL
KELM
398
12
0
21 Oct 2025
Continual Learning via Sparse Memory Finetuning
Jessy Lin
Luke Zettlemoyer
Gargi Ghosh
Wen-tau Yih
Aram H. Markosyan
Vincent-Pierre Berges
Barlas Oğuz
KELM
CLL
155
0
0
16 Oct 2025
Deterministic algorithms for inhomogeneous Bernoulli trials: Shapley value of network devices
Jesse D Wei
Guo Wei
FAtt
227
0
0
08 Oct 2025
Beyond English-Centric Training: How Reinforcement Learning Improves Cross-Lingual Reasoning in LLMs
Shulin Huang
Yiran Ding
Junshu Pan
Yue Zhang
OffRL
LRM
130
2
0
28 Sep 2025
RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs
Kohsei Matsutani
Shota Takashiro
Gouki Minegishi
Takeshi Kojima
Yusuke Iwasawa
Yutaka Matsuo
OffRL
ReLM
LRM
208
6
0
25 Sep 2025
Reinforcement Learning on Pre-Training Data
Siheng Li
Kejiao Li
Zenan Xu
Guanhua Huang
Evander Yang
...
Jianchen Zhu
W. Lam
Wayyt Wang
Bo Zhou
Di Wang
OffRL
LRM
184
4
0
23 Sep 2025
RL's Razor: Why Online Reinforcement Learning Forgets Less
Idan Shenfeld
Jyothish Pari
Pulkit Agrawal
CLL
194
43
0
04 Sep 2025
EFRame: Deeper Reasoning via Exploration-Filter-Replay Reinforcement Learning Framework
Chen Wang
Lai Wei
Yanzhi Zhang
Chenyang Shao
Zedong Dan
Weiran Huang
Yuzhi Zhang
Yue Wang
LRM
OffRL
382
2
0
27 Jun 2025
1
Page 1 of 1