2509.14257
Cited By
v1
v2 (latest)
From Correction to Mastery: Reinforced Distillation of Large Language Model Agents
12 September 2025
Yuanjie Lyu
Chengyu Wang
Jun Huang
Tong Xu
ALM
LRM
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvote)
Github (163★)
Papers citing
"From Correction to Mastery: Reinforced Distillation of Large Language Model Agents"
1 / 1 papers shown
MENTOR: A Reinforcement Learning Framework for Enabling Tool Use in Small Models via Teacher-Optimized Rewards
Changsu Choi
Hoyun Song
Dongyeon Kim
WooHyeon Jung
Minkyung Cho
Sunjin Park
NohHyeob Bae
Seona Yu
Kyungtae Lim
21 Oct 2025