ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.14257
  4. Cited By
From Correction to Mastery: Reinforced Distillation of Large Language Model Agents
v1v2 (latest)

From Correction to Mastery: Reinforced Distillation of Large Language Model Agents

12 September 2025
Yuanjie Lyu
Chengyu Wang
Jun Huang
Tong Xu
    ALMLRM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (163★)

Papers citing "From Correction to Mastery: Reinforced Distillation of Large Language Model Agents"

1 / 1 papers shown
MENTOR: A Reinforcement Learning Framework for Enabling Tool Use in Small Models via Teacher-Optimized Rewards
MENTOR: A Reinforcement Learning Framework for Enabling Tool Use in Small Models via Teacher-Optimized Rewards
Changsu Choi
Hoyun Song
Dongyeon Kim
WooHyeon Jung
Minkyung Cho
Sunjin Park
NohHyeob Bae
Seona Yu
Kyungtae Lim
162
0
0
21 Oct 2025
1