Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2509.14257
Cited By

From Correction to Mastery: Reinforced Distillation of Large Language Model Agents

v1v2 (latest)

From Correction to Mastery: Reinforced Distillation of Large Language Model Agents

12 September 2025

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (163★)

Papers citing "From Correction to Mastery: Reinforced Distillation of Large Language Model Agents"

1 / 1 papers shown

MENTOR: A Reinforcement Learning Framework for Enabling Tool Use in Small Models via Teacher-Optimized Rewards

MENTOR: A Reinforcement Learning Framework for Enabling Tool Use in Small Models via Teacher-Optimized Rewards

162

0

0

21 Oct 2025