ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.23870
  4. Cited By
Rethinking Reward Miscalibration of GRPO in Agentic RL
v1v2 (latest)

Rethinking Reward Miscalibration of GRPO in Agentic RL

28 September 2025
Jingyu Liu
xiaopeng Wu
Jingquan Peng
Kehan Chen
Chuan Yu
Lizhong Ding
Yong Liu
ArXiv (abs)PDFHTML

Papers citing "Rethinking Reward Miscalibration of GRPO in Agentic RL"

0 / 0 papers shown
Title

No papers found