ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2510.13786
  4. Cited By
The Art of Scaling Reinforcement Learning Compute for LLMs

The Art of Scaling Reinforcement Learning Compute for LLMs

15 October 2025
Devvrit Khatri
Lovish Madaan
Rishabh Tiwari
Rachit Bansal
Sai Surya Duvvuri
Manzil Zaheer
Inderjit Dhillon
David Brandfonbrener
Rishabh Agarwal
    OffRL
ArXiv (abs)PDFHTMLHuggingFace (27 upvotes)Github (967★)

Papers citing "The Art of Scaling Reinforcement Learning Compute for LLMs"

1 / 1 papers shown
Title
Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients
Advantage Shaping as Surrogate Reward Maximization: Unifying Pass@K Policy Gradients
Christos Thrampoulidis
Sadegh Mahdavi
Wenlong Deng
OffRL
32
0
0
27 Oct 2025
1