ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.14363
  4. Cited By
Improving RL Exploration for LLM Reasoning through Retrospective Replay

Improving RL Exploration for LLM Reasoning through Retrospective Replay

19 April 2025
Shihan Dou
Muling Wu
Jingwen Xu
Rui Zheng
Tao Gui
Qi Zhang
Xuanjing Huang
    OffRL
    LRM
ArXivPDFHTML

Papers citing "Improving RL Exploration for LLM Reasoning through Retrospective Replay"

Title
No papers