Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study

4 May 2025
Xiaoyu Tian
Sitong Zhao
Haotian Wang
Shuaiting Chen
Yiping Peng
Yunjie Ji
Han Zhao
Xiangang Li
    OffRL
    LRM

Papers citing "Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study"

No citing papers are listed.