ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
  • Feedback
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2508.07534
  4. Cited By
From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR
v1v2 (latest)

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

11 August 2025
Jia Deng
Jie Chen
Zhipeng Chen
Daixuan Cheng
Fei Bai
Beichen Zhang
Yinqian Min
Y. Gao
Wayne Xin Zhao
Ji-Rong Wen
    LRM
ArXiv (abs)PDFHTMLGithub (727★)

Papers citing "From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR"

Title

No papers found