arXiv:2509.23808, v3 (latest)
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR
28 September 2025
Fanding Huang
Guanbo Huang
Xiao Fan
Yi He
Xiao Liang
Xiao Chen
Qinting Jiang
Faisal Nadeem Khan
Jingyan Jiang
Zhi Wang
OffRL
Papers citing "Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR"
No papers found