2506.22200
v5 (latest)
EFRame: Deeper Reasoning via Exploration-Filter-Replay Reinforcement Learning Framework
27 June 2025
Chen Wang
Lai Wei
Yanzhi Zhang
Chenyang Shao
Zedong Dan
Weiran Huang
Yuzhi Zhang
Yue Wang
LRM
OffRL
Github (738★)
Papers citing "EFRame: Deeper Reasoning via Exploration-Filter-Replay Reinforcement Learning Framework"
1 / 1 papers shown
Arbitrary Entropy Policy Optimization Breaks The Exploration Bottleneck of Reinforcement Learning
Chen Wang
Ruoyao Xiao
Jionghao Bai
Yuzhi Zhang
Shisheng Cui
Zhou Zhao
Yue Wang
09 Oct 2025