ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.22200
  4. Cited By
EFRame: Deeper Reasoning via Exploration-Filter-Replay Reinforcement Learning Framework
v1v2v3v4v5 (latest)

EFRame: Deeper Reasoning via Exploration-Filter-Replay Reinforcement Learning Framework

27 June 2025
Chen Wang
Lai Wei
Yanzhi Zhang
Chenyang Shao
Zedong Dan
Weiran Huang
Yuzhi Zhang
Yue Wang
    LRMOffRL
ArXiv (abs)PDFHTMLGithub (738★)

Papers citing "EFRame: Deeper Reasoning via Exploration-Filter-Replay Reinforcement Learning Framework"

1 / 1 papers shown
Arbitrary Entropy Policy Optimization Breaks The Exploration Bottleneck of Reinforcement Learning
Arbitrary Entropy Policy Optimization Breaks The Exploration Bottleneck of Reinforcement Learning
Chen Wang
Ruoyao Xiao
Jionghao Bai
Yuzhi Zhang
Shisheng Cui
Zhou Zhao
Yue Wang
380
0
0
09 Oct 2025
1
Page 1 of 1