ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.11520
  4. Cited By
AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification

AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification

17 February 2025
Xiaoyu Tan
Tianchu Yao
C. Qu
Bin Li
Minghao Yang
Dakuan Lu
Haozhe Wang
Xihe Qiu
Wei Chu
Yinghui Xu
Yuan Qi
    OffRL
    LRM
ArXivPDFHTML

Papers citing "AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification"

1 / 1 papers shown
Title
Efficient Process Reward Model Training via Active Learning
Efficient Process Reward Model Training via Active Learning
Keyu Duan
Zichen Liu
Xin Mao
Tianyu Pang
Changyu Chen
Qiguang Chen
Michael Shieh
Longxu Dou
LRM
20
1
0
14 Apr 2025
1