Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2502.11520
Cited By
AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification
17 February 2025
Xiaoyu Tan
Tianchu Yao
C. Qu
Bin Li
Minghao Yang
Dakuan Lu
Haozhe Wang
Xihe Qiu
Wei Chu
Yinghui Xu
Yuan Qi
OffRL
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"AURORA:Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification"
1 / 1 papers shown
Title
Efficient Process Reward Model Training via Active Learning
Keyu Duan
Zichen Liu
Xin Mao
Tianyu Pang
Changyu Chen
Qiguang Chen
Michael Shieh
Longxu Dou
LRM
20
1
0
14 Apr 2025
1