Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.09655
Cited By
DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models
14 May 2025
Xiwen Chen
Wenhui Zhu
Peijie Qiu
Xuanzhao Dong
Hao Wang
Haiyu Wu
Huayu Li
Aristeidis Sotiras
Y. Wang
Abolfazl Razi
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models"
Title
No papers