arXiv: 2508.00410
Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models
1 August 2025
Zizhuo Zhang
Jianing Zhu
Xinmu Ge
Zihua Zhao
Zhanke Zhou
Xuan Li
Xiao Feng
Jiangchao Yao
Bo Han
ALM
LRM
Github (37★)
Papers citing
"Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"
No papers found