Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2510.02172
Cited By
RESTRAIN: From Spurious Votes to Signals -- Self-Driven RL with Self-Penalization
2 October 2025
Zhaoning Yu
Will Su
Leitian Tao
Haozhu Wang
Aashu Singh
Hanchao Yu
Jianyu Wang
Hongyang Gao
Weizhe Yuan
Jason Weston
Ping Yu
Jing Xu
OffRL
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (6 upvotes)
Papers citing
"RESTRAIN: From Spurious Votes to Signals -- Self-Driven RL with Self-Penalization"
0 / 0 papers shown
Title
No papers found