2602.08499
Cited By
Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
9 February 2026
Xiaodong Lu
Xiaohan Wang
Jiajun Chai
Guojun Yin
Wei Lin
Zhijun Chen
Yu Luo
Fuzhen Zhuang
Yikun Ban
Deqing Wang
OffRL
LRM
ArXiv (abs)
PDF
HTML
Github (1090★)
Papers citing
"Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards"
No papers found