Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.13818
Cited By
Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning
18 April 2025
Yixuan Even Xu
Yash Savani
Fei Fang
Zico Kolter
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning"
1 / 1 papers shown
Title
Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models
Xiaobao Wu
LRM
62
0
0
05 May 2025
1