Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.02391
Cited By
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
5 May 2025
Jiarui Yao
Yifan Hao
Hanning Zhang
Hanze Dong
Wei Xiong
Nan Jiang
Tong Zhang
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL"
Title
No papers