Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.11343
Cited By
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
15 April 2025
Wei Xiong
Jiarui Yao
Yuhui Xu
Bo Pang
Lei Wang
Doyen Sahoo
Junnan Li
Nan Jiang
Tong Zhang
Caiming Xiong
Hanze Dong
OffRL
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce"
2 / 2 papers shown
Title
Scalable Chain of Thoughts via Elastic Reasoning
Yuhui Xu
Hanze Dong
Lei Wang
Doyen Sahoo
Junnan Li
Caiming Xiong
OffRL
LRM
47
0
0
08 May 2025
Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
Jiarui Yao
Yifan Hao
Hanning Zhang
Hanze Dong
Wei Xiong
Nan Jiang
Tong Zhang
LRM
47
0
0
05 May 2025
1