ResearchTrend.AI

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

15 April 2025
Wei Xiong, Jiarui Yao, Yuhui Xu, Bo Pang, Lei Wang, Doyen Sahoo, Junnan Li, Nan Jiang, Tong Zhang, Caiming Xiong, Hanze Dong
OffRL, LRM

Papers citing "A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce"

2 / 2 papers shown
Scalable Chain of Thoughts via Elastic Reasoning
Yuhui Xu, Hanze Dong, Lei Wang, Doyen Sahoo, Junnan Li, Caiming Xiong
OffRL, LRM
08 May 2025

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
Jiarui Yao, Yifan Hao, Hanning Zhang, Hanze Dong, Wei Xiong, Nan Jiang, Tong Zhang
LRM
05 May 2025