ResearchTrend.AI

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

29 April 2025
Yiping Wang
Qing Yang
Zhiyuan Zeng
Liliang Ren
L. Liu
Baolin Peng
Hao Cheng
Xuehai He
Kuan-Chieh Jackson Wang
Jianfeng Gao
Weizhu Chen
S. Wang
Simon S. Du
Yelong Shen
    OffRL
    ReLM
    LRM

Papers citing "Reinforcement Learning for Reasoning in Large Language Models with One Training Example"

1 / 1 papers shown
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
Taiwei Shi
Yiyang Wu
Linxin Song
Tianyi Zhou
Jieyu Zhao
LRM
07 Apr 2025