Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2505.22756
Cited By

Decomposing Elements of Problem Solving: What "Math" Does RL Teach?

Decomposing Elements of Problem Solving: What "Math" Does RL Teach?

28 May 2025

Core Francisco Park

Hidenori Tanaka

David Alvarez-Melis

ArXiv (abs)PDF HTML

Papers citing "Decomposing Elements of Problem Solving: What "Math" Does RL Teach?"

11 / 11 papers shown

Before you <think>, monitor: Implementing Flavell's metacognitive framework in LLMs

Before you <think>, monitor: Implementing Flavell's metacognitive framework in LLMs

134

0

0

18 Oct 2025

RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs

RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs

Kohsei Matsutani

Shota Takashiro

Gouki Minegishi

207

5

0

25 Sep 2025

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

...

Shuaiqiang Wang

Simon Shaolei Du

812

160

0

29 Apr 2025

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

716

446

0

18 Apr 2025

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

DiffM LRM AI4CE

462

72

0

16 Apr 2025

GRPO-LEAD: A Difficulty-Aware Reinforcement Learning Approach for Concise Mathematical Reasoning in Language Models

GRPO-LEAD: A Difficulty-Aware Reinforcement Learning Approach for Concise Mathematical Reasoning in Language Models

257

49

0

13 Apr 2025

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

...

557

141

0

07 Apr 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Understanding R1-Zero-Like Training: A Critical Perspective

523

600

0

26 Mar 2025

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

595

346

0

24 Mar 2025

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Ayush Chakravarthy

Noah D. Goodman

528

290

0

03 Mar 2025

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Jordan Juravsky

Christopher Ré

Azalia Mirhoseini

943

571

0

03 Jan 2025