ResearchTrend.AI

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
v1, v2, v3 (latest)

3 September 2025
Haozhe Wang
Qixin Xu
Che Liu
J. Wu
Fangzhen Lin
Wenhu Chen
    LRM
ArXiv (abs) | PDF | HTML | HuggingFace (14 upvotes) | GitHub (2045★)

Papers citing "Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning"

14 / 14 papers shown
From Illusion to Intention: Visual Rationale Learning for Vision-Language Reasoning
C. Wang
Haozhe Wang
Xi Chen
J. Liu
Taofeng Xue
Chong Peng
Donglian Qi
Fangzhen Lin
Yunfeng Yan
OffRL, LRM
304
0
0
28 Nov 2025
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
Peng Xia
K. Zeng
Jiaqi Liu
Can Qin
Fang Wu
Yiyang Zhou
Caiming Xiong
Huaxiu Yao
LLMAG, LM&Ro, SyDa
709
3
0
20 Nov 2025
Demystifying Reinforcement Learning in Agentic Reasoning
Zhaochen Yu
Ling Yang
Jiaru Zou
Shuicheng Yan
Mengdi Wang
AI4TS, LRM
258
5
0
13 Oct 2025
LLMs as Strategic Agents: Beliefs, Best Response Behavior, and Emergent Heuristics
Enric Junque de Fortuny
Veronica Roberta Cappelli
LLMAG, LM&Ro, LRM
64
0
0
12 Oct 2025
Rethinking Entropy Interventions in RLVR: An Entropy Change Perspective
Zhezheng Hao
Hong Wang
Haoyang Liu
Jian Luo
Jiarui Yu
Hande Dong
Qiang Lin
Can Wang
Jiawei Chen
AAML
82
5
0
11 Oct 2025
Selection, Reflection and Self-Refinement: Revisit Reasoning Tasks via a Causal Lens
Yunlong Deng
Boyang Sun
Yan Li
Lingjing Kong
Zeyu Tang
Kun Zhang
Guangyi Chen
LRM
120
0
0
09 Oct 2025
The Debate on RLVR Reasoning Capability Boundary: Shrinkage, Expansion, or Both? A Two-Stage Dynamic View
Xinhao Yao
Lu Yu
Xiaolin Hu
Fengwei Teng
Qing Cui
Jun Zhou
Yong Liu
LRM
173
0
0
05 Oct 2025
Training Large Language Models To Reason In Parallel With Global Forking Tokens
Sheng Jia
Xiao Wang
Shiva Prasad Kasiviswanathan
LRM
153
1
0
01 Oct 2025
ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
Siru Ouyang
Jun Yan
I-Hung Hsu
Yanfei Chen
Ke Jiang
...
Mahsan Rofouei
Hangfei Lin
Jiawei Han
Chen-Yu Lee
Tomas Pfister
LLMAG, CLL, LRM
132
10
0
29 Sep 2025
Explore-Execute Chain: Towards an Efficient Structured Reasoning Paradigm
Kaisen Yang
Lixuan He
Rushi Shah
Kaicheng Yang
Qinwei Ma
Dianbo Liu
Alex Lamb
OffRL, LRM
158
0
0
28 Sep 2025
RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs
Kohsei Matsutani
Shota Takashiro
Gouki Minegishi
Takeshi Kojima
Yusuke Iwasawa
Yutaka Matsuo
OffRL, ReLM, LRM
206
4
0
25 Sep 2025
Reverse-Engineered Reasoning for Open-Ended Generation
Haozhe Wang
Haoran Que
Qixin Xu
Minghao Liu
Wangchunshu Zhou
...
Wei Ye
Tong Yang
Wenhao Huang
G. Zhang
Fangzhen Lin
ReLM, LRM
192
9
0
07 Sep 2025
Language-Driven Object-Oriented Two-Stage Method for Scene Graph Anticipation
X. Zhu
Changwei Wang
Haozhe Wang
Xinyu Liu
Fangzhen Lin
192
1
0
06 Sep 2025
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Alex Su
Haozhe Wang
Weiming Ren
Fangzhen Lin
Lei Ma
MLLM, OffRL, LRM, VLM
315
93
0
21 May 2025