2509.03646
v3 (latest)
Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning
3 September 2025
Haozhe Wang
Qixin Xu
Che Liu
J. Wu
Fangzhen Lin
Wenhu Chen
LRM
HuggingFace (14 upvotes)
Github (2045★)
Papers citing "Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning"
14 / 14 papers shown
Title
From Illusion to Intention: Visual Rationale Learning for Vision-Language Reasoning
C. Wang
Haozhe Wang
Xi Chen
J. Liu
Taofeng Xue
Chong Peng
Donglian Qi
Fangzhen Lin
Yunfeng Yan
OffRL
LRM
304
0
0
28 Nov 2025
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
Peng Xia
K. Zeng
Jiaqi Liu
Can Qin
Fang Wu
Yiyang Zhou
Caiming Xiong
Huaxiu Yao
LLMAG
LM&Ro
SyDa
709
3
0
20 Nov 2025
Demystifying Reinforcement Learning in Agentic Reasoning
Zhaochen Yu
Ling Yang
Jiaru Zou
Shuicheng Yan
Mengdi Wang
AI4TS
LRM
258
5
0
13 Oct 2025
LLMs as Strategic Agents: Beliefs, Best Response Behavior, and Emergent Heuristics
Enric Junque de Fortuny
Veronica Roberta Cappelli
LLMAG
LM&Ro
LRM
64
0
0
12 Oct 2025
Rethinking Entropy Interventions in RLVR: An Entropy Change Perspective
Zhezheng Hao
Hong Wang
Haoyang Liu
Jian Luo
Jiarui Yu
Hande Dong
Qiang Lin
Can Wang
Jiawei Chen
AAML
82
5
0
11 Oct 2025
Selection, Reflection and Self-Refinement: Revisit Reasoning Tasks via a Causal Lens
Yunlong Deng
Boyang Sun
Yan Li
Lingjing Kong
Zeyu Tang
Kun Zhang
Guangyi Chen
LRM
120
0
0
09 Oct 2025
The Debate on RLVR Reasoning Capability Boundary: Shrinkage, Expansion, or Both? A Two-Stage Dynamic View
Xinhao Yao
Lu Yu
Xiaolin Hu
Fengwei Teng
Qing Cui
Jun Zhou
Yong Liu
LRM
173
0
0
05 Oct 2025
Training Large Language Models To Reason In Parallel With Global Forking Tokens
Sheng Jia
Xiao Wang
Shiva Prasad Kasiviswanathan
LRM
153
1
0
01 Oct 2025
ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
Siru Ouyang
Jun Yan
I-Hung Hsu
Yanfei Chen
Ke Jiang
...
Mahsan Rofouei
Hangfei Lin
Jiawei Han
Chen-Yu Lee
Tomas Pfister
LLMAG
CLL
LRM
132
10
0
29 Sep 2025
Explore-Execute Chain: Towards an Efficient Structured Reasoning Paradigm
Kaisen Yang
Lixuan He
Rushi Shah
Kaicheng Yang
Qinwei Ma
Dianbo Liu
Alex Lamb
OffRL
LRM
158
0
0
28 Sep 2025
RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs
Kohsei Matsutani
Shota Takashiro
Gouki Minegishi
Takeshi Kojima
Yusuke Iwasawa
Yutaka Matsuo
OffRL
ReLM
LRM
206
4
0
25 Sep 2025
Reverse-Engineered Reasoning for Open-Ended Generation
Haozhe Wang
Haoran Que
Qixin Xu
Minghao Liu
Wangchunshu Zhou
...
Wei Ye
Tong Yang
Wenhao Huang
G. Zhang
Fangzhen Lin
ReLM
LRM
192
9
0
07 Sep 2025
Language-Driven Object-Oriented Two-Stage Method for Scene Graph Anticipation
X. Zhu
Changwei Wang
Haozhe Wang
Xinyu Liu
Fangzhen Lin
192
1
0
06 Sep 2025
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Alex Su
Haozhe Wang
Weiming Ren
Fangzhen Lin
Lei Ma
MLLM
OffRL
LRM
VLM
315
93
0
21 May 2025