2510.01833
Cited By
Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning
2 October 2025
Zhihao Dou
Qinjian Zhao
Zhongwei Wan
Dinggen Zhang
Weida Wang
Towsif Raiyan
Benteng Chen
Qingtao Pan
Yang Ouyang
Zhiqiang Gao
Shufei Zhang
Sumon Biswas
LLMAG
LRM
Papers citing
"Plan Then Action: High-Level Planning Guidance Reinforcement Learning for LLM Reasoning"
1 / 1 papers shown
Title
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning
Zhongwei Wan
Zhihao Dou
Che Liu
Yu Zhang
Dongfei Cui
...
Yifan Jiang
Yangfan He
Mi Zhang
Shen Yan
LRM
184
25
0
02 Jun 2025
1