ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2510.01833
  4. Cited By
Plan Then Action:High-Level Planning Guidance Reinforcement Learning for LLM Reasoning

Plan Then Action:High-Level Planning Guidance Reinforcement Learning for LLM Reasoning

2 October 2025
Zhihao Dou
Qinjian Zhao
Zhongwei Wan
Dinggen Zhang
Weida Wang
Towsif Raiyan
Benteng Chen
Qingtao Pan
Yang Ouyang
Zhiqiang Gao
Shufei Zhang
Sumon Biswas
    LLMAGLRM
ArXiv (abs)PDFHTML

Papers citing "Plan Then Action:High-Level Planning Guidance Reinforcement Learning for LLM Reasoning"

1 / 1 papers shown
Title
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning
Zhongwei Wan
Zhihao Dou
Che Liu
Yu Zhang
Dongfei Cui
...
Yifan Jiang
Yangfan He
Mi Zhang
Shen Yan
Shen Yan
LRM
184
25
0
02 Jun 2025
1