ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2509.23967
  4. Cited By
HiPO: Hybrid Policy Optimization for Dynamic Reasoning in LLMs
v1v2 (latest)

HiPO: Hybrid Policy Optimization for Dynamic Reasoning in LLMs

28 September 2025
K. Deng
Zizheng Zhan
Wen Xiang
Wenqiang Zhu
Tianhao Peng
X. Lei
W. Li
Jingxuan Xu
Kun Wu
Yifan Yao
Haoyang Huang
Huaixi Tang
Kepeng Lei
Zhiyi Lai
Songwei Yu
Zongxian Feng
Zuchen Gao
Weihao Xie
C. Zhang
Yanan Wu
Yuanxing Zhang
Daigang Xu
Yuqun Zhang
Jie Liu
Zhaoxiang Zhang
Haotian Zhang
Bin Chen
Jiaheng Liu
    LRM
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)

Papers citing "HiPO: Hybrid Policy Optimization for Dynamic Reasoning in LLMs"

2 / 2 papers shown
Title
SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models
SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models
Jingxuan Xu
K. Deng
W. Li
Songwei Yu
Huaixi Tang
...
Zhaoxiang Zhang
Yuqun Zhang
H. Zhang
Bin Chen
Jiaheng Liu
ELM
320
1
0
07 Nov 2025
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding
Yuhang Li
Chenchen Zhang
Ruilin Lv
Ao Liu
K. Deng
Yuanxing Zhang
Jiaheng Liu
Wiggin Zhou
B. Zhou
LRM
75
3
0
13 Oct 2025
1