Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2509.23967
Cited By
v1
v2 (latest)
HiPO: Hybrid Policy Optimization for Dynamic Reasoning in LLMs
28 September 2025
K. Deng
Zizheng Zhan
Wen Xiang
Wenqiang Zhu
Tianhao Peng
X. Lei
W. Li
Jingxuan Xu
Kun Wu
Yifan Yao
Haoyang Huang
Huaixi Tang
Kepeng Lei
Zhiyi Lai
Songwei Yu
Zongxian Feng
Zuchen Gao
Weihao Xie
C. Zhang
Yanan Wu
Yuanxing Zhang
Daigang Xu
Yuqun Zhang
Jie Liu
Zhaoxiang Zhang
Haotian Zhang
Bin Chen
Jiaheng Liu
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Papers citing
"HiPO: Hybrid Policy Optimization for Dynamic Reasoning in LLMs"
2 / 2 papers shown
Title
SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models
Jingxuan Xu
K. Deng
W. Li
Songwei Yu
Huaixi Tang
...
Zhaoxiang Zhang
Yuqun Zhang
H. Zhang
Bin Chen
Jiaheng Liu
ELM
320
1
0
07 Nov 2025
ReLook: Vision-Grounded RL with a Multimodal LLM Critic for Agentic Web Coding
Yuhang Li
Chenchen Zhang
Ruilin Lv
Ao Liu
K. Deng
Yuanxing Zhang
Jiaheng Liu
Wiggin Zhou
B. Zhou
LRM
75
3
0
13 Oct 2025
1