Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.17621
Cited By
Process Supervision-Guided Policy Optimization for Code Generation
23 October 2024
Ning Dai
Zheng Wu
Renjie Zheng
Ziyun Wei
Wenlei Shi
Xing Jin
Guanlin Liu
Chen Dun
Liang Huang
Lin Yan
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Process Supervision-Guided Policy Optimization for Code Generation"
2 / 2 papers shown
Title
Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators
Yilun Zhou
Austin Xu
Peifeng Wang
Caiming Xiong
Shafiq R. Joty
ELM
ALM
LRM
45
2
0
21 Apr 2025
IterPref: Focal Preference Learning for Code Generation via Iterative Debugging
Jie Wu
Haoling Li
Xin Zhang
Jianwen Luo
Yangyu Huang
Ruihang Chu
Y. Yang
Scarlett Li
71
0
0
04 Mar 2025
1