Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.13346
Cited By
J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization
19 May 2025
Austin Xu
Yilun Zhou
Xuan-Phi Nguyen
Caiming Xiong
Shafiq Joty
ELM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization"
Title
No papers