Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2510.01555
Cited By
v1
v2 (latest)
Rethinking KL Regularization in RLHF: From Value Estimation to Gradient Optimization
2 October 2025
Kezhao Liu
Jason Klein Liu
Mingtao Chen
Yiming Liu
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Github (956★)
Papers citing
"Rethinking KL Regularization in RLHF: From Value Estimation to Gradient Optimization"
0 / 0 papers shown
No papers found
Page 1 of 0