Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.04950
Cited By
A Unified Pairwise Framework for RLHF: Bridging Generative Reward Modeling and Policy Optimization
7 April 2025
Wenyuan Xu
Xiaochen Zuo
Chao Xin
Yu Yue
Lin Yan
Yonghui Wu
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Unified Pairwise Framework for RLHF: Bridging Generative Reward Modeling and Policy Optimization"
Title
No papers