Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2501.09620
Cited By
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment
17 January 2025
Chaoqi Wang
Zhuokai Zhao
Yibo Jiang
Zhaorun Chen
Chen Zhu
Yuxin Chen
Jiayi Liu
Lizhu Zhang
Xiangjun Fan
Hao Ma
Sinong Wang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment"
1 / 1 papers shown
Title
A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future
Jialun Zhong
Wei Shen
Yanzeng Li
Songyang Gao
Hua Lu
Yicheng Chen
Yang Zhang
Wei Zhou
Jinjie Gu
Lei Zou
LRM
35
1
0
12 Apr 2025
1