2405.12421
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
20 May 2024
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
OffRL
Papers citing
"A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback"
2 / 2 papers shown
Beyond RLHF and NLHF: Population-Proportional Alignment under an Axiomatic Framework
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
337
1
0
05 Jun 2025
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Chanwoo Park
Mingyang Liu
Dingwen Kong
Kaiqing Zhang
Asuman Ozdaglar
493
66
0
30 Apr 2024