ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.12421
  4. Cited By
A Unified Linear Programming Framework for Offline Reward Learning from
  Human Demonstrations and Feedback
v1v2 (latest)

A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback

20 May 2024
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
    OffRL
ArXiv (abs)PDFHTMLGithub

Papers citing "A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback"

2 / 2 papers shown
Beyond RLHF and NLHF: Population-Proportional Alignment under an Axiomatic Framework
Beyond RLHF and NLHF: Population-Proportional Alignment under an Axiomatic Framework
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
337
1
0
05 Jun 2025
RLHF from Heterogeneous Feedback via Personalization and Preference
  Aggregation
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
Chanwoo Park
Mingyang Liu
Dingwen Kong
Kaiqing Zhang
Asuman Ozdaglar
493
66
0
30 Apr 2024
1
Page 1 of 1