Dynamic value alignment through preference aggregation of multiple objectives

9 October 2023

Papers citing "Dynamic value alignment through preference aggregation of multiple objectives"

1 / 1 papers shown

Title
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation Xiaoyu Chen Han Zhong Zhuoran Yang Zhaoran Wang Liwei Wang 118 60 0 23 May 2022