ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.05871
  4. Cited By
Dynamic value alignment through preference aggregation of multiple
  objectives

Dynamic value alignment through preference aggregation of multiple objectives

9 October 2023
Marcin Korecki
Damian Dailisan
Cesare Carissimo
ArXivPDFHTML

Papers citing "Dynamic value alignment through preference aggregation of multiple objectives"

1 / 1 papers shown
Title
Human-in-the-loop: Provably Efficient Preference-based Reinforcement
  Learning with General Function Approximation
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
Xiaoyu Chen
Han Zhong
Zhuoran Yang
Zhaoran Wang
Liwei Wang
118
60
0
23 May 2022
1