Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards
arXiv: 2402.17975

28 February 2024
Katherine Metcalf
Miguel Sarabia
Natalie Mackraz
B. Theobald

Papers citing "Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards"

2 / 2 papers shown
Programming Refusal with Conditional Activation Steering
Bruce W. Lee
Inkit Padhi
K. Ramamurthy
Erik Miehling
Pierre L. Dognin
Manish Nagireddy
Amit Dhurandhar
LLMSV
91
13
0
06 Sep 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
M. E. Taylor
OffRL
38
2
0
30 Apr 2024