ResearchTrend.AI
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning

17 February 2023
Mudit Verma, Siddhant Bhambri, Subbarao Kambhampati
ArXiv · PDF · HTML

Papers citing "Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning"

4 papers shown
Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences
Z. Liu, Junjie Xu, Xingjiao Wu, J. Yang, Liang He
11 Sep 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani, M. E. Taylor
OffRL
30 Apr 2024
Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments
Tung Thai, Mingyu Shen, M. Garg, Ayush Kalani, Nakul Vaidya, ..., Neeraj Varshney, Chitta Baral, Subbarao Kambhampati, Jivko Sinapov, Matthias Scheutz
28 Feb 2023
Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations
S. Sreedharan, Utkarsh Soni, Mudit Verma, Siddharth Srivastava, S. Kambhampati
04 Feb 2020