Methods and Mechanisms for Interactive Novelty Handling in Adversarial
EnvironmentsAdaptive Agents and Multi-Agent Systems (AAMAS), 2023 |
Data Driven Reward Initialization for Preference based Reinforcement
Learning Mudit Verma Subbarao Kambhampati |