On-line Policy Improvement using Monte-Carlo SearchNeural Information Processing Systems (NeurIPS), 1996 |
Shielding in Resource-Constrained Goal POMDPsAAAI Conference on Artificial Intelligence (AAAI), 2022 |
Multiagent Reinforcement Learning for Autonomous Routing and Pickup
Problem with Adaptation to Variable DemandIEEE International Conference on Robotics and Automation (ICRA), 2022 |
Sample-Based Bounds for Coherent Risk Measures: Applications to Policy
Synthesis and VerificationArtificial Intelligence (AIJ), 2022 |
Proximal Reinforcement Learning: Efficient Off-Policy Evaluation in
Partially Observed Markov Decision ProcessesOperational Research (OR), 2021 |
Policies for the Dynamic Traveling Maintainer Problem with AlertsEuropean Journal of Operational Research (EJOR), 2021 |