The Value-Improvement Path: Towards Better Representations for Reinforcement Learning. AAAI Conference on Artificial Intelligence (AAAI), 2020.
Sample-based Distributional Policy Gradient. Conference on Learning for Dynamics & Control (L4DC), 2020.
Learning to Score Behaviors for Guided Policy Optimization. International Conference on Machine Learning (ICML), 2019.