A Temporal-Difference Approach to Policy Gradient EstimationInternational Conference on Machine Learning (ICML), 2022 |
Contextual Latent-Movements Off-Policy Optimization for Robotic
Manipulation SkillsIEEE International Conference on Robotics and Automation (ICRA), 2020 |
Statistically Efficient Off-Policy Policy GradientsInternational Conference on Machine Learning (ICML), 2020 |