Data-Efficient Policy Evaluation Through Behavior Policy SearchInternational Conference on Machine Learning (ICML), 2017 |
Generalized Value Iteration Networks: Life Beyond LatticesAAAI Conference on Artificial Intelligence (AAAI), 2017 |
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive EnvironmentsNeural Information Processing Systems (NeurIPS), 2017 |
Parameter Space Noise for ExplorationInternational Conference on Learning Representations (ICLR), 2017 |
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient
Estimation for Deep Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2017 |
Constrained Policy OptimizationInternational Conference on Machine Learning (ICML), 2017 |
Learning End-to-end Multimodal Sensor Policies for Autonomous NavigationConference on Robot Learning (CoRL), 2017 |