Title |
---|
![]() Concave Utility Reinforcement Learning: the Mean-Field Game Viewpoint M. Geist Julien Pérolat Mathieu Laurière Romuald Elie Sarah Perrin Olivier Bachem Rémi Munos Olivier Pietquin |
![]() Decoupled Exploration and Exploitation Policies for Sample-Efficient
Reinforcement Learning William F. Whitney Michael Bloesch Jost Tobias Springenberg A. Abdolmaleki Kyunghyun Cho Martin Riedmiller |
![]() Data-efficient Hindsight Off-policy Option Learning Markus Wulfmeier Dushyant Rao Roland Hafner Thomas Lampe A. Abdolmaleki ...Michael Neunert Dhruva Tirumala Noah Y. Siegel N. Heess Martin Riedmiller |
![]() What Matters In On-Policy Reinforcement Learning? A Large-Scale
Empirical Study Marcin Andrychowicz Anton Raichuk Piotr Stańczyk Manu Orsini Sertan Girgin ...M. Geist Olivier Pietquin Marcin Michalski Sylvain Gelly Olivier Bachem |