Title |
---|
![]() Decoupled Exploration and Exploitation Policies for Sample-Efficient
Reinforcement Learning William F. Whitney Michael Bloesch Jost Tobias Springenberg A. Abdolmaleki Kyunghyun Cho Martin Riedmiller |
![]() Towards General and Autonomous Learning of Core Skills: A Case Study in
Locomotion Roland Hafner Tim Hertweck Philipp Kloppner Michael Bloesch Michael Neunert Markus Wulfmeier S. Tunyasuvunakool N. Heess Martin Riedmiller |
![]() Data-efficient Hindsight Off-policy Option Learning Markus Wulfmeier Dushyant Rao Roland Hafner Thomas Lampe A. Abdolmaleki ...Michael Neunert Dhruva Tirumala Noah Y. Siegel N. Heess Martin Riedmiller |
![]() Compositional Transfer in Hierarchical Reinforcement Learning Markus Wulfmeier A. Abdolmaleki Roland Hafner Jost Tobias Springenberg Michael Neunert Tim Hertweck Thomas Lampe Noah Y. Siegel N. Heess Martin Riedmiller |
![]() Relative Entropy Regularized Policy Iteration A. Abdolmaleki Jost Tobias Springenberg Jonas Degrave Steven Bohez Yuval Tassa Dan Belov N. Heess Martin Riedmiller |