Title |
---|
Soft Actor-Critic Algorithms and Applications. Tuomas Haarnoja, Aurick Zhou, Kristian Hartikainen, George Tucker, Sehoon Ha, ..., Vikash Kumar, Henry Zhu, Abhishek Gupta, Pieter Abbeel, Sergey Levine |
Relative Entropy Regularized Policy Iteration. A. Abdolmaleki, Jost Tobias Springenberg, Jonas Degrave, Steven Bohez, Yuval Tassa, Dan Belov, N. Heess, Martin Riedmiller |
Learning by Playing - Solving Sparse Reward Tasks from Scratch. Martin Riedmiller, Roland Hafner, Thomas Lampe, Michael Neunert, Jonas Degrave, T. Wiele, Volodymyr Mnih, N. Heess, Jost Tobias Springenberg |