Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms. AAAI Conference on Artificial Intelligence (AAAI), 2021.
Differentiable Trust Region Layers for Deep Reinforcement Learning. International Conference on Learning Representations (ICLR), 2021.
Relative Entropy Regularized Policy Iteration. A. Abdolmaleki, Jost Tobias Springenberg, Jonas Degrave, Steven Bohez, Yuval Tassa, Dan Belov, N. Heess, Martin Riedmiller.