Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning

7 September 2019

Papers citing "Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning"

5 / 5 papers shown

Title
Simultaneous Best-Response Dynamics in Random Potential Games Galit Ashkenazi-Golan Domenico Mergoni Cecchelli Edward Plumb 24 0 0 15 May 2025
Behavior evolution-inspired approach to walking gait reinforcement training for quadruped robots Yu Wang Wenchuan Jia Yi Sun Dong He 32 0 0 25 Sep 2024
Residual-MPPI: Online Policy Customization for Continuous Control Pengcheng Wang Chenran Li Catherine Weaver Kenta Kawamoto Masayoshi Tomizuka Chen Tang Wei Zhan OffRL 37 3 0 01 Jul 2024
Inverse Decision Modeling: Learning Interpretable Representations of Behavior Daniel Jarrett Alihan Huyuk M. Schaar AI4CE 22 27 0 28 Oct 2023
Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework Amber Srivastava S. Salapaka 14 11 0 17 Jun 2020