Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.03198
Cited By
Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning
7 September 2019
Wenjie Shi
Shiji Song
Cheng Wu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning"
5 / 5 papers shown
Title
Simultaneous Best-Response Dynamics in Random Potential Games
Galit Ashkenazi-Golan
Domenico Mergoni Cecchelli
Edward Plumb
24
0
0
15 May 2025
Behavior evolution-inspired approach to walking gait reinforcement training for quadruped robots
Yu Wang
Wenchuan Jia
Yi Sun
Dong He
32
0
0
25 Sep 2024
Residual-MPPI: Online Policy Customization for Continuous Control
Pengcheng Wang
Chenran Li
Catherine Weaver
Kenta Kawamoto
Masayoshi Tomizuka
Chen Tang
Wei Zhan
OffRL
37
3
0
01 Jul 2024
Inverse Decision Modeling: Learning Interpretable Representations of Behavior
Daniel Jarrett
Alihan Huyuk
M. Schaar
AI4CE
22
27
0
28 Oct 2023
Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework
Amber Srivastava
S. Salapaka
14
11
0
17 Jun 2020
1