If MaxEnt RL is the Answer, What is the Question?

4 October 2019

Papers citing "If MaxEnt RL is the Answer, What is the Question?"


Title | Authors | Tags | Counts | Date
Recent Advances in Path Integral Control for Trajectory Optimization: An Overview in Theoretical and Algorithmic Perspectives | Muhammad Kazim, JunGee Hong, Min-Gyeom Kim, Kwang-Ki K. Kim | - | 37 16 0 | 22 Sep 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality | François Ged, M. H. Veiga | - | 31 0 0 | 22 Mar 2023
Fast Rates for Maximum Entropy Exploration | D. Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Rémi Munos, A. Naumov, Pierre Perrault, Yunhao Tang, Michal Valko, Pierre Menard | - | 44 17 0 | 14 Mar 2023
Offline RL Policies Should be Trained to be Adaptive | Dibya Ghosh, Anurag Ajay, Pulkit Agrawal, Sergey Levine | OffRL | 35 45 0 | 05 Jul 2022
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | Jinxin Liu, Hongyin Zhang, Donglin Wang | OffRL | 38 32 0 | 13 Mar 2022
Model-Free Risk-Sensitive Reinforcement Learning | Grégoire Delétang, Jordi Grau-Moya, M. Kunesch, Tim Genewein, Rob Brekelmans, Shane Legg, Pedro A. Ortega | OOD | 10 9 0 | 04 Nov 2021
Divergence-Regularized Multi-Agent Actor-Critic | Kefan Su, Zongqing Lu | - | 46 25 0 | 01 Oct 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability | Dibya Ghosh, Jad Rahme, Aviral Kumar, Amy Zhang, Ryan P. Adams, Sergey Levine | OffRL | 278 109 0 | 13 Jul 2021
Same State, Different Task: Continual Reinforcement Learning without Interference | Samuel Kessler, Jack Parker-Holder, Philip J. Ball, S. Zohren, Stephen J. Roberts | CLL, OffRL | 19 46 0 | 05 Jun 2021
Deep Reinforcement Learning for Closed-Loop Blood Glucose Control | Ian Fox, Joyce M. Lee, R. Pop-Busui, Jenna Wiens | BDL, OffRL | 27 50 0 | 18 Sep 2020
Control as Hybrid Inference | Alexander Tschantz, Beren Millidge, A. Seth, Christopher L. Buckley | - | 19 9 0 | 11 Jul 2020
Ready Policy One: World Building Through Active Learning | Philip J. Ball, Jack Parker-Holder, Aldo Pacchiano, K. Choromanski, Stephen J. Roberts | OffRL | 32 49 0 | 07 Feb 2020