If MaxEnt RL is the Answer, What is the Question?

4 October 2019
Benjamin Eysenbach
Sergey Levine

Papers citing "If MaxEnt RL is the Answer, What is the Question?"

12 / 12 papers shown
Recent Advances in Path Integral Control for Trajectory Optimization: An Overview in Theoretical and Algorithmic Perspectives
Muhammad Kazim
JunGee Hong
Min-Gyeom Kim
Kwang-Ki K. Kim
37
16
0
22 Sep 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
31
0
0
22 Mar 2023
Fast Rates for Maximum Entropy Exploration
D. Tiapkin
Denis Belomestny
Daniele Calandriello
Eric Moulines
Rémi Munos
A. Naumov
Pierre Perrault
Yunhao Tang
Michal Valko
Pierre Menard
44
17
0
14 Mar 2023
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
35
45
0
05 Jul 2022
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning
Jinxin Liu
Hongyin Zhang
Donglin Wang
OffRL
38
32
0
13 Mar 2022
Model-Free Risk-Sensitive Reinforcement Learning
Grégoire Delétang
Jordi Grau-Moya
M. Kunesch
Tim Genewein
Rob Brekelmans
Shane Legg
Pedro A. Ortega
OOD
10
9
0
04 Nov 2021
Divergence-Regularized Multi-Agent Actor-Critic
Kefan Su
Zongqing Lu
46
25
0
01 Oct 2021
Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Dibya Ghosh
Jad Rahme
Aviral Kumar
Amy Zhang
Ryan P. Adams
Sergey Levine
OffRL
278
109
0
13 Jul 2021
Same State, Different Task: Continual Reinforcement Learning without Interference
Samuel Kessler
Jack Parker-Holder
Philip J. Ball
S. Zohren
Stephen J. Roberts
CLL
OffRL
19
46
0
05 Jun 2021
Deep Reinforcement Learning for Closed-Loop Blood Glucose Control
Ian Fox
Joyce M. Lee
R. Pop-Busui
Jenna Wiens
BDL
OffRL
27
50
0
18 Sep 2020
Control as Hybrid Inference
Alexander Tschantz
Beren Millidge
A. Seth
Christopher L. Buckley
19
9
0
11 Jul 2020
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
32
49
0
07 Feb 2020