ResearchTrend.AI
Maximum Entropy RL (Provably) Solves Some Robust RL Problems

10 March 2021
Benjamin Eysenbach
Sergey Levine
    OOD

Papers citing "Maximum Entropy RL (Provably) Solves Some Robust RL Problems"

50 / 114 papers shown
Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization
Uri Gadot
E. Derman
Navdeep Kumar
Maxence Mohamed Elfatihi
Kfir Y. Levy
Shie Mannor
27
5
0
03 Sep 2023
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
11
5
0
26 Jul 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
35
19
0
17 Jul 2023
Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity
Runyu Zhang
Yang Hu
Na Li
33
5
0
20 Jun 2023
Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Miguel Suau
M. Spaan
F. Oliehoek
CML
14
4
0
04 Jun 2023
Solving Robust MDPs through No-Regret Dynamics
E. Guha
20
0
0
30 May 2023
Reinforcement Learning with Simple Sequence Priors
Tankred Saanum
N. Éltető
Peter Dayan
Marcel Binz
Eric Schulz
OffRL
21
7
0
26 May 2023
Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies
Hanna Ziesche
Leonel Rozo
21
5
0
17 May 2023
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
13
2
0
15 May 2023
Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality
François Ged
M. H. Veiga
21
0
0
22 Mar 2023
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization
E. Derman
Yevgeniy Men
M. Geist
Shie Mannor
34
1
0
12 Mar 2023
Decision-Making Under Uncertainty: Beyond Probabilities
Thom S. Badings
T. D. Simão
Marnix Suilen
N. Jansen
UD
PER
13
12
0
10 Mar 2023
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint
Taisuke Kobayashi
29
3
0
08 Mar 2023
Bounding the Optimal Value Function in Compositional Reinforcement Learning
Jacob Adamczyk
Volodymyr Makarenko
A. Arriojas
Stas Tiomkin
R. Kulkarni
OffRL
27
2
0
05 Mar 2023
Multi-Start Team Orienteering Problem for UAS Mission Re-Planning with Data-Efficient Deep Reinforcement Learning
Dong Ho Lee
Jaemyung Ahn
14
6
0
02 Mar 2023
Minimax-Bayes Reinforcement Learning
Thomas Kleine Buening
Christos Dimitrakakis
Hannes Eriksson
Divya Grover
Emilio Jorge
OffRL
16
5
0
21 Feb 2023
Leveraging Prior Knowledge in Reinforcement Learning via Double-Sided Bounds on the Value Function
Jacob Adamczyk
Stas Tiomkin
R. Kulkarni
OffRL
20
0
0
19 Feb 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Zuxin Liu
Zijian Guo
Yi-Fan Yao
Zhepeng Cen
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
26
46
0
14 Feb 2023
A general Markov decision process formalism for action-state entropy-regularized reward maximization
D. Grytskyy
Jorge Ramírez-Ruiz
R. Moreno-Bote
22
3
0
02 Feb 2023
An Efficient Solution to s-Rectangular Robust Markov Decision Processes
Navdeep Kumar
Kfir Y. Levy
Kaixin Wang
Shie Mannor
23
1
0
31 Jan 2023
Policy Gradient for Rectangular Robust Markov Decision Processes
Navdeep Kumar
E. Derman
M. Geist
Kfir Y. Levy
Shie Mannor
16
19
0
31 Jan 2023
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Mengdi Wang
Furong Huang
Dinesh Manocha
24
7
0
28 Jan 2023
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training
Philipp Altmann
Thomy Phan
Fabian Ritz
Thomas Gabor
Claudia Linnhoff-Popien
OffRL
14
1
0
18 Jan 2023
Robust Average-Reward Markov Decision Processes
Yue Wang
Alvaro Velasquez
George K. Atia
Ashley Prater-Bennette
Shaofeng Zou
31
11
0
02 Jan 2023
Certified Policy Smoothing for Cooperative Multi-Agent Reinforcement Learning
Ronghui Mu
Wenjie Ruan
Leandro Soriano Marcolino
Gaojie Jin
Q. Ni
30
5
0
22 Dec 2022
Risk-Sensitive Reinforcement Learning with Exponential Criteria
Erfaun Noorani
Christos N. Mavridis
John S. Baras
25
8
0
18 Dec 2022
Resilience Evaluation of Entropy Regularized Logistic Networks with Probabilistic Cost
Koshi Oishi
Yota Hashizume
Tomohiko Jimbo
Hirotaka Kaji
Kenji Kashima
13
2
0
05 Dec 2022
Utilizing Prior Solutions for Reward Shaping and Composition in Entropy-Regularized Reinforcement Learning
Jacob Adamczyk
A. Arriojas
Stas Tiomkin
R. Kulkarni
29
8
0
02 Dec 2022
Path Planning Using Wassertein Distributionally Robust Deep Q-learning
Cem Alptürk
Venkatraman Renganathan
OOD
11
0
0
04 Nov 2022
Latent State Marginalization as a Low-cost Approach for Improving Exploration
Dinghuai Zhang
Aaron Courville
Yoshua Bengio
Qinqing Zheng
Amy Zhang
Ricky T. Q. Chen
OOD
17
9
0
03 Oct 2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Yannick Hogewind
T. D. Simão
Tal Kachman
N. Jansen
14
10
0
02 Oct 2022
On the convex formulations of robust Markov decision processes
Julien Grand-Clément
Marek Petrik
46
10
0
21 Sep 2022
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning
Xianfu Chen
Zhifeng Zhao
S. Mao
Celimuge Wu
Honggang Zhang
M. Bennis
OffRL
18
3
0
19 Sep 2022
Example When Local Optimal Policies Contain Unstable Control
B. Song
Jean-Jacques E. Slotine
Quang-Cuong Pham
36
1
0
15 Sep 2022
A Gaussian variational inference approach to motion planning
Hongzhe Yu
Yongxin Chen
32
16
0
13 Sep 2022
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL
AI4CE
22
31
0
13 Jul 2022
Conditional Energy-Based Models for Implicit Policies: The Gap between Theory and Practice
Duy-Nguyen Ta
Eric A. Cousineau
Huihua Zhao
Siyuan Feng
26
3
0
12 Jul 2022
Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization
Yuan Zhang
Jianhong Wang
Joschka Boedecker
34
3
0
05 Jul 2022
Robust Reinforcement Learning with Distributional Risk-averse formulation
Pierre Clavier
S. Allassonnière
E. L. Pennec
OOD
31
7
0
14 Jun 2022
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu
Zijian Guo
Zhepeng Cen
Huan Zhang
Jie Tan
Bo-wen Li
Ding Zhao
OOD
OffRL
37
35
0
29 May 2022
Efficient Policy Iteration for Robust Markov Decision Processes via Regularization
Navdeep Kumar
Kfir Y. Levy
Kaixin Wang
Shie Mannor
11
18
0
28 May 2022
Complex behavior from intrinsic motivation to occupy action-state path space
Jorge Ramírez-Ruiz
D. Grytskyy
Chiara Mastrogiuseppe
Yamen Habib
R. Moreno-Bote
22
7
0
20 May 2022
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang
Shaofeng Zou
81
67
0
15 May 2022
SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics
Yannis Flet-Berliac
D. Basu
AAML
12
8
0
20 Apr 2022
Divide & Conquer Imitation Learning
Alexandre Chenu
Nicolas Perrin-Gilbert
Olivier Sigaud
8
5
0
15 Apr 2022
Maximum entropy optimal density control of discrete-time linear systems and Schrödinger bridges
Kaito Ito
Kenji Kashima
11
12
0
11 Apr 2022
Your Policy Regularizer is Secretly an Adversary
Rob Brekelmans
Tim Genewein
Jordi Grau-Moya
Grégoire Delétang
M. Kunesch
Shane Legg
Pedro A. Ortega
AAML
13
12
0
23 Mar 2022
Do You Need the Entropy Reward (in Practice)?
Haonan Yu
Haichao Zhang
Wei-ping Xu
28
7
0
28 Jan 2022
A Statistical Analysis of Polyak-Ruppert Averaged Q-learning
Xiang Li
Wenhao Yang
Jiadong Liang
Zhihua Zhang
Michael I. Jordan
32
15
0
29 Dec 2021
Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
Dailin Hu
Pieter Abbeel
Roy Fox
16
1
0
28 Nov 2021