Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.06257
Cited By
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
10 March 2021
Benjamin Eysenbach
Sergey Levine
OOD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Maximum Entropy RL (Provably) Solves Some Robust RL Problems"
50 / 114 papers shown
Title
Trajectory Entropy Reinforcement Learning for Predictable and Robust Control
Bang You
Chenxu Wang
Huaping Liu
24
0
0
07 May 2025
Improving Controller Generalization with Dimensionless Markov Decision Processes
V. Charvet
Sebastian Stein
R. Murray-Smith
31
0
0
14 Apr 2025
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Nguyen H K. Do
Truc Nguyen
Malik Hassanaly
Raed Alharbi
Jung Taek Seo
My T. Thai
49
0
0
09 Mar 2025
Performance Comparisons of Reinforcement Learning Algorithms for Sequential Experimental Design
Yasir Zubayr Barlas
Kizito Salako
32
0
0
07 Mar 2025
TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control
Zifeng Zhuang
Diyuan Shi
Runze Suo
Xiao He
Hongyin Zhang
Ting Wang
Shangke Lyu
Donglin Wang
37
0
0
24 Feb 2025
Synthesis of Model Predictive Control and Reinforcement Learning: Survey and Classification
Rudolf Reiter
Jasper Hoffmann
D. Reinhardt
Florian Messerer
Katrin Baumgärtner
Shamburaj Sawant
Joschka Boedecker
Moritz Diehl
S. Gros
77
5
0
04 Feb 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
49
2
0
29 Jan 2025
Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning
Rémy Hosseinkhan Boucher
Onofrio Semeraro
L. Mathelin
72
0
0
28 Jan 2025
Average-Reward Reinforcement Learning with Entropy Regularization
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
R. Kulkarni
OOD
45
2
0
17 Jan 2025
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
82
0
0
16 Dec 2024
Bounded Rationality Equilibrium Learning in Mean Field Games
Yannick Eich
Christian Fabian
Kai Cui
Heinz Koeppl
31
0
0
11 Nov 2024
Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping
Bavo Lesy
Ali Anwar
Siegfried Mercelis
24
0
0
07 Nov 2024
Risk-sensitive control as inference with Rényi divergence
Kaito Ito
Kenji Kashima
29
1
0
04 Nov 2024
Maximum Entropy Hindsight Experience Replay
Douglas C. Crowder
Matthew L. Trappett
Darrien M. McKenzie
Frances S. Chance
22
0
0
31 Oct 2024
Solving robust MDPs as a sequence of static RL problems
Adil Zouitine
Matthieu Geist
Emmanuel Rachelson
18
0
0
08 Oct 2024
Autonomous Wheel Loader Navigation Using Goal-Conditioned Actor-Critic MPC
Aleksi Mäki-Penttilä
Naeim Ebrahimi Toulkani
Reza Ghabcheloo
29
0
0
24 Sep 2024
Average-Reward Maximum Entropy Reinforcement Learning for Underactuated Double Pendulum Tasks
Jean Seong Bjorn Choe
Bumkyu Choi
Jong-kook Kim
22
2
0
13 Sep 2024
How to Choose a Reinforcement-Learning Algorithm
Fabian Bongratz
Vladimir Golkov
Lukas Mautner
Luca Della Libera
Frederik Heetmeyer
Felix Czaja
Julian Rodemann
Daniel Cremers
21
1
0
30 Jul 2024
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
Jean Seong Bjorn Choe
Jong-Kook Kim
38
2
0
25 Jul 2024
Adapting Image-based RL Policies via Predicted Rewards
Weiyao Wang
Xinyuan Fang
Gregory D. Hager
36
0
0
23 Jul 2024
Understanding Reference Policies in Direct Preference Optimization
Yixin Liu
Pengfei Liu
Arman Cohan
31
7
0
18 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Boosting Soft Q-Learning by Bounding
Jacob Adamczyk
Volodymyr Makarenko
Stas Tiomkin
Rahul V. Kulkarni
OffRL
35
2
0
26 Jun 2024
Roping in Uncertainty: Robustness and Regularization in Markov Games
Jeremy McMahan
Giovanni Artiglio
Qiaomin Xie
37
2
0
13 Jun 2024
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
38
0
0
06 Jun 2024
AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model Predictive Control
Rudolf Reiter
Andrea Ghezzi
Katrin Baumgärtner
Jasper Hoffmann
Robert D. McAllister
Moritz Diehl
34
6
0
06 Jun 2024
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization
Jiancong Xiao
Ziniu Li
Xingyu Xie
E. Getzen
Cong Fang
Qi Long
Weijie J. Su
41
12
0
26 May 2024
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow
Chen-Hao Chao
Chien Feng
Wei-Fang Sun
Cheng-Kuang Lee
Simon See
Chun-Yi Lee
33
1
0
22 May 2024
Reward-Punishment Reinforcement Learning with Maximum Entropy
Jiexin Wang
E. Uchibe
19
0
0
20 May 2024
S
2
^2
2
AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
Safa Messaoud
Billel Mokeddem
Zhenghai Xue
L. Pang
Bo An
Haipeng Chen
Sanjay Chawla
33
3
0
02 May 2024
Tsallis Entropy Regularization for Linearly Solvable MDP and Linear Quadratic Regulator
Yota Hashizume
Koshi Oishi
Kenji Kashima
27
1
0
04 Mar 2024
Imitation-regularized Optimal Transport on Networks: Provable Robustness and Application to Logistics Planning
Koshi Oishi
Yota Hashizume
Tomohiko Jimbo
Hirotaka Kaji
Kenji Kashima
OOD
30
2
0
28 Feb 2024
Blending Data-Driven Priors in Dynamic Games
Justin Lidard
Haimin Hu
Asher Hancock
Zixu Zhang
Albert Gimó Contreras
...
Deepak Gopinath
Guy Rosman
Naomi Ehrich Leonard
María Santos
J. F. Fisac
OffRL
35
5
0
21 Feb 2024
Robust agents learn causal world models
Jonathan G. Richens
Tom Everitt
OOD
116
36
0
16 Feb 2024
Discrete Probabilistic Inference as Control in Multi-path Environments
T. Deleu
Padideh Nouri
Nikolay Malkin
Doina Precup
Yoshua Bengio
111
28
0
15 Feb 2024
Risk-Sensitive Soft Actor-Critic for Robust Deep Reinforcement Learning under Distribution Shifts
Tobias Enders
James Harrison
Maximilian Schiffer
OOD
41
3
0
15 Feb 2024
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
27
2
0
26 Jan 2024
Task-Oriented Active Learning of Model Preconditions for Inaccurate Dynamics Models
A. LaGrassa
Moonyoung Lee
Oliver Kroemer
16
2
0
08 Jan 2024
Improve Robustness of Reinforcement Learning against Observation Perturbations via
l
∞
l_\infty
l
∞
Lipschitz Policy Networks
Buqing Nie
Jingtian Ji
Yangqing Fu
Yue Gao
37
4
0
14 Dec 2023
Reward Certification for Policy Smoothed Reinforcement Learning
Ronghui Mu
Leandro Soriano Marcolino
Tianle Zhang
Yanghao Zhang
Xiaowei Huang
Wenjie Ruan
23
4
0
11 Dec 2023
Guaranteed Trust Region Optimization via Two-Phase KL Penalization
K. Zentner
Ujjwal Puri
Zhehui Huang
Gaurav Sukhatme
OffRL
14
0
0
08 Dec 2023
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models
Xue Yan
Yan Song
Xinyu Cui
Filippos Christianos
Haifeng Zhang
D. Mguni
Jun Wang
LRM
114
6
0
27 Oct 2023
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
Dexter Neo
Tsuhan Chen
17
1
0
26 Oct 2023
Constrained Reweighting of Distributions: an Optimal Transport Approach
Abhisek Chakraborty
A. Bhattacharya
D. Pati
23
1
0
19 Oct 2023
Sim-to-Real Transfer of Adaptive Control Parameters for AUV Stabilization under Current Disturbance
Thomas Chaffre
J. Wheare
A. Lammas
Paulo E. Santos
G. Chenadec
Karl Sammut
Benoit Clement
13
1
0
17 Oct 2023
Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization
Simin Li
Ruixiao Xu
Jingqiao Xiu
Yuwei Zheng
Pu Feng
Yaodong Yang
Xianglong Liu
23
3
0
15 Oct 2023
Amortizing intractable inference in large language models
Marvin Schmitt
Moksh Jain
Daniel Habermann
Younesse Kaddar
Ullrich Kothe
Stefan T. Radev
Nikolay Malkin
AIFin
BDL
24
46
0
06 Oct 2023
Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards
Silviu Pitis
35
5
0
30 Sep 2023
Maximum diffusion reinforcement learning
Thomas A. Berrueta
Allison Pinosky
Todd D. Murphey
AI4CE
DiffM
9
5
0
26 Sep 2023
Efficient Belief Road Map for Planning Under Uncertainty
Zhenyang Chen
Hongzhe Yu
Yongxin Chen
18
0
0
17 Sep 2023
1
2
3
Next