Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification

7 November 2022
Takumi Tanabe
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
    OffRL

Papers citing "Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification"

7 / 7 papers shown
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Shangding Gu
Laixi Shi
Muning Wen
Ming Jin
Eric Mazumdar
Yuejie Chi
Adam Wierman
C. Spanos
OOD
OffRL
36
1
0
27 Feb 2025
Solving robust MDPs as a sequence of static RL problems
Adil Zouitine
Matthieu Geist
Emmanuel Rachelson
16
0
0
08 Oct 2024
RRLS : Robust Reinforcement Learning Suite
Adil Zouitine
David Bertoin
Pierre Clavier
Matthieu Geist
Emmanuel Rachelson
OffRL
30
1
0
12 Jun 2024
Time-Constrained Robust MDPs
Adil Zouitine
David Bertoin
Pierre Clavier
Matthieu Geist
Emmanuel Rachelson
OOD
29
0
0
12 Jun 2024
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
38
0
0
06 Jun 2024
Towards Minimax Optimality of Model-based Robust Reinforcement Learning
Pierre Clavier
E. L. Pennec
M. Geist
17
12
0
10 Feb 2023
Robust Reinforcement Learning on State Observations with Learned Optimal Adversary
Huan Zhang
Hongge Chen
Duane S. Boning
Cho-Jui Hsieh
59
162
0
21 Jan 2021