ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.03413
  4. Cited By
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness
  to Model Misspecification
v1v2 (latest)

Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification

Neural Information Processing Systems (NeurIPS), 2022
7 November 2022
Takumi Tanabe
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification"

10 / 10 papers shown
Adversarial Diffusion for Robust Reinforcement Learning
Adversarial Diffusion for Robust Reinforcement Learning
Daniele Foffano
Alessio Russo
Alexandre Proutiere
193
2
0
28 Sep 2025
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer
Yarden As
Chengrui Qu
Benjamin Unger
Dongho Kang
Max van der Hart
Laixi Shi
Stelian Coros
Adam Wierman
Andreas Krause
OffRL
389
2
0
23 Sep 2025
Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation
Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation
Kosuke Nakanishi
Akihiro Kubo
Yuji Yasui
Shin Ishii
AAMLOffRL
277
0
0
20 Jun 2025
Maximum Total Correlation Reinforcement Learning
Maximum Total Correlation Reinforcement Learning
Bang You
Puze Liu
Huaping Liu
Jan Peters
Oleg Arenz
229
3
0
22 May 2025
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Shangding Gu
Laixi Shi
Muning Wen
Ming Jin
Eric Mazumdar
Yuejie Chi
Adam Wierman
C. Spanos
OODOffRL
469
13
0
27 Feb 2025
Solving robust MDPs as a sequence of static RL problems
Solving robust MDPs as a sequence of static RL problems
Adil Zouitine
Matthieu Geist
Emmanuel Rachelson
375
2
0
08 Oct 2024
RRLS : Robust Reinforcement Learning Suite
RRLS : Robust Reinforcement Learning Suite
Adil Zouitine
David Bertoin
Pierre Clavier
Matthieu Geist
Emmanuel Rachelson
OffRL
313
3
0
12 Jun 2024
Time-Constrained Robust MDPs
Time-Constrained Robust MDPs
Adil Zouitine
David Bertoin
Pierre Clavier
Matthieu Geist
Emmanuel Rachelson
OOD
244
5
0
12 Jun 2024
Bootstrapping Expectiles in Reinforcement Learning
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
329
1
0
06 Jun 2024
Towards Minimax Optimality of Model-based Robust Reinforcement Learning
Towards Minimax Optimality of Model-based Robust Reinforcement LearningConference on Uncertainty in Artificial Intelligence (UAI), 2023
Pierre Clavier
E. L. Pennec
Matthieu Geist
476
20
0
10 Feb 2023
1
Page 1 of 1