ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.00099
  4. Cited By
Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety
  Constraints in Finite MDPs

Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs

31 May 2021
Harsh Satija
Philip S. Thomas
Joelle Pineau
Romain Laroche
    OffRL
ArXivPDFHTML

Papers citing "Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs"

4 / 4 papers shown
Title
Direction-oriented Multi-objective Learning: Simple and Provable
  Stochastic Algorithms
Direction-oriented Multi-objective Learning: Simple and Provable Stochastic Algorithms
Peiyao Xiao
Hao Ban
Kaiyi Ji
44
19
0
28 May 2023
Safe Policy Improvement for POMDPs via Finite-State Controllers
Safe Policy Improvement for POMDPs via Finite-State Controllers
T. D. Simão
Marnix Suilen
N. Jansen
OffRL
37
9
0
12 Jan 2023
Offline Policy Optimization with Eligible Actions
Offline Policy Optimization with Eligible Actions
Yao Liu
Yannis Flet-Berliac
Emma Brunskill
OffRL
25
5
0
01 Jul 2022
Non-Markovian policies occupancy measures
Non-Markovian policies occupancy measures
Romain Laroche
Rémi Tachet des Combes
Jacob Buckman
OffRL
39
1
0
27 May 2022
1