Towards Safe Policy Improvement for Non-Stationary MDPs

23 October 2020
Yash Chandak
Scott M. Jordan
Georgios Theocharous
Martha White
Philip S. Thomas
    OffRL

Papers citing "Towards Safe Policy Improvement for Non-Stationary MDPs"

5 / 5 papers shown
Pausing Policy Learning in Non-stationary Reinforcement Learning
Hyunin Lee
Ming Jin
Javad Lavaei
Somayeh Sojoudi
OffRL
18
2
0
25 May 2024
Demystifying Reinforcement Learning in Time-Varying Systems
Pouya Hamadanian
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
17
1
0
14 Jan 2022
Is the Rush to Machine Learning Jeopardizing Safety? Results of a Survey
M. Askarpour
Alan Wassyng
M. Lawford
R. Paige
Z. Diskin
12
0
0
29 Nov 2021
Coarse-Grained Smoothness for RL in Metric Spaces
Omer Gottesman
Kavosh Asadi
Cameron Allen
Sam Lobel
G. Konidaris
Michael Littman
17
3
0
23 Oct 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
21
52
0
26 Apr 2021