Near Sample-Optimal Reduction-based Policy Learning for Average Reward MDP

1 December 2022

Papers citing "Near Sample-Optimal Reduction-based Policy Learning for Average Reward MDP"

3 / 3 papers shown

Title
Stochastic Halpern iteration in normed spaces and applications to reinforcement learning Mario Bravo Juan Pablo Contreras 38 3 0 19 Mar 2024
Span-Based Optimal Sample Complexity for Average Reward MDPs M. Zurek Yudong Chen 26 6 0 22 Nov 2023
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor Julien Grand-Clément Marko Petrik 22 12 0 31 Jan 2023