Average Reward Adjusted Discounted Reinforcement Learning: Near-Blackwell-Optimal Policies for Real-World Applications

2 April 2020

Papers citing "Average Reward Adjusted Discounted Reinforcement Learning: Near-Blackwell-Optimal Policies for Real-World Applications"

5 / 5 papers shown

Title
Efficient Computation of Blackwell Optimal Policies using Rational Functions Dibyangshu Mukherjee Shivaram Kalyanakrishnan OffRL 20 1 0 25 Aug 2025
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount FactorNeural Information Processing Systems (NeurIPS), 2023 Julien Grand-Clément Marko Petrik 118 19 0 31 Jan 2023
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods Xin Guo Anran Hu Junzi Zhang OffRL 125 10 0 13 Sep 2021
A nearly Blackwell-optimal policy gradient method Vektor Dewanto M. Gallagher OffRL 80 1 0 28 May 2021
Average-reward model-free reinforcement learning: a systematic review and literature mapping Vektor Dewanto George Dunn A. Eshragh M. Gallagher Fred Roosta 168 36 0 18 Oct 2020