
Average Reward Adjusted Discounted Reinforcement Learning:
Near-Blackwell-Optimal Policies for Real-World Applications
Papers citing "Average Reward Adjusted Discounted Reinforcement Learning: Near-Blackwell-Optimal Policies for Real-World Applications"
Title |
|---|
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor. Neural Information Processing Systems (NeurIPS), 2023 |





