Average Reward Adjusted Discounted Reinforcement Learning:
  Near-Blackwell-Optimal Policies for Real-World Applications

Papers citing "Average Reward Adjusted Discounted Reinforcement Learning: Near-Blackwell-Optimal Policies for Real-World Applications"