Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2212.00603
Cited By
Near Sample-Optimal Reduction-based Policy Learning for Average Reward MDP
1 December 2022
Jinghan Wang
Meng-Xian Wang
Lin F. Yang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Near Sample-Optimal Reduction-based Policy Learning for Average Reward MDP"
3 / 3 papers shown
Title
Stochastic Halpern iteration in normed spaces and applications to reinforcement learning
Mario Bravo
Juan Pablo Contreras
38
3
0
19 Mar 2024
Span-Based Optimal Sample Complexity for Average Reward MDPs
M. Zurek
Yudong Chen
26
6
0
22 Nov 2023
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor
Julien Grand-Clément
Marko Petrik
22
12
0
31 Jan 2023
1