2006.04354
A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret
8 June 2020
Mehdi Jafarnia-Jahromi
Chen-Yu Wei
Rahul Jain
Haipeng Luo
Papers citing
"A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret"
3 / 3 papers shown
Title
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning
Gen Li
Laixi Shi
Yuxin Chen
Yuejie Chi
OffRL
45
50
0
09 Oct 2021
Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
Arghyadip Roy
Vivek Borkar
A. Karandikar
P. Chaporkar
OffRL
14
20
0
21 Dec 2019
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
99
0
15 Oct 2019