ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.04354
  4. Cited By

A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret

8 June 2020
Mehdi Jafarnia-Jahromi
Chen-Yu Wei
Rahul Jain
Haipeng Luo
ArXivPDFHTML

Papers citing "A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret"

3 / 3 papers shown
Title
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free
  Reinforcement Learning
Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning
Gen Li
Laixi Shi
Yuxin Chen
Yuejie Chi
OffRL
45
50
0
09 Oct 2021
Online Reinforcement Learning of Optimal Threshold Policies for Markov
  Decision Processes
Online Reinforcement Learning of Optimal Threshold Policies for Markov Decision Processes
Arghyadip Roy
Vivek Borkar
A. Karandikar
P. Chaporkar
OffRL
14
20
0
21 Dec 2019
Model-free Reinforcement Learning in Infinite-horizon Average-reward
  Markov Decision Processes
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
99
0
15 Oct 2019
1