ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.08920
  4. Cited By
Average-reward model-free reinforcement learning: a systematic review
  and literature mapping

Average-reward model-free reinforcement learning: a systematic review and literature mapping

18 October 2020
Vektor Dewanto
George Dunn
A. Eshragh
M. Gallagher
Fred Roosta
ArXivPDFHTML

Papers citing "Average-reward model-free reinforcement learning: a systematic review and literature mapping"

9 / 9 papers shown
Title
Average-Reward Maximum Entropy Reinforcement Learning for Underactuated
  Double Pendulum Tasks
Average-Reward Maximum Entropy Reinforcement Learning for Underactuated Double Pendulum Tasks
Jean Seong Bjorn Choe
Bumkyu Choi
Jong-kook Kim
40
2
0
13 Sep 2024
NeoRL: Efficient Exploration for Nonepisodic RL
NeoRL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
Lenart Treven
Florian Dorfler
Stelian Coros
Andreas Krause
OffRL
33
0
0
03 Jun 2024
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion
Tejas Pagare
Vivek Borkar
Konstantin Avrachenkov
29
4
0
07 Apr 2023
Reducing Blackwell and Average Optimality to Discounted MDPs via the
  Blackwell Discount Factor
Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor
Julien Grand-Clément
Marko Petrik
33
13
0
31 Jan 2023
Average-Reward Reinforcement Learning with Trust Region Methods
Average-Reward Reinforcement Learning with Trust Region Methods
Xiaoteng Ma
Xiao-Jing Tang
Li Xia
Jun Yang
Qianchuan Zhao
21
16
0
07 Jun 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
340
1,960
0
04 May 2020
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
90
146
0
04 May 2020
Model-free Reinforcement Learning in Infinite-horizon Average-reward
  Markov Decision Processes
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
99
0
15 Oct 2019
A Batch, Off-Policy, Actor-Critic Algorithm for Optimizing the Average
  Reward
A Batch, Off-Policy, Actor-Critic Algorithm for Optimizing the Average Reward
S. Murphy
Yanzhen Deng
Eric B. Laber
H. Maei
R. Sutton
K. Witkiewitz
OffRL
33
22
0
18 Jul 2016
1