ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.05800
  4. Cited By
Stochastic first-order methods for average-reward Markov decision
  processes

Stochastic first-order methods for average-reward Markov decision processes

11 May 2022
Tianjiao Li
Feiyang Wu
Guanghui Lan
ArXivPDFHTML

Papers citing "Stochastic first-order methods for average-reward Markov decision processes"

12 / 12 papers shown
Title
Towards Optimal Offline Reinforcement Learning
Towards Optimal Offline Reinforcement Learning
Mengmeng Li
Daniel Kuhn
Tobias Sutter
OffRL
53
0
0
15 Mar 2025
Finding good policies in average-reward Markov Decision Processes
  without prior knowledge
Finding good policies in average-reward Markov Decision Processes without prior knowledge
Adrienne Tuynman
Rémy Degenne
Emilie Kaufmann
23
2
0
27 May 2024
Provable Policy Gradient Methods for Average-Reward Markov Potential
  Games
Provable Policy Gradient Methods for Average-Reward Markov Potential Games
Min Cheng
Ruida Zhou
P. R. Kumar
Chao Tian
49
2
0
09 Mar 2024
Infer and Adapt: Bipedal Locomotion Reward Learning from Demonstrations
  via Inverse Reinforcement Learning
Infer and Adapt: Bipedal Locomotion Reward Learning from Demonstrations via Inverse Reinforcement Learning
Chao Liu
Zhaoyuan Gu
Hanran Wu
Deniz Irem Erus
Ye Zhao
34
6
0
28 Sep 2023
Accelerated stochastic approximation with state-dependent noise
Accelerated stochastic approximation with state-dependent noise
Sasila Ilandarideva
A. Juditsky
Guanghui Lan
Tianjiao Li
17
8
0
04 Jul 2023
Sharper Model-free Reinforcement Learning for Average-reward Markov
  Decision Processes
Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes
Zihan Zhang
Qiaomin Xie
OffRL
21
16
0
28 Jun 2023
Langevin Thompson Sampling with Logarithmic Communication: Bandits and
  Reinforcement Learning
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning
Amin Karbasi
Nikki Lijing Kuang
Yi-An Ma
Siddharth Mitra
OffRL
17
5
0
15 Jun 2023
Inverse Reinforcement Learning with the Average Reward Criterion
Inverse Reinforcement Learning with the Average Reward Criterion
Feiyang Wu
Jingyang Ke
Anqi Wu
21
9
0
24 May 2023
Model-Free Robust Average-Reward Reinforcement Learning
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George K. Atia
Ashley Prater-Bennette
Shaofeng Zou
16
9
0
17 May 2023
Policy Mirror Descent Inherently Explores Action Space
Policy Mirror Descent Inherently Explores Action Space
Yan Li
Guanghui Lan
OffRL
51
8
0
08 Mar 2023
Policy Mirror Descent for Reinforcement Learning: Linear Convergence,
  New Sampling Complexity, and Generalized Problem Classes
Policy Mirror Descent for Reinforcement Learning: Linear Convergence, New Sampling Complexity, and Generalized Problem Classes
Guanghui Lan
87
136
0
30 Jan 2021
Off-Policy Actor-Critic
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
158
221
0
22 May 2012
1