ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.12083
  4. Cited By
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement
  Learning via Multi-Level Monte Carlo Actor-Critic

Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic

28 January 2023
Wesley A. Suttle
Amrit Singh Bedi
Bhrij Patel
Brian M. Sadler
Alec Koppel
Dinesh Manocha
ArXivPDFHTML

Papers citing "Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic"

10 / 10 papers shown
Title
Towards Global Optimality for Practical Average Reward Reinforcement
  Learning without Mixing Time Oracles
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles
Bhrij Patel
Wesley A. Suttle
Alec Koppel
Vaneet Aggarwal
Brian M. Sadler
Amrit Singh Bedi
Dinesh Manocha
26
1
0
18 Mar 2024
Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical
  Systems
Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems
Wesley A. Suttle
Vipul K. Sharma
K. Kosaraju
S. Sivaranjani
Ji Liu
Vijay Gupta
Brian M. Sadler
22
1
0
06 Mar 2024
Dynamic Byzantine-Robust Learning: Adapting to Switching Byzantine
  Workers
Dynamic Byzantine-Robust Learning: Adapting to Switching Byzantine Workers
Ron Dorfman
Naseem Yehya
Kfir Y. Levy
17
1
0
05 Feb 2024
Two-Timescale Critic-Actor for Average Reward MDPs with Function
  Approximation
Two-Timescale Critic-Actor for Average Reward MDPs with Function Approximation
Prashansa Panda
Shalabh Bhatnagar
8
1
0
02 Feb 2024
Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and
  Constrained Natural Actor-Critic Algorithms
Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic Algorithms
Prashansa Panda
Shalabh Bhatnagar
17
0
0
25 Oct 2023
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon
  Average Reward Markov Decision Processes
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
Qinbo Bai
Washim Uddin Mondal
Vaneet Aggarwal
11
9
0
05 Sep 2023
Adapting to Mixing Time in Stochastic Optimization with Markovian Data
Adapting to Mixing Time in Stochastic Optimization with Markovian Data
Ron Dorfman
Kfir Y. Levy
30
28
0
09 Feb 2022
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces
Amrit Singh Bedi
Souradip Chakraborty
Anjaly Parayil
Brian M. Sadler
Pratap Tokekar
Alec Koppel
36
17
0
28 Jan 2022
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
A Finite Time Analysis of Two Time-Scale Actor Critic Methods
Yue Wu
Weitong Zhang
Pan Xu
Quanquan Gu
88
145
0
04 May 2020
On the Sample Complexity of Actor-Critic Method for Reinforcement
  Learning with Function Approximation
On the Sample Complexity of Actor-Critic Method for Reinforcement Learning with Function Approximation
Harshat Kumar
Alec Koppel
Alejandro Ribeiro
99
78
0
18 Oct 2019
1