Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.00150
Cited By
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
31 January 2022
Liyu Chen
R. Jain
Haipeng Luo
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints"
19 / 19 papers shown
Title
Efficient Exploration in Average-Reward Constrained Reinforcement Learning: Achieving Near-Optimal Regret With Posterior Sampling
Danil Provodin
M. Kaptein
Mykola Pechenizkiy
41
0
0
29 May 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
51
6
0
16 Dec 2023
Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
33
0
0
27 Sep 2023
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Zihan Zhou
Honghao Wei
Lei Ying
OffRL
40
1
0
27 Sep 2023
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
Qinbo Bai
Washim Uddin Mondal
Vaneet Aggarwal
28
9
0
05 Sep 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
Kaipeng Zhang
Alejandro Ribeiro
40
19
0
20 Jun 2023
Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
Mihailo R. Jovanović
OffRL
32
11
0
31 May 2023
Online Resource Allocation in Episodic Markov Decision Processes
Duksang Lee
William Overman
Dabeen Lee
37
1
0
18 May 2023
Model-Free Robust Average-Reward Reinforcement Learning
Yue Wang
Alvaro Velasquez
George K. Atia
Ashley Prater-Bennette
Shaofeng Zou
32
9
0
17 May 2023
Graph Exploration for Effective Multi-agent Q-Learning
Ainur Zhaikhan
Ali H. Sayed
37
1
0
19 Apr 2023
Provably Efficient Model-Free Algorithms for Non-stationary CMDPs
Honghao Wei
A. Ghosh
Ness B. Shroff
Lei Ying
Xingyu Zhou
13
13
0
10 Mar 2023
Online Nonstochastic Control with Adversarial and Static Constraints
Xin Liu
Zixi Yang
Lei Ying
36
5
0
05 Feb 2023
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri
R. Jain
Haipeng Luo
21
2
0
02 Feb 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
26
6
0
27 Jan 2023
Robust Average-Reward Markov Decision Processes
Yue Wang
Alvaro Velasquez
George K. Atia
Ashley Prater-Bennette
Shaofeng Zou
33
11
0
02 Jan 2023
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
27
1
0
08 Sep 2022
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
33
12
0
12 Sep 2021
Learning in Markov Decision Processes under Constraints
Rahul Singh
Abhishek Gupta
Ness B. Shroff
35
27
0
27 Feb 2020
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei
Mehdi Jafarnia-Jahromi
Haipeng Luo
Hiteshi Sharma
R. Jain
107
99
0
15 Oct 2019
1