A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes

3 June 2021

Papers citing "A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes"

14 / 14 papers shown

Title
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form Toshinori Kitamura Tadashi Kozuno Wataru Kumagai Kenta Hoshino Y. Hosoe Kazumi Kasaura Masashi Hamaya Paavo Parmas Yutaka Matsuo 72 0 0 29 Aug 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees Toshinori Kitamura Tadashi Kozuno Masahiro Kato Yuki Ichihara Soichiro Nishimori Akiyoshi Sannai Sho Sonoda Wataru Kumagai Yutaka Matsuo 42 2 0 31 Jan 2024
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs Zihan Zhou Honghao Wei Lei Ying OffRL 40 1 0 27 Sep 2023
A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints Ming Shi Yitao Liang Ness B. Shroff 37 8 0 08 Feb 2023
Provable Reset-free Reinforcement Learning by No-Regret Reduction Hoai-An Nguyen Ching-An Cheng OffRL 20 2 0 06 Jan 2023
Safety-Constrained Policy Transfer with Successor Features Zeyu Feng Bowen Zhang Jianxin Bi Harold Soh 6 4 0 10 Nov 2022
Provably Efficient Model-Free Constrained RL with Linear Function Approximation A. Ghosh Xingyu Zhou Ness B. Shroff 64 23 0 23 Jun 2022
Near-Optimal Sample Complexity Bounds for Constrained MDPs Sharan Vaswani Lin F. Yang Csaba Szepesvári 32 32 0 13 Jun 2022
A Review of Safe Reinforcement Learning: Methods, Theory and Applications Shangding Gu Longyu Yang Yali Du Guang Chen Florian Walter Jun Wang Alois C. Knoll OffRL AI4TS 115 237 0 20 May 2022
On Kernelized Multi-Armed Bandits with Constraints Xingyu Zhou Bo Ji 13 29 0 29 Mar 2022
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach Qinbo Bai Amrit Singh Bedi Mridul Agarwal Alec Koppel Vaneet Aggarwal 107 56 0 13 Sep 2021
Concave Utility Reinforcement Learning with Zero-Constraint Violations Mridul Agarwal Qinbo Bai Vaneet Aggarwal 33 12 0 12 Sep 2021
Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation Daniel Vial Advait Parulekar Sanjay Shakkottai R. Srikant 37 15 0 04 May 2021
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes Chen-Yu Wei Mehdi Jafarnia-Jahromi Haipeng Luo Hiteshi Sharma R. Jain 107 99 0 15 Oct 2019