Exploration-Exploitation in Constrained MDPs

4 March 2020

Papers citing "Exploration-Exploitation in Constrained MDPs"

10 / 110 papers shown

Title
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification; D. Mankowitz, D. A. Calian, Rae Jeong, Cosmin Paduraru, N. Heess, Sumanth Dathathri, Martin Riedmiller, Timothy A. Mann; 21 11 0; 20 Oct 2020
Balancing Constraints and Rewards with Meta-Gradient D4PG; D. A. Calian, D. Mankowitz, Tom Zahavy, Zhongwen Xu, Junhyuk Oh, Nir Levine, Timothy A. Mann; 23 25 0; 13 Oct 2020
A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints; K. C. Kalagarla, Rahul Jain, Pierluigi Nuzzo; 20 52 0; 23 Sep 2020
Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs; Aria HasanzadeZonuzy, Archana Bura, D. Kalathil, S. Shakkottai; 20 39 0; 01 Aug 2020
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret; Yingjie Fei, Zhuoran Yang, Yudong Chen, Zhaoran Wang, Qiaomin Xie; 8 63 0; 22 Jun 2020
Constrained episodic reinforcement learning in concave-convex and knapsack settings; Kianté Brantley, Miroslav Dudík, Thodoris Lykouris, Sobhan Miryoosefi, Max Simchowitz, Aleksandrs Slivkins, Wen Sun; OffRL; 20 51 0; 09 Jun 2020
Provably Efficient Model-Free Algorithm for MDPs with Peak Constraints; Qinbo Bai, Vaneet Aggarwal, Ather Gattami; 6 7 0; 11 Mar 2020
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss; Shuang Qiu, Xiaohan Wei, Zhuoran Yang, Jieping Ye, Zhaoran Wang; 14 47 0; 02 Mar 2020
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization; Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, M. Jovanović; 12 159 0; 01 Mar 2020
Learning in Markov Decision Processes under Constraints; Rahul Singh, Abhishek Gupta, Ness B. Shroff; 33 27 0; 27 Feb 2020