Exploration-Exploitation in Constrained MDPs


4 March 2020
Yonathan Efroni
Shie Mannor
Matteo Pirotta

Papers citing "Exploration-Exploitation in Constrained MDPs"

10 / 110 papers shown
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
D. A. Calian
Rae Jeong
Cosmin Paduraru
N. Heess
Sumanth Dathathri
Martin Riedmiller
Timothy A. Mann
21
11
0
20 Oct 2020
Balancing Constraints and Rewards with Meta-Gradient D4PG
D. A. Calian
D. Mankowitz
Tom Zahavy
Zhongwen Xu
Junhyuk Oh
Nir Levine
Timothy A. Mann
23
25
0
13 Oct 2020
A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
20
52
0
23 Sep 2020
Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs
Aria HasanzadeZonuzy
Archana Bura
D. Kalathil
S. Shakkottai
20
39
0
01 Aug 2020
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
Yingjie Fei
Zhuoran Yang
Yudong Chen
Zhaoran Wang
Qiaomin Xie
8
63
0
22 Jun 2020
Constrained episodic reinforcement learning in concave-convex and knapsack settings
Kianté Brantley
Miroslav Dudík
Thodoris Lykouris
Sobhan Miryoosefi
Max Simchowitz
Aleksandrs Slivkins
Wen Sun
OffRL
20
51
0
09 Jun 2020
Provably Efficient Model-Free Algorithm for MDPs with Peak Constraints
Qinbo Bai
Vaneet Aggarwal
Ather Gattami
6
7
0
11 Mar 2020
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss
Shuang Qiu
Xiaohan Wei
Zhuoran Yang
Jieping Ye
Zhaoran Wang
14
47
0
02 Mar 2020
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
M. Jovanović
12
159
0
01 Mar 2020
Learning in Markov Decision Processes under Constraints
Rahul Singh
Abhishek Gupta
Ness B. Shroff
33
27
0
27 Feb 2020