Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.00311
Cited By
Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs
1 August 2020
Aria HasanzadeZonuzy
Archana Bura
D. Kalathil
S. Shakkottai
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs"
27 / 27 papers shown
Title
Constrained Online Decision-Making: A Unified Framework
Haichen Hu
David Simchi-Levi
Navid Azizan
34
0
0
11 May 2025
Polynomial-Time Approximability of Constrained Reinforcement Learning
Jeremy McMahan
121
0
0
11 Feb 2025
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Toshinori Kitamura
Tadashi Kozuno
Wataru Kumagai
Kenta Hoshino
Y. Hosoe
Kazumi Kasaura
Masashi Hamaya
Paavo Parmas
Yutaka Matsuo
72
0
0
29 Aug 2024
Deterministic Policies for Constrained Reinforcement Learning in Polynomial-Time
Jeremy McMahan
31
2
0
23 May 2024
Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
Sihan Zeng
Thinh T. Doan
Justin Romberg
32
0
0
03 May 2024
Structured Reinforcement Learning for Media Streaming at the Wireless Edge
Archana Bura
Sarat Chandra Bobbili
Shreyas Rameshkumar
Desik Rengarajan
D. Kalathil
S. Shakkottai
26
0
0
10 Apr 2024
What Are the Odds? Improving the foundations of Statistical Model Checking
Tobias Meggendorfer
Maximilian Weininger
Patrick Wienhoft
39
4
0
08 Apr 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
Tadashi Kozuno
Masahiro Kato
Yuki Ichihara
Soichiro Nishimori
Akiyoshi Sannai
Sho Sonoda
Wataru Kumagai
Yutaka Matsuo
42
2
0
31 Jan 2024
Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic Algorithms
Prashansa Panda
Shalabh Bhatnagar
33
0
0
25 Oct 2023
Reinforcement Learning Under Probabilistic Spatio-Temporal Constraints with Time Windows
Xiaoshan Lin
Abbasali Koochakzadeh
Yasin Yazıcıoğlu
Derya Aksaray
18
1
0
29 Jul 2023
Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity
Runyu Zhang
Yang Hu
Na Li
38
5
0
20 Jun 2023
ROSARL: Reward-Only Safe Reinforcement Learning
Geraud Nangue Tasse
Tamlin Love
Mark W. Nemecek
Steven D. James
Benjamin Rosman
21
3
0
31 May 2023
A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints
Ming Shi
Yitao Liang
Ness B. Shroff
35
8
0
08 Feb 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
26
6
0
27 Jan 2023
Provable Reset-free Reinforcement Learning by No-Regret Reduction
Hoai-An Nguyen
Ching-An Cheng
OffRL
18
2
0
06 Jan 2023
Provable Safe Reinforcement Learning with Binary Feedback
Andrew Bennett
Dipendra Kumar Misra
Nathan Kallus
OffRL
33
4
0
26 Oct 2022
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments
Yixuan Wang
S. Zhan
Ruochen Jiao
Zhilu Wang
Wanxin Jin
Zhuoran Yang
Zhaoran Wang
Chao Huang
Qi Zhu
26
48
0
29 Sep 2022
Ablation Study of How Run Time Assurance Impacts the Training and Performance of Reinforcement Learning Agents
Nathaniel P. Hamilton
Kyle Dunlap
Taylor T. Johnson
Kerianne L. Hobbs
OffRL
19
8
0
08 Jul 2022
Reinforcement Learning with a Terminator
Guy Tennenholtz
Nadav Merlis
Lior Shani
Shie Mannor
Uri Shalit
Gal Chechik
Assaf Hallak
Gal Dalal
9
5
0
30 May 2022
Finding Safe Zones of policies Markov Decision Processes
Lee Cohen
Yishay Mansour
Michal Moshkovitz
19
1
0
23 Feb 2022
Reinforcement Learning with Almost Sure Constraints
Agustin Castellano
Hancheng Min
J. Bazerque
Enrique Mallada
13
15
0
09 Dec 2021
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Archana Bura
Aria HasanzadeZonuzy
D. Kalathil
S. Shakkottai
J. Chamberland
22
28
0
01 Dec 2021
RLOps: Development Life-cycle of Reinforcement Learning Aided Open RAN
Peizheng Li
Jonathan D. Thomas
Xiaoyang Wang
Ahmed Khalil
A. Ahmad
...
S. Kapoor
Arjun Parekh
A. Doufexi
Arman Shojaeifard
Robert Piechocki
AI4TS
14
37
0
12 Nov 2021
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits
Guojun Xiong
Jian Li
Rahul Singh
19
4
0
20 Sep 2021
Learning to Act Safely with Limited Exposure and Almost Sure Certainty
Agustin Castellano
Hancheng Min
J. Bazerque
Enrique Mallada
13
4
0
18 May 2021
A Meta Reinforcement Learning-based Approach for Self-Adaptive System
Mingyue Zhang
Jialong Li
Haiyan Zhao
Kenji Tei
S. Honiden
Zhi Jin
17
4
0
11 May 2021
Stochastic Linear Bandits with Protected Subspace
Advait Parulekar
Soumya Basu
Aditya Gopalan
Karthikeyan Shanmugam
Sanjay Shakkottai
71
2
0
02 Nov 2020
1