Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2009.11348
Cited By
A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints
AAAI Conference on Artificial Intelligence (AAAI), 2020
23 September 2020
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints"
32 / 32 papers shown
Near-Optimal Sample Complexity Bounds for Constrained Average-Reward MDPs
Yukuan Wei
Xudong Li
Lin F. Yang
189
0
0
20 Sep 2025
Solving Finite-Horizon MDPs via Low-Rank Tensors
Sergio Rozada
Jose Luis Orejuela
Antonio G. Marques
305
1
0
17 Jan 2025
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation
International Conference on Machine Learning (ICML), 2024
Juntao Dai
Yaodong Yang
Qian Zheng
Gang Pan
OffRL
339
3
0
15 Dec 2024
Capacity-Aware Planning and Scheduling in Budget-Constrained Multi-Agent MDPs: A Meta-RL Approach
IEEE Robotics and Automation Letters (RA-L), 2024
Manav Vora
Ilan Shomorony
Melkior Ornik
210
0
0
28 Oct 2024
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
Bo Yue
Jian Li
Guiliang Liu
477
3
0
24 Sep 2024
Structured Reinforcement Learning for Media Streaming at the Wireless Edge
ACM Interational Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc), 2024
Archana Bura
Sarat Chandra Bobbili
Shreyas Rameshkumar
Desik Rengarajan
D. Kalathil
S. Shakkottai
331
2
0
10 Apr 2024
POLICEd RL: Learning Closed-Loop Robot Control Policies with Provable Satisfaction of Hard Constraints
Jean-Baptiste Bouvier
Kartik Nagpal
Negar Mehr
351
5
0
20 Mar 2024
Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources
Rohan Deb
Aadirupa Saha
247
0
0
28 Dec 2023
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
AAAI Conference on Artificial Intelligence (AAAI), 2023
Shu-Fan Wang
Efstathia Soufleri
Jian Li
486
9
0
16 Dec 2023
Anytime-Constrained Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Jeremy McMahan
Xiaojin Zhu
339
9
0
09 Nov 2023
Learning to Make Adherence-Aware Advice
International Conference on Learning Representations (ICLR), 2023
Guanting Chen
Xiaocheng Li
Chunlin Sun
Hanzhao Wang
279
15
0
01 Oct 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Neural Information Processing Systems (NeurIPS), 2023
Dongsheng Ding
Chen-Yu Wei
Jianchao Tan
Alejandro Ribeiro
401
31
0
20 Jun 2023
Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints
International Conference on Machine Learning (ICML), 2023
Donghao Li
Ruiquan Huang
Cong Shen
Jing Yang
305
4
0
09 Jun 2023
Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning
Conference on Learning for Dynamics & Control (L4DC), 2023
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
Mihailo R. Jovanović
OffRL
395
15
0
31 May 2023
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs
International Conference on Learning Representations (ICLR), 2023
Kaixuan Ji
Qingyue Zhao
Jiafan He
Weitong Zhang
Q. Gu
324
5
0
15 May 2023
Long-Term Fairness with Unknown Dynamics
Neural Information Processing Systems (NeurIPS), 2023
Tongxin Yin
Reilly P. Raab
M. Liu
Yang Liu
FaML
315
29
0
19 Apr 2023
Provably Safe Reinforcement Learning with Step-wise Violation Constraints
Neural Information Processing Systems (NeurIPS), 2023
Nuoya Xiong
Yihan Du
Longbo Huang
508
13
0
13 Feb 2023
A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints
International Conference on Machine Learning (ICML), 2023
Ming Shi
Yitao Liang
Ness B. Shroff
217
17
0
08 Feb 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
225
6
0
27 Jan 2023
Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-free RL
International Conference on Learning Representations (ICLR), 2022
Ruiquan Huang
J. Yang
Yingbin Liang
OffRL
327
9
0
28 Jun 2022
Provably Efficient Model-Free Constrained RL with Linear Function Approximation
Neural Information Processing Systems (NeurIPS), 2022
A. Ghosh
Xingyu Zhou
Ness B. Shroff
429
34
0
23 Jun 2022
Near-Optimal Sample Complexity Bounds for Constrained MDPs
Neural Information Processing Systems (NeurIPS), 2022
Sharan Vaswani
Lin F. Yang
Csaba Szepesvári
317
45
0
13 Jun 2022
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
650
316
0
20 May 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
International Conference on Machine Learning (ICML), 2022
Liyu Chen
R. Jain
Haipeng Luo
322
33
0
31 Jan 2022
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Archana Bura
Aria HasanzadeZonuzy
D. Kalathil
S. Shakkottai
J. Chamberland
386
37
0
01 Dec 2021
Model-Free Reinforcement Learning for Optimal Control of MarkovDecision Processes Under Signal Temporal Logic Specifications
IEEE Conference on Decision and Control (CDC), 2021
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
191
15
0
27 Sep 2021
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits
Efstathia Soufleri
Jian Li
Rahul Singh
264
4
0
20 Sep 2021
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
505
68
0
13 Sep 2021
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
458
17
0
12 Sep 2021
Markov Decision Processes with Long-Term Average Constraints
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
220
7
0
12 Jun 2021
Safe Reinforcement Learning with Linear Function Approximation
International Conference on Machine Learning (ICML), 2021
Sanae Amani
Christos Thrampoulidis
Lin F. Yang
219
40
0
11 Jun 2021
A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes
Honghao Wei
Xin Liu
Lei Ying
298
26
0
03 Jun 2021
1
Page 1 of 1