A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints

AAAI Conference on Artificial Intelligence (AAAI), 2020

23 September 2020

Papers citing "A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints"

32 / 32 papers shown

Near-Optimal Sample Complexity Bounds for Constrained Average-Reward MDPs

Yukuan Wei

Xudong Li

Lin F. Yang

192

20 Sep 2025

Solving Finite-Horizon MDPs via Low-Rank Tensors

Sergio Rozada

Jose Luis Orejuela

Antonio G. Marques

310

17 Jan 2025

Safe Reinforcement Learning using Finite-Horizon Gradient-based EstimationInternational Conference on Machine Learning (ICML), 2024

340

15 Dec 2024

Capacity-Aware Planning and Scheduling in Budget-Constrained Multi-Agent MDPs: A Meta-RL ApproachIEEE Robotics and Automation Letters (RA-L), 2024

Manav Vora

Ilan Shomorony

Melkior Ornik

216

28 Oct 2024

Provably Efficient Exploration in Inverse Constrained Reinforcement Learning

Bo Yue

Jian Li

Guiliang Liu

479

24 Sep 2024

Structured Reinforcement Learning for Media Streaming at the Wireless EdgeACM Interational Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc), 2024

Archana Bura

Sarat Chandra Bobbili

343

10 Apr 2024

POLICEd RL: Learning Closed-Loop Robot Control Policies with Provable Satisfaction of Hard Constraints

Jean-Baptiste Bouvier

Kartik Nagpal

Negar Mehr

368

20 Mar 2024

Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources

Rohan Deb

Aadirupa Saha

248

28 Dec 2023

Online Restless Multi-Armed Bandits with Long-Term Fairness ConstraintsAAAI Conference on Artificial Intelligence (AAAI), 2023

Shu-Fan Wang

Efstathia Soufleri

Jian Li

489

16 Dec 2023

Anytime-Constrained Reinforcement LearningInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Jeremy McMahan

Xiaojin Zhu

343

09 Nov 2023

Learning to Make Adherence-Aware AdviceInternational Conference on Learning Representations (ICLR), 2023

Guanting Chen

Xiaocheng Li

Chunlin Sun

Hanzhao Wang

279

01 Oct 2023

Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPsNeural Information Processing Systems (NeurIPS), 2023

Dongsheng Ding

Chen-Yu Wei

Jianchao Tan

Alejandro Ribeiro

417

20 Jun 2023

Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise ConstraintsInternational Conference on Machine Learning (ICML), 2023

315

09 Jun 2023

Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement LearningConference on Learning for Dynamics & Control (L4DC), 2023

397

31 May 2023

Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPsInternational Conference on Learning Representations (ICLR), 2023

324

15 May 2023

Long-Term Fairness with Unknown DynamicsNeural Information Processing Systems (NeurIPS), 2023

Yang Liu

316

19 Apr 2023

Provably Safe Reinforcement Learning with Step-wise Violation ConstraintsNeural Information Processing Systems (NeurIPS), 2023

Nuoya Xiong

Yihan Du

Longbo Huang

513

13 Feb 2023

A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard ConstraintsInternational Conference on Machine Learning (ICML), 2023

Ming Shi

Yitao Liang

Ness B. Shroff

221

08 Feb 2023

Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation

K. C. Kalagarla

Rahul Jain

Pierluigi Nuzzo

243

27 Jan 2023

Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-free RLInternational Conference on Learning Representations (ICLR), 2022

339

28 Jun 2022

Provably Efficient Model-Free Constrained RL with Linear Function ApproximationNeural Information Processing Systems (NeurIPS), 2022

A. Ghosh

Xingyu Zhou

Ness B. Shroff

440

23 Jun 2022

Near-Optimal Sample Complexity Bounds for Constrained MDPsNeural Information Processing Systems (NeurIPS), 2022

Sharan Vaswani

Lin F. Yang

Csaba Szepesvári

319

13 Jun 2022

A Review of Safe Reinforcement Learning: Methods, Theory and Applications

Guang Chen

Jun Wang

675

318

20 May 2022

Learning Infinite-Horizon Average-Reward Markov Decision Processes with ConstraintsInternational Conference on Machine Learning (ICML), 2022

Liyu Chen

R. Jain

Haipeng Luo

325

31 Jan 2022

DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning

396

01 Dec 2021

Model-Free Reinforcement Learning for Optimal Control of MarkovDecision Processes Under Signal Temporal Logic SpecificationsIEEE Conference on Decision and Control (CDC), 2021

K. C. Kalagarla

Rahul Jain

Pierluigi Nuzzo

193

27 Sep 2021

Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits

Efstathia Soufleri

Jian Li

Rahul Singh

274

20 Sep 2021

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach

514

13 Sep 2021

Concave Utility Reinforcement Learning with Zero-Constraint Violations

Mridul Agarwal

Qinbo Bai

Vaneet Aggarwal

471

12 Sep 2021

Markov Decision Processes with Long-Term Average Constraints

Mridul Agarwal

Qinbo Bai

Vaneet Aggarwal

222

12 Jun 2021

Safe Reinforcement Learning with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2021

Sanae Amani

Christos Thrampoulidis

Lin F. Yang

225

11 Jun 2021

A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes

Honghao Wei

Xin Liu

Lei Ying

298

03 Jun 2021