A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints

AAAI Conference on Artificial Intelligence (AAAI), 2020
23 September 2020
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo

Papers citing "A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints"

32 / 32 papers shown
Near-Optimal Sample Complexity Bounds for Constrained Average-Reward MDPs
Yukuan Wei
Xudong Li
Lin F. Yang
192
0
0
20 Sep 2025
Solving Finite-Horizon MDPs via Low-Rank Tensors
Sergio Rozada
Jose Luis Orejuela
Antonio G. Marques
310
1
0
17 Jan 2025
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation
International Conference on Machine Learning (ICML), 2024
Juntao Dai
Yaodong Yang
Qian Zheng
Gang Pan
OffRL
340
3
0
15 Dec 2024
Capacity-Aware Planning and Scheduling in Budget-Constrained Multi-Agent MDPs: A Meta-RL Approach
IEEE Robotics and Automation Letters (RA-L), 2024
Manav Vora
Ilan Shomorony
Melkior Ornik
216
0
0
28 Oct 2024
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
Bo Yue
Jian Li
Guiliang Liu
479
3
0
24 Sep 2024
Structured Reinforcement Learning for Media Streaming at the Wireless Edge
ACM International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc), 2024
Archana Bura
Sarat Chandra Bobbili
Shreyas Rameshkumar
Desik Rengarajan
D. Kalathil
S. Shakkottai
343
2
0
10 Apr 2024
POLICEd RL: Learning Closed-Loop Robot Control Policies with Provable Satisfaction of Hard Constraints
Jean-Baptiste Bouvier
Kartik Nagpal
Negar Mehr
368
5
0
20 Mar 2024
Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources
Rohan Deb
Aadirupa Saha
248
0
0
28 Dec 2023
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
AAAI Conference on Artificial Intelligence (AAAI), 2023
Shu-Fan Wang
Efstathia Soufleri
Jian Li
489
10
0
16 Dec 2023
Anytime-Constrained Reinforcement Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Jeremy McMahan
Xiaojin Zhu
343
9
0
09 Nov 2023
Learning to Make Adherence-Aware Advice
International Conference on Learning Representations (ICLR), 2023
Guanting Chen
Xiaocheng Li
Chunlin Sun
Hanzhao Wang
279
15
0
01 Oct 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Neural Information Processing Systems (NeurIPS), 2023
Dongsheng Ding
Chen-Yu Wei
Jianchao Tan
Alejandro Ribeiro
417
31
0
20 Jun 2023
Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints
International Conference on Machine Learning (ICML), 2023
Donghao Li
Ruiquan Huang
Cong Shen
Jing Yang
315
4
0
09 Jun 2023
Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning
Conference on Learning for Dynamics & Control (L4DC), 2023
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
Mihailo R. Jovanović
OffRL
397
15
0
31 May 2023
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs
International Conference on Learning Representations (ICLR), 2023
Kaixuan Ji
Qingyue Zhao
Jiafan He
Weitong Zhang
Q. Gu
324
5
0
15 May 2023
Long-Term Fairness with Unknown Dynamics
Neural Information Processing Systems (NeurIPS), 2023
Tongxin Yin
Reilly P. Raab
M. Liu
Yang Liu
FaML
316
29
0
19 Apr 2023
Provably Safe Reinforcement Learning with Step-wise Violation Constraints
Neural Information Processing Systems (NeurIPS), 2023
Nuoya Xiong
Yihan Du
Longbo Huang
513
13
0
13 Feb 2023
A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints
International Conference on Machine Learning (ICML), 2023
Ming Shi
Yitao Liang
Ness B. Shroff
221
17
0
08 Feb 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
243
6
0
27 Jan 2023
Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-free RL
International Conference on Learning Representations (ICLR), 2022
Ruiquan Huang
J. Yang
Yingbin Liang
OffRL
339
9
0
28 Jun 2022
Provably Efficient Model-Free Constrained RL with Linear Function Approximation
Neural Information Processing Systems (NeurIPS), 2022
A. Ghosh
Xingyu Zhou
Ness B. Shroff
440
34
0
23 Jun 2022
Near-Optimal Sample Complexity Bounds for Constrained MDPs
Neural Information Processing Systems (NeurIPS), 2022
Sharan Vaswani
Lin F. Yang
Csaba Szepesvári
319
45
0
13 Jun 2022
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
675
318
0
20 May 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
International Conference on Machine Learning (ICML), 2022
Liyu Chen
R. Jain
Haipeng Luo
325
33
0
31 Jan 2022
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Archana Bura
Aria HasanzadeZonuzy
D. Kalathil
S. Shakkottai
J. Chamberland
396
38
0
01 Dec 2021
Model-Free Reinforcement Learning for Optimal Control of Markov Decision Processes Under Signal Temporal Logic Specifications
IEEE Conference on Decision and Control (CDC), 2021
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
193
15
0
27 Sep 2021
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits
Efstathia Soufleri
Jian Li
Rahul Singh
274
4
0
20 Sep 2021
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
514
68
0
13 Sep 2021
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
471
17
0
12 Sep 2021
Markov Decision Processes with Long-Term Average Constraints
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
222
7
0
12 Jun 2021
Safe Reinforcement Learning with Linear Function Approximation
International Conference on Machine Learning (ICML), 2021
Sanae Amani
Christos Thrampoulidis
Lin F. Yang
225
42
0
11 Jun 2021
A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes
Honghao Wei
Xin Liu
Lei Ying
298
26
0
03 Jun 2021