ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2303.05733
  4. Cited By
Provably Efficient Model-Free Algorithms for Non-stationary CMDPs

Provably Efficient Model-Free Algorithms for Non-stationary CMDPs

10 March 2023
Honghao Wei
A. Ghosh
Ness B. Shroff
Lei Ying
Xingyu Zhou
ArXivPDFHTML

Papers citing "Provably Efficient Model-Free Algorithms for Non-stationary CMDPs"

14 / 14 papers shown
Title
Ensuring Safety in an Uncertain Environment: Constrained MDPs via Stochastic Thresholds
Ensuring Safety in an Uncertain Environment: Constrained MDPs via Stochastic Thresholds
Qian Zuo
Fengxiang He
26
0
0
07 Apr 2025
Optimal Strong Regret and Violation in Constrained MDPs via Policy
  Optimization
Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization
Francesco Emanuele Stradi
Matteo Castiglioni
A. Marchesi
Nicola Gatti
18
1
0
03 Oct 2024
Safe Reinforcement Learning for Constrained Markov Decision Processes
  with Stochastic Stopping Time
Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time
Abhijit Mazumdar
Rafał Wisniewski
Manuela L. Bujorianu
20
3
0
23 Mar 2024
Learning Adversarial MDPs with Stochastic Hard Constraints
Learning Adversarial MDPs with Stochastic Hard Constraints
Francesco Emanuele Stradi
Matteo Castiglioni
A. Marchesi
Nicola Gatti
26
4
0
06 Mar 2024
Safe Reinforcement Learning with Instantaneous Constraints: The Role of
  Aggressive Exploration
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration
Honghao Wei
Xin Liu
Lei Ying
32
1
0
22 Dec 2023
Constraint-Conditioned Policy Optimization for Versatile Safe
  Reinforcement Learning
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
Yi-Fan Yao
Zuxin Liu
Zhepeng Cen
Jiacheng Zhu
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
28
12
0
05 Oct 2023
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs
Zihan Zhou
Honghao Wei
Lei Ying
OffRL
40
1
0
27 Sep 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for
  Constrained MDPs
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Dongsheng Ding
Chen-Yu Wei
Kaipeng Zhang
Alejandro Ribeiro
43
19
0
20 Jun 2023
Online Resource Allocation in Episodic Markov Decision Processes
Online Resource Allocation in Episodic Markov Decision Processes
Duksang Lee
William Overman
Dabeen Lee
37
1
0
18 May 2023
Provably Efficient Model-Free Constrained RL with Linear Function
  Approximation
Provably Efficient Model-Free Constrained RL with Linear Function Approximation
A. Ghosh
Xingyu Zhou
Ness B. Shroff
64
23
0
23 Jun 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with
  Constraints
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen
R. Jain
Haipeng Luo
57
25
0
31 Jan 2022
Provably Efficient Reinforcement Learning with Linear Function
  Approximation Under Adaptivity Constraints
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
122
166
0
06 Jan 2021
Efficient Learning in Non-Stationary Linear Markov Decision Processes
Efficient Learning in Non-Stationary Linear Markov Decision Processes
Ahmed Touati
Pascal Vincent
37
29
0
24 Oct 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
165
1,632
0
02 Feb 2020
1