ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.09377
  4. Cited By
Constrained Upper Confidence Reinforcement Learning

Constrained Upper Confidence Reinforcement Learning

Conference on Learning for Dynamics & Control (L4DC), 2020
26 January 2020
Liyuan Zheng
Lillian J. Ratliff
ArXiv (abs)PDFHTML

Papers citing "Constrained Upper Confidence Reinforcement Learning"

40 / 40 papers shown
Provably Efficient Sample Complexity for Robust CMDP
Provably Efficient Sample Complexity for Robust CMDP
Sourav Ganguly
Arnob Ghosh
164
0
0
10 Nov 2025
Exchange Policy Optimization Algorithm for Semi-Infinite Safe Reinforcement Learning
Exchange Policy Optimization Algorithm for Semi-Infinite Safe Reinforcement Learning
Jiaming Zhang
Yujie Yang
Haoning Wang
Liping Zhang
Shengbo Eben Li
174
0
0
06 Nov 2025
Beyond Slater's Condition in Online CMDPs with Stochastic and Adversarial Constraints
Beyond Slater's Condition in Online CMDPs with Stochastic and Adversarial Constraints
Francesco Emanuele Stradi
Eleonora Fidelia Chiefari
Matteo Castiglioni
A. Marchesi
Nicola Gatti
192
0
0
24 Sep 2025
Near-Optimal Sample Complexity Bounds for Constrained Average-Reward MDPs
Near-Optimal Sample Complexity Bounds for Constrained Average-Reward MDPs
Yukuan Wei
Xudong Li
Lin F. Yang
189
0
0
20 Sep 2025
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
Sourav Ganguly
Arnob Ghosh
Kishan Panaganti
Adam Wierman
230
3
0
25 May 2025
Ensuring Safety in an Uncertain Environment: Constrained MDPs via Stochastic Thresholds
Ensuring Safety in an Uncertain Environment: Constrained MDPs via Stochastic Thresholds
Qian Zuo
Fengxiang He
371
0
0
07 Apr 2025
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
ActSafe: Active Exploration with Safety Constraints for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
434
16
0
12 Oct 2024
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph FormInternational Conference on Learning Representations (ICLR), 2024
Toshinori Kitamura
Tadashi Kozuno
Wataru Kumagai
Kenta Hoshino
Y. Hosoe
Kazumi Kasaura
Masashi Hamaya
Paavo Parmas
Yutaka Matsuo
744
8
0
29 Aug 2024
A Primal-Dual Online Learning Approach for Dynamic Pricing of
  Sequentially Displayed Complementary Items under Sale Constraints
A Primal-Dual Online Learning Approach for Dynamic Pricing of Sequentially Displayed Complementary Items under Sale Constraints
Francesco Emanuele Stradi
Filippo Cipriani
Lorenzo Ciampiconi
Marco Leonardi
A. Rozza
Nicola Gatti
200
1
0
08 Jul 2024
A safe exploration approach to constrained Markov decision processes
A safe exploration approach to constrained Markov decision processesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Tingting Ni
Maryam Kamgarpour
398
5
0
01 Dec 2023
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for
  Constrained MDPs
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPsNeural Information Processing Systems (NeurIPS), 2023
Dongsheng Ding
Chen-Yu Wei
Jianchao Tan
Alejandro Ribeiro
408
31
0
20 Jun 2023
Near-optimal Conservative Exploration in Reinforcement Learning under
  Episode-wise Constraints
Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise ConstraintsInternational Conference on Machine Learning (ICML), 2023
Donghao Li
Ruiquan Huang
Cong Shen
Jing Yang
314
4
0
09 Jun 2023
Semi-Infinitely Constrained Markov Decision Processes and Efficient
  Reinforcement Learning
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Liangyu Zhang
Yang Peng
Wenhao Yang
Zhihua Zhang
209
1
0
29 Apr 2023
Long-Term Fairness with Unknown Dynamics
Long-Term Fairness with Unknown DynamicsNeural Information Processing Systems (NeurIPS), 2023
Tongxin Yin
Reilly P. Raab
M. Liu
Yang Liu
FaML
315
29
0
19 Apr 2023
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint
  Violation
Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
226
6
0
27 Jan 2023
Provable Reset-free Reinforcement Learning by No-Regret Reduction
Provable Reset-free Reinforcement Learning by No-Regret ReductionInternational Conference on Machine Learning (ICML), 2023
Hoai-An Nguyen
Ching-An Cheng
OffRL
380
3
0
06 Jan 2023
An Empirical Evaluation of Posterior Sampling for Constrained
  Reinforcement Learning
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
205
1
0
08 Sep 2022
Safe Exploration Incurs Nearly No Additional Sample Complexity for
  Reward-free RL
Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-free RLInternational Conference on Learning Representations (ICLR), 2022
Ruiquan Huang
J. Yang
Yingbin Liang
OffRL
327
9
0
28 Jun 2022
Provably Efficient Model-Free Constrained RL with Linear Function
  Approximation
Provably Efficient Model-Free Constrained RL with Linear Function ApproximationNeural Information Processing Systems (NeurIPS), 2022
A. Ghosh
Xingyu Zhou
Ness B. Shroff
429
34
0
23 Jun 2022
Near-Optimal Sample Complexity Bounds for Constrained MDPs
Near-Optimal Sample Complexity Bounds for Constrained MDPsNeural Information Processing Systems (NeurIPS), 2022
Sharan Vaswani
Lin F. Yang
Csaba Szepesvári
317
45
0
13 Jun 2022
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
Dongsheng Ding
Jianchao Tan
Jiali Duan
Tamer Bacsar
Mihailo R. Jovanović
393
24
0
06 Jun 2022
Safe Reinforcement Learning for Legged Locomotion
Safe Reinforcement Learning for Legged LocomotionIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Tsung-Yen Yang
Tingnan Zhang
Linda Luu
Sehoon Ha
Jie Tan
Wenhao Yu
281
53
0
05 Mar 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with
  Constraints
Learning Infinite-Horizon Average-Reward Markov Decision Processes with ConstraintsInternational Conference on Machine Learning (ICML), 2022
Liyu Chen
R. Jain
Haipeng Luo
323
33
0
31 Jan 2022
Constraint Sampling Reinforcement Learning: Incorporating Expertise For
  Faster Learning
Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster LearningAAAI Conference on Artificial Intelligence (AAAI), 2021
Tong Mu
Georgios Theocharous
David Arbour
Emma Brunskill
205
6
0
30 Dec 2021
Recent Advances in Reinforcement Learning in Finance
Recent Advances in Reinforcement Learning in Finance
B. Hambly
Renyuan Xu
Huining Yang
OffRL
608
264
0
08 Dec 2021
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe
  Reinforcement Learning
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning
Archana Bura
Aria HasanzadeZonuzy
D. Kalathil
S. Shakkottai
J. Chamberland
388
37
0
01 Dec 2021
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
464
17
0
12 Sep 2021
Markov Decision Processes with Long-Term Average Constraints
Markov Decision Processes with Long-Term Average Constraints
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
220
7
0
12 Jun 2021
Safe Reinforcement Learning with Linear Function Approximation
Safe Reinforcement Learning with Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2021
Sanae Amani
Christos Thrampoulidis
Lin F. Yang
220
40
0
11 Jun 2021
Learning Policies with Zero or Bounded Constraint Violation for
  Constrained MDPs
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPsNeural Information Processing Systems (NeurIPS), 2021
Tao-Wen Liu
Ruida Zhou
D. Kalathil
P. R. Kumar
Chao Tian
437
99
0
04 Jun 2021
Safe Value Functions
Safe Value FunctionsIEEE Transactions on Automatic Control (IEEE TAC), 2021
P. Massiani
Steve Heim
Friedrich Solowjow
Sebastian Trimpe
382
18
0
25 May 2021
Online Selection of Diverse Committees
Online Selection of Diverse CommitteesInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Virginie Do
Jamal Atif
J. Lang
Nicolas Usunier
245
9
0
19 May 2021
A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with
  Constraints
A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with ConstraintsAAAI Conference on Artificial Intelligence (AAAI), 2020
K. C. Kalagarla
Rahul Jain
Pierluigi Nuzzo
230
56
0
23 Sep 2020
Learning with Safety Constraints: Sample Complexity of Reinforcement
  Learning for Constrained MDPs
Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPsAAAI Conference on Artificial Intelligence (AAAI), 2020
Aria HasanzadeZonuzy
Archana Bura
D. Kalathil
S. Shakkottai
551
46
0
01 Aug 2020
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff
  in Regret
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in RegretNeural Information Processing Systems (NeurIPS), 2020
Yingjie Fei
Zhuoran Yang
Yudong Chen
Zhaoran Wang
Qiaomin Xie
284
76
0
22 Jun 2020
Accelerating Safe Reinforcement Learning with Constraint-mismatched
  Policies
Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies
Tsung-Yen Yang
Justinian P. Rosca
Karthik Narasimhan
Peter J. Ramadge
318
20
0
20 Jun 2020
Constrained episodic reinforcement learning in concave-convex and
  knapsack settings
Constrained episodic reinforcement learning in concave-convex and knapsack settings
Kianté Brantley
Miroslav Dudík
Thodoris Lykouris
Sobhan Miryoosefi
Max Simchowitz
Aleksandrs Slivkins
Wen Sun
OffRL
224
56
0
09 Jun 2020
Exploration-Exploitation in Constrained MDPs
Exploration-Exploitation in Constrained MDPs
Yonathan Efroni
Shie Mannor
Matteo Pirotta
444
207
0
04 Mar 2020
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with
  Adversarial Loss
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial LossNeural Information Processing Systems (NeurIPS), 2020
Delin Qu
Xiaohan Wei
Zhuoran Yang
Jieping Ye
Zhaoran Wang
473
59
0
02 Mar 2020
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Provably Efficient Safe Exploration via Primal-Dual Policy OptimizationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
M. Jovanović
416
185
0
01 Mar 2020
1
Page 1 of 1