ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.10528
  4. Cited By
Constrained Policy Optimization

Constrained Policy Optimization

30 May 2017
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
ArXivPDFHTML

Papers citing "Constrained Policy Optimization"

50 / 304 papers shown
Title
Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate
Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate
Yifan Lin
Yuhao Wang
Enlu Zhou
78
0
0
01 Mar 2024
Enhancing Reinforcement Learning Agents with Local Guides
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
31
3
0
21 Feb 2024
Performance Improvement Bounds for Lipschitz Configurable Markov
  Decision Processes
Performance Improvement Bounds for Lipschitz Configurable Markov Decision Processes
Alberto Maria Metelli
23
0
0
21 Feb 2024
Leveraging Approximate Model-based Shielding for Probabilistic Safety
  Guarantees in Continuous Environments
Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments
Alexander W. Goodall
Francesco Belardinelli
OffRL
33
1
0
01 Feb 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with
  Uniform PAC Guarantees
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
Tadashi Kozuno
Masahiro Kato
Yuki Ichihara
Soichiro Nishimori
Akiyoshi Sannai
Sho Sonoda
Wataru Kumagai
Yutaka Matsuo
44
2
0
31 Jan 2024
Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs
Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs
Michael Gimelfarb
Ayal Taitler
Scott Sanner
31
0
0
20 Jan 2024
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and
  Efficient Autonomous Driving
HAIM-DRL: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving
Zilin Huang
Zihao Sheng
Chengyuan Ma
Sikai Chen
22
29
0
06 Jan 2024
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation
  Allocation Approach for Recommender Systems
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems
Jiahong Zhou
Shunhui Mao
Guoliang Yang
Bo Tang
Qianlong Xie
Lebin Lin
Xingxing Wang
Dong Wang
37
7
0
27 Dec 2023
Adaptive trajectory-constrained exploration strategy for deep
  reinforcement learning
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
41
3
0
27 Dec 2023
Conservative Exploration for Policy Optimization via Off-Policy Policy
  Evaluation
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
28
0
0
24 Dec 2023
Multi-Objective Reinforcement Learning-based Approach for Pressurized
  Water Reactor Optimization
Multi-Objective Reinforcement Learning-based Approach for Pressurized Water Reactor Optimization
Paul Seurin
K. Shirvan
24
10
0
15 Dec 2023
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement
  Learning
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
Dohyeong Kim
Songhwai Oh
22
19
0
01 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region
  Conditional Value at Risk
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
29
19
0
01 Dec 2023
A safe exploration approach to constrained Markov decision processes
A safe exploration approach to constrained Markov decision processes
Tingting Ni
Maryam Kamgarpour
46
3
0
01 Dec 2023
Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement
  Learning with Sub-optimal Demonstrations
Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations
Lu Li
Yuxin Pan
Ruobing Chen
Jie Liu
Zilin Wang
Yu Liu
Zhiheng Li
50
0
0
13 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
50
48
0
06 Oct 2023
Provably Efficient Exploration in Constrained Reinforcement
  Learning:Posterior Sampling Is All You Need
Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
39
0
0
27 Sep 2023
Learning to Recover for Safe Reinforcement Learning
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
36
0
0
21 Sep 2023
Reinforcement Learning by Guided Safe Exploration
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
34
5
0
26 Jul 2023
Probabilistic Constrained Reinforcement Learning with Formal
  Interpretability
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
Yanran Wang
Qiuchen Qian
David E. Boyle
21
4
0
13 Jul 2023
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Ruiwen Zhou
Minghuan Liu
Kan Ren
Xufang Luo
Weinan Zhang
Dongsheng Li
27
2
0
02 Jul 2023
Generalizable Resource Scaling of 5G Slices using Constrained
  Reinforcement Learning
Generalizable Resource Scaling of 5G Slices using Constrained Reinforcement Learning
Muhammad Sulaiman
Mahdieh Ahmadi
M. A. Salahuddin
R. Boutaba
A. Saleh
45
6
0
15 Jun 2023
Identifiability and Generalizability in Constrained Inverse
  Reinforcement Learning
Identifiability and Generalizability in Constrained Inverse Reinforcement Learning
Andreas Schlaginhaufen
Maryam Kamgarpour
29
10
0
01 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
41
11
0
01 Jun 2023
Learning for Edge-Weighted Online Bipartite Matching with Robustness
  Guarantees
Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees
Pengfei Li
Jianyi Yang
Shaolei Ren
OffRL
27
4
0
31 May 2023
On the Value of Myopic Behavior in Policy Reuse
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
41
1
0
28 May 2023
Constrained Proximal Policy Optimization
Constrained Proximal Policy Optimization
Chengbin Xuan
Feng Zhang
Faliang Yin
H. Lam
26
0
0
23 May 2023
Constrained Reinforcement Learning for Dynamic Material Handling
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
24
0
0
23 May 2023
Reinforcement Learning for Safe Robot Control using Control Lyapunov
  Barrier Functions
Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Desong Du
Shao-Fu Han
Naiming Qi
Haitham Bou-Ammar
Jun Wang
Wei Pan
42
15
0
16 May 2023
Semi-Infinitely Constrained Markov Decision Processes and Efficient
  Reinforcement Learning
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning
Liangyu Zhang
Yang Peng
Wenhao Yang
Zhihua Zhang
21
1
0
29 Apr 2023
Approximate Shielding of Atari Agents for Safe Exploration
Approximate Shielding of Atari Agents for Safe Exploration
Alexander W. Goodall
Francesco Belardinelli
27
2
0
21 Apr 2023
Evolving Constrained Reinforcement Learning Policy
Evolving Constrained Reinforcement Learning Policy
Chengpeng Hu
Jiyuan Pei
Jialin Liu
Xinghu Yao
18
1
0
19 Apr 2023
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards
  and Ethical Behavior in the MACHIAVELLI Benchmark
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark
Alexander Pan
Chan Jun Shern
Andy Zou
Nathaniel Li
Steven Basart
Thomas Woodside
Jonathan Ng
Hanlin Zhang
Scott Emmons
Dan Hendrycks
37
127
0
06 Apr 2023
When Learning Is Out of Reach, Reset: Generalization in Autonomous
  Visuomotor Reinforcement Learning
When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning
Zichen Zhang
Luca Weihs
OffRL
29
5
0
30 Mar 2023
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic
  Environments
Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments
Hongyi Chen
Changliu Liu
OffRL
27
14
0
24 Mar 2023
Motion Planning for Autonomous Driving: The State of the Art and Future
  Perspectives
Motion Planning for Autonomous Driving: The State of the Art and Future Perspectives
Siyu Teng
Xuemin Hu
Peng Deng
Bai Li
Yuchen Li
...
Yunfeng Ai
Lingxi Li
Zhe Xuanyuan
F. Zhu
Long Chen
45
336
0
17 Mar 2023
A Multiplicative Value Function for Safe and Efficient Reinforcement
  Learning
A Multiplicative Value Function for Safe and Efficient Reinforcement Learning
Nick Bührer
Zhejun Zhang
Alexander Liniger
Feng Yu
Luc Van Gool
29
1
0
07 Mar 2023
Constrained Reinforcement Learning and Formal Verification for Safe
  Colonoscopy Navigation
Constrained Reinforcement Learning and Formal Verification for Safe Colonoscopy Navigation
Davide Corsi
Luca Marzari
Ameya Pore
Alessandro Farinelli
A. Casals
Paolo Fiorini
Diego DallÁlba
27
9
0
06 Mar 2023
Guarded Policy Optimization with Imperfect Online Demonstrations
Guarded Policy Optimization with Imperfect Online Demonstrations
Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
OffRL
53
10
0
03 Mar 2023
Model-based Constrained MDP for Budget Allocation in Sequential
  Incentive Marketing
Model-based Constrained MDP for Budget Allocation in Sequential Incentive Marketing
Shuai Xiao
Le Guo
Zaifan Jiang
Lei Lv
Yuanbo Chen
Jun Zhu
Shuang Yang
30
21
0
02 Mar 2023
Efficient Exploration Using Extra Safety Budget in Constrained Policy
  Optimization
Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization
Haotian Xu
Shengjie Wang
Zhaolei Wang
Yunzhe Zhang
Qing Zhuo
Yang Gao
Tao Zhang
18
0
0
28 Feb 2023
A Human-Centered Safe Robot Reinforcement Learning Framework with
  Interactive Behaviors
A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors
Shangding Gu
Alap Kshirsagar
Yali Du
Guang Chen
Jan Peters
Alois C. Knoll
39
14
0
25 Feb 2023
Behavior Proximal Policy Optimization
Behavior Proximal Policy Optimization
Zifeng Zhuang
Kun Lei
Jinxin Liu
Donglin Wang
Yilang Guo
OffRL
32
34
0
22 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
55
0
0
04 Feb 2023
Efficient Gradient Approximation Method for Constrained Bilevel
  Optimization
Efficient Gradient Approximation Method for Constrained Bilevel Optimization
Siyuan Xu
Minghui Zhu
36
20
0
03 Feb 2023
Distributional constrained reinforcement learning for supply chain
  optimization
Distributional constrained reinforcement learning for supply chain optimization
J. Berm\údez
Antonio del Rio-Chanona
Calvin Tsay
26
5
0
03 Feb 2023
Imitating careful experts to avoid catastrophic events
Imitating careful experts to avoid catastrophic events
J.R.P. Hanslope
Laurence Aitchison
OffRL
38
0
0
02 Feb 2023
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri
R. Jain
Haipeng Luo
29
2
0
02 Feb 2023
Optimal Transport Perturbations for Safe Reinforcement Learning with
  Robustness Guarantees
Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees
James Queeney
E. C. Ozcan
I. Paschalidis
Christos G. Cassandras
OOD
OffRL
36
5
0
31 Jan 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe
  Reinforcement Learning
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
James Queeney
M. Benosman
OOD
OffRL
43
5
0
30 Jan 2023
Previous
1234567
Next