ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.11074
  4. Cited By
Reward Constrained Policy Optimization

Reward Constrained Policy Optimization

28 May 2018
Chen Tessler
D. Mankowitz
Shie Mannor
ArXivPDFHTML

Papers citing "Reward Constrained Policy Optimization"

50 / 128 papers shown
Title
Optimal Transport Perturbations for Safe Reinforcement Learning with
  Robustness Guarantees
Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees
James Queeney
E. C. Ozcan
I. Paschalidis
Christos G. Cassandras
OOD
OffRL
36
5
0
31 Jan 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe
  Reinforcement Learning
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
James Queeney
M. Benosman
OOD
OffRL
43
5
0
30 Jan 2023
Offline Policy Optimization in RL with Variance Regularizaton
Offline Policy Optimization in RL with Variance Regularizaton
Riashat Islam
Samarth Sinha
Homanga Bharadhwaj
Samin Yeasar Arnob
Zhuoran Yang
Animesh Garg
Zhaoran Wang
Lihong Li
Doina Precup
OffRL
28
0
0
29 Dec 2022
Safe Reinforcement Learning using Data-Driven Predictive Control
Safe Reinforcement Learning using Data-Driven Predictive Control
Mahmoud Selim
Amr Alanwar
M. El-Kharashi
Hazem Abbas
Karl H. Johansson
OffRL
30
3
0
20 Nov 2022
Policy Optimization with Advantage Regularization for Long-Term Fairness
  in Decision Systems
Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems
Eric Yang Yu
Zhizhen Qin
Min Kyung Lee
Sicun Gao
OffRL
37
15
0
22 Oct 2022
A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
Soumyajit Guin
S. Bhatnagar
31
8
0
10 Oct 2022
Policy Gradients for Probabilistic Constrained Reinforcement Learning
Policy Gradients for Probabilistic Constrained Reinforcement Learning
Weiqin Chen
D. Subramanian
Santiago Paternain
29
6
0
02 Oct 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities:
  Robustness, Safety, and Generalizability
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Bo Li
Ding Zhao
79
45
0
16 Sep 2022
Constrained Update Projection Approach to Safe Policy Optimization
Constrained Update Projection Approach to Safe Policy Optimization
Long Yang
Jiaming Ji
Juntao Dai
Linrui Zhang
Binbin Zhou
Pengfei Li
Yaodong Yang
Gang Pan
41
43
0
15 Sep 2022
A Risk-Sensitive Approach to Policy Optimization
A Risk-Sensitive Approach to Policy Optimization
Jared Markowitz
Ryan W. Gardner
Ashley J. Llorens
R. Arora
I-J. Wang
OffRL
36
6
0
19 Aug 2022
Reward Design For An Online Reinforcement Learning Algorithm Supporting
  Oral Self-Care
Reward Design For An Online Reinforcement Learning Algorithm Supporting Oral Self-Care
Anna L. Trella
Kelly W. Zhang
Inbal Nahum-Shani
Vivek Shetty
Finale Doshi-Velez
Susan Murphy
OnRL
24
19
0
15 Aug 2022
Learning to Solve Soft-Constrained Vehicle Routing Problems with
  Lagrangian Relaxation
Learning to Solve Soft-Constrained Vehicle Routing Problems with Lagrangian Relaxation
Qiaoyue Tang
Yangzhe Kong
Lemeng Pan
Choon-woo Lee
35
3
0
20 Jul 2022
Near-Optimal Sample Complexity Bounds for Constrained MDPs
Near-Optimal Sample Complexity Bounds for Constrained MDPs
Sharan Vaswani
Lin F. Yang
Csaba Szepesvári
35
32
0
13 Jun 2022
On the Robustness of Safe Reinforcement Learning under Observational
  Perturbations
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu
Zijian Guo
Zhepeng Cen
Huan Zhang
Jie Tan
Bo Li
Ding Zhao
OOD
OffRL
48
36
0
29 May 2022
Constrained Reinforcement Learning for Short Video Recommendation
Constrained Reinforcement Learning for Short Video Recommendation
Qingpeng Cai
Ruohan Zhan
Chi Zhang
Jie Zheng
Guangwei Ding
Pinghua Gong
Dong Zheng
Peng Jiang
33
6
0
26 May 2022
Penalized Proximal Policy Optimization for Safe Reinforcement Learning
Penalized Proximal Policy Optimization for Safe Reinforcement Learning
Linrui Zhang
Li Shen
Long Yang
Shi-Yong Chen
Bo Yuan
Xueqian Wang
Dacheng Tao
18
62
0
24 May 2022
A Review of Safe Reinforcement Learning: Methods, Theory and
  Applications
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
117
241
0
20 May 2022
Reachability Constrained Reinforcement Learning
Reachability Constrained Reinforcement Learning
Dongjie Yu
Haitong Ma
Sheng Li
Jianyu Chen
63
55
0
16 May 2022
Aligning to Social Norms and Values in Interactive Narratives
Aligning to Social Norms and Values in Interactive Narratives
Prithviraj Ammanabrolu
Liwei Jiang
Maarten Sap
Hannaneh Hajishirzi
Yejin Choi
AI4CE
30
47
0
04 May 2022
Road Traffic Law Adaptive Decision-making for Self-Driving Vehicles
Road Traffic Law Adaptive Decision-making for Self-Driving Vehicles
Jiaxin Liu
Wenhui Zhou
Hong Wang
Zhong Cao
Wen-Hui Yu
Cheng-Yu Zhao
Ding Zhao
Diange Yang
Jun Li
30
23
0
25 Apr 2022
Safe Reinforcement Learning Using Black-Box Reachability Analysis
Safe Reinforcement Learning Using Black-Box Reachability Analysis
Mahmoud Selim
Amr Alanwar
Shreyas Kousik
Grace Gao
Marco Pavone
Karl H. Johansson
29
32
0
15 Apr 2022
How to Learn from Risk: Explicit Risk-Utility Reinforcement Learning for
  Efficient and Safe Driving Strategies
How to Learn from Risk: Explicit Risk-Utility Reinforcement Learning for Efficient and Safe Driving Strategies
Lukas M. Schmidt
Sebastian Rietsch
Axel Plinge
Bjoern M. Eskofier
Christopher Mutschler
OffRL
35
5
0
16 Mar 2022
Safe Reinforcement Learning for Legged Locomotion
Safe Reinforcement Learning for Legged Locomotion
Tsung-Yen Yang
Tingnan Zhang
Linda Luu
Sehoon Ha
Jie Tan
Wenhao Yu
29
40
0
05 Mar 2022
Pareto Frontier Approximation Network (PA-Net) to Solve Bi-objective TSP
Pareto Frontier Approximation Network (PA-Net) to Solve Bi-objective TSP
Ishaan Mehta
Sharareh Taghipour
Sajad Saeedi
30
4
0
02 Mar 2022
A Globally Convergent Evolutionary Strategy for Stochastic Constrained
  Optimization with Applications to Reinforcement Learning
A Globally Convergent Evolutionary Strategy for Stochastic Constrained Optimization with Applications to Reinforcement Learning
Youssef Diouane
Aurelien Lucchi
Vihang Patil
29
3
0
21 Feb 2022
MuZero with Self-competition for Rate Control in VP9 Video Compression
MuZero with Self-competition for Rate Control in VP9 Video Compression
Amol Mandhane
A. Zhernov
Maribeth Rauh
Chenjie Gu
Miaosen Wang
...
Jackson Broshear
Julian Schrittwieser
Thomas Hubert
Oriol Vinyals
Timothy A. Mann
37
44
0
14 Feb 2022
Learning Infinite-Horizon Average-Reward Markov Decision Processes with
  Constraints
Learning Infinite-Horizon Average-Reward Markov Decision Processes with Constraints
Liyu Chen
R. Jain
Haipeng Luo
64
25
0
31 Jan 2022
Towards Safe Reinforcement Learning with a Safety Editor Policy
Towards Safe Reinforcement Learning with a Safety Editor Policy
Haonan Yu
Wei Xu
Haichao Zhang
OffRL
69
31
0
28 Jan 2022
Conservative Distributional Reinforcement Learning with Safety
  Constraints
Conservative Distributional Reinforcement Learning with Safety Constraints
Hengrui Zhang
Youfang Lin
Sheng Han
Shuo Wang
Kai Lv
OffRL
26
5
0
18 Jan 2022
SABLAS: Learning Safe Control for Black-box Dynamical Systems
SABLAS: Learning Safe Control for Black-box Dynamical Systems
Zengyi Qin
Dawei Sun
Chuchu Fan
28
43
0
06 Jan 2022
Safe Reinforcement Learning with Chance-constrained Model Predictive
  Control
Safe Reinforcement Learning with Chance-constrained Model Predictive Control
Samuel Pfrommer
Tanmay Gautam
Alec Zhou
Somayeh Sojoudi
21
24
0
27 Dec 2021
Model-Based Safe Reinforcement Learning with Time-Varying State and
  Control Constraints: An Application to Intelligent Vehicles
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles
Xinglong Zhang
Yaoqian Peng
Biao Luo
Wei Pan
Xin Xu
Haibin Xie
27
11
0
18 Dec 2021
Towards Disturbance-Free Visual Mobile Manipulation
Towards Disturbance-Free Visual Mobile Manipulation
Tianwei Ni
Kiana Ehsani
Luca Weihs
Jordi Salvador
28
9
0
17 Dec 2021
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement
  Learning
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
Yecheng Jason Ma
Andrew Shen
Osbert Bastani
Dinesh Jayaraman
18
25
0
14 Dec 2021
CLARA: A Constrained Reinforcement Learning Based Resource Allocation
  Framework for Network Slicing
CLARA: A Constrained Reinforcement Learning Based Resource Allocation Framework for Network Slicing
Yongshuai Liu
J. Ding
Zhi-Li Zhang
Xin Liu
25
19
0
16 Nov 2021
Safe Policy Optimization with Local Generalized Linear Function
  Approximations
Safe Policy Optimization with Local Generalized Linear Function Approximations
Akifumi Wachi
Yunyue Wei
Yanan Sui
OffRL
35
10
0
09 Nov 2021
Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic
  Algorithm for Constrained Markov Decision Processes
Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes
Sihan Zeng
Thinh T. Doan
Justin Romberg
102
17
0
21 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to
  Multiagent Domain
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
41
93
0
14 Sep 2021
Achieving Zero Constraint Violation for Constrained Reinforcement
  Learning via Primal-Dual Approach
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai
Amrit Singh Bedi
Mridul Agarwal
Alec Koppel
Vaneet Aggarwal
107
56
0
13 Sep 2021
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Concave Utility Reinforcement Learning with Zero-Constraint Violations
Mridul Agarwal
Qinbo Bai
Vaneet Aggarwal
38
12
0
12 Sep 2021
Controllable Summarization with Constrained Markov Decision Process
Controllable Summarization with Constrained Markov Decision Process
Hou Pong Chan
Lu Wang
Irwin King
207
21
0
07 Aug 2021
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Haoran Xu
Xianyuan Zhan
Xiangyu Zhu
OffRL
16
86
0
19 Jul 2021
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Sungryull Sohn
Sungtae Lee
Jongwook Choi
H. V. Seijen
Mehdi Fatemi
Honglak Lee
173
3
0
13 Jul 2021
A Simple Reward-free Approach to Constrained Reinforcement Learning
A Simple Reward-free Approach to Constrained Reinforcement Learning
Sobhan Miryoosefi
Chi Jin
16
29
0
12 Jul 2021
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of
  Sparse Reward Iterative Tasks
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks
Albert Wilcox
Ashwin Balakrishna
Brijen Thananjeyan
Joseph E. Gonzalez
Ken Goldberg
29
11
0
10 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real
  world: aligning domain-agnostic and domain-specific research
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
27
6
0
07 Jul 2021
Safe Reinforcement Learning Using Advantage-Based Intervention
Safe Reinforcement Learning Using Advantage-Based Intervention
Nolan Wagener
Byron Boots
Ching-An Cheng
34
52
0
16 Jun 2021
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
Yiming Zhang
Keith Ross
OffRL
41
41
0
14 Jun 2021
Learning Policies with Zero or Bounded Constraint Violation for
  Constrained MDPs
Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs
Tao-Wen Liu
Ruida Zhou
D. Kalathil
P. R. Kumar
Chao Tian
42
78
0
04 Jun 2021
DeepThermal: Combustion Optimization for Thermal Power Generating Units
  Using Offline Reinforcement Learning
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
Xianyuan Zhan
Haoran Xu
Yueying Zhang
Xiangyu Zhu
Honglei Yin
Yu Zheng
OffRL
AI4CE
42
68
0
23 Feb 2021
Previous
123
Next