Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.11074
Cited By
Reward Constrained Policy Optimization
28 May 2018
Chen Tessler
D. Mankowitz
Shie Mannor
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Reward Constrained Policy Optimization"
50 / 128 papers shown
Title
Robust Planning for Autonomous Driving via Mixed Adversarial Diffusion Predictions
Albert Zhao
Stefano Soatto
DiffM
24
0
0
18 May 2025
Skill-based Safe Reinforcement Learning with Risk Planning
Hanping Zhang
Yuhong Guo
OffRL
OnRL
50
0
0
02 May 2025
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback
Siow Meng Low
Akshat Kumar
53
0
0
17 Apr 2025
Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints
Max Buckley
Konstantinos Papathanasiou
Andreas Spanopoulos
57
0
0
09 Mar 2025
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
Se-Wook Yoo
Seung-Woo Seo
58
0
0
30 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
96
0
0
31 Dec 2024
A learning-based approach to stochastic optimal control under reach-avoid constraint
Tingting Ni
Maryam Kamgarpour
85
0
0
21 Dec 2024
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
35
1
0
12 Oct 2024
GreenLight-Gym: Reinforcement learning benchmark environment for control of greenhouse production systems
Bart van Laatum
Eldert J. van Henten
Sjoerd Boersma
OffRL
74
0
0
06 Oct 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
Ruohong Liu
Yuxin Pan
Linjie Xu
Lei Song
Jiang Bian
Pengcheng You
Yize Chen
48
1
0
03 Oct 2024
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Toshinori Kitamura
Tadashi Kozuno
Wataru Kumagai
Kenta Hoshino
Y. Hosoe
Kazumi Kasaura
Masashi Hamaya
Paavo Parmas
Yutaka Matsuo
74
1
0
29 Aug 2024
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Milan Ganai
Sicun Gao
Sylvia Herbert
45
6
0
12 Jul 2024
FOSP: Fine-tuning Offline Safe Policy through World Models
Chenyang Cao
Yucheng Xin
Silang Wu
Longxiang He
Zichen Yan
Junbo Tan
Xueqian Wang
OffRL
69
0
0
06 Jul 2024
Learning Autonomous Race Driving with Action Mapping Reinforcement Learning
Yuanda Wang
Xin Yuan
Changyin Sun
47
1
0
21 Jun 2024
CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving
Jonathan Booher
Khashayar Rohanimanesh
Junhong Xu
Vladislav Isenbaev
Ashwin Balakrishna
Ishan Gupta
Wei Liu
Aleksandr Petiushko
OffRL
39
7
0
13 Jun 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Jiayang Song
Zhan Shu
Lei Ma
49
1
0
06 Jun 2024
Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding
Daniel Bethell
Simos Gerasimou
R. Calinescu
Calum Imrie
OffRL
OnRL
41
0
0
28 May 2024
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Alois Knoll
Ming Jin
42
1
0
26 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
45
3
0
20 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Kyle Stachowicz
Sergey Levine
22
6
0
07 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
48
4
0
02 May 2024
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
43
0
0
25 Apr 2024
Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints
Jiping Luo
Nikolaos Pappas
27
7
0
25 Mar 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
Tadashi Kozuno
Masahiro Kato
Yuki Ichihara
Soichiro Nishimori
Akiyoshi Sannai
Sho Sonoda
Wataru Kumagai
Yutaka Matsuo
44
2
0
31 Jan 2024
Agile But Safe: Learning Collision-Free High-Speed Legged Locomotion
Tairan He
Chong Zhang
Wenli Xiao
Guanqi He
Changliu Liu
Guanya Shi
47
59
0
31 Jan 2024
A Safe Reinforcement Learning Algorithm for Supervisory Control of Power Plants
Yixuan Sun
Sami Khairy
Richard B. Vilim
Rui Hu
Akshay J. Dave
29
2
0
23 Jan 2024
HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning
Hao Wang
Bo Tang
Chi Harold Liu
Shangqin Mao
Jiahong Zhou
Zipeng Dai
Yaqi Sun
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
41
3
0
29 Dec 2023
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems
Jiahong Zhou
Shunhui Mao
Guoliang Yang
Bo Tang
Qianlong Xie
Lebin Lin
Xingxing Wang
Dong Wang
37
7
0
27 Dec 2023
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
26
0
0
24 Dec 2023
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
Dohyeong Kim
Songhwai Oh
22
19
0
01 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
29
19
0
01 Dec 2023
A safe exploration approach to constrained Markov decision processes
Tingting Ni
Maryam Kamgarpour
46
3
0
01 Dec 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
50
48
0
06 Oct 2023
Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
39
0
0
27 Sep 2023
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
36
0
0
21 Sep 2023
On Reducing Undesirable Behavior in Deep Reinforcement Learning Models
Ophir M. Carmel
Guy Katz
40
0
0
06 Sep 2023
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
34
5
0
26 Jul 2023
Generalizable Resource Scaling of 5G Slices using Constrained Reinforcement Learning
Muhammad Sulaiman
Mahdieh Ahmadi
M. A. Salahuddin
R. Boutaba
A. Saleh
45
6
0
15 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
41
11
0
01 Jun 2023
C-MCTS: Safe Planning with Monte Carlo Tree Search
Dinesh Parthasarathy
G. Kontes
Axel Plinge
Christopher Mutschler
42
3
0
25 May 2023
Constrained Reinforcement Learning for Dynamic Material Handling
Chengpeng Hu
Ziming Wang
Jialin Liu
J. Wen
Bifei Mao
Xinghu Yao
24
0
0
23 May 2023
Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Desong Du
Shao-Fu Han
Naiming Qi
Haitham Bou-Ammar
Jun Wang
Wei Pan
42
15
0
16 May 2023
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning
Liangyu Zhang
Yang Peng
Wenhao Yang
Zhihua Zhang
21
1
0
29 Apr 2023
Evolving Constrained Reinforcement Learning Policy
Chengpeng Hu
Jiyuan Pei
Jialin Liu
Xinghu Yao
18
1
0
19 Apr 2023
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark
Alexander Pan
Chan Jun Shern
Andy Zou
Nathaniel Li
Steven Basart
Thomas Woodside
Jonathan Ng
Hanlin Zhang
Scott Emmons
Dan Hendrycks
37
127
0
06 Apr 2023
When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning
Zichen Zhang
Luca Weihs
OffRL
29
5
0
30 Mar 2023
Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization
E. Derman
Yevgeniy Men
M. Geist
Shie Mannor
45
1
0
12 Mar 2023
A Multiplicative Value Function for Safe and Efficient Reinforcement Learning
Nick Bührer
Zhejun Zhang
Alexander Liniger
Feng Yu
Luc Van Gool
29
1
0
07 Mar 2023
Two-Stage Constrained Actor-Critic for Short Video Recommendation
Qingpeng Cai
Zhenghai Xue
Chi Zhang
Wanqi Xue
Shuchang Liu
...
Tianyou Zuo
Wentao Xie
Dong Zheng
Peng Jiang
Kun Gai
OffRL
CML
27
44
0
03 Feb 2023
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints
Akhil Agnihotri
R. Jain
Haipeng Luo
29
2
0
02 Feb 2023
1
2
3
Next