Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.07708
Cited By
A Lyapunov-based Approach to Safe Reinforcement Learning
20 May 2018
Yinlam Chow
Ofir Nachum
Edgar A. Duénez-Guzmán
Mohammad Ghavamzadeh
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Lyapunov-based Approach to Safe Reinforcement Learning"
50 / 117 papers shown
Title
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Kehan Long
Jorge Cortés
Nikolay Atanasov
17
0
0
16 May 2025
Neural Lyapunov Function Approximation with Self-Supervised Reinforcement Learning
Luc McCutcheon
Bahman Gharesifard
Saber Fallah
53
0
0
19 Mar 2025
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Zheli Xiong
49
0
0
23 Feb 2025
Polynomial-Time Approximability of Constrained Reinforcement Learning
Jeremy McMahan
210
0
0
11 Feb 2025
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
56
5
0
29 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
96
0
0
31 Dec 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen
Shuze Liu
Shangtong Zhang
OffRL
183
1
0
08 Oct 2024
q-exponential family for policy optimization
Lingwei Zhu
Haseeb Shah
Han Wang
Yukie Nagai
Martha White
OffRL
78
0
0
14 Aug 2024
E
2
C
F
D
\mathrm{E^{2}CFD}
E
2
CFD
: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model
Zepeng Wang
Chao Ma
Linjiang Zhou
Libing Wu
Lei Yang
Xiaochuan Shi
Guojun Peng
OffRL
48
0
0
08 Jul 2024
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model
Zhehua Zhou
Xuan Xie
Jiayang Song
Zhan Shu
Lei Ma
49
1
0
06 Jun 2024
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Alois Knoll
Ming Jin
42
1
0
26 May 2024
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Minheng Xiao
Xian Yu
Lei Ying
42
2
0
23 May 2024
Constrained Reinforcement Learning Under Model Mismatch
Zhongchang Sun
Sihong He
Fei Miao
Shaofeng Zou
48
4
0
02 May 2024
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Ming Jin
Alois Knoll
63
9
0
02 May 2024
Myopically Verifiable Probabilistic Certificates for Safe Control and Learning
Zhuoyuan Wang
Haoming Jing
Christian Kurniawan
Albert Chern
Yorie Nakahira
41
1
0
23 Apr 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
26
0
0
24 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
29
19
0
01 Dec 2023
A safe exploration approach to constrained Markov decision processes
Tingting Ni
Maryam Kamgarpour
46
3
0
01 Dec 2023
Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning
F. D. Lellis
M. Coraggio
G. Russo
Mirco Musolesi
Mario di Bernardo
OffRL
35
4
0
16 Nov 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
50
48
0
06 Oct 2023
Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need
Danil Provodin
Pratik Gajane
Mykola Pechenizkiy
M. Kaptein
39
0
0
27 Sep 2023
Shielded Reinforcement Learning for Hybrid Systems
Asger Horn Brorholt
P. G. Jensen
Kim G. Larsen
Florian Lorber
Christian Schilling
23
4
0
28 Aug 2023
Safe & Accurate at Speed with Tendons: A Robot Arm for Exploring Dynamic Motion
Simon Guist
Jan Schneider
Hao Ma
Tianyu Cui
V. Berenz
...
Felix Gruninger
M. Muhlebach
J. Fiene
Bernhard Schölkopf
Le Chen
52
4
0
05 Jul 2023
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Ruiwen Zhou
Minghuan Liu
Kan Ren
Xufang Luo
Weinan Zhang
Dongsheng Li
27
2
0
02 Jul 2023
Identifiability and Generalizability in Constrained Inverse Reinforcement Learning
Andreas Schlaginhaufen
Maryam Kamgarpour
29
10
0
01 Jun 2023
Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Desong Du
Shao-Fu Han
Naiming Qi
Haitham Bou-Ammar
Jun Wang
Wei Pan
42
15
0
16 May 2023
Neural Operators of Backstepping Controller and Observer Gain Functions for Reaction-Diffusion PDEs
Miroslav Krstic
Luke Bhan
Yuanyuan Shi
54
28
0
18 Mar 2023
Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization
Haotian Xu
Shengjie Wang
Zhaolei Wang
Yunzhe Zhang
Qing Zhuo
Yang Gao
Tao Zhang
18
0
0
28 Feb 2023
A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors
Shangding Gu
Alap Kshirsagar
Yali Du
Guang Chen
Jan Peters
Alois C. Knoll
39
14
0
25 Feb 2023
Active Uncertainty Reduction for Safe and Efficient Interaction Planning: A Shielding-Aware Dual Control Approach
Haimin Hu
David Isele
S. Bae
J. F. Fisac
35
17
0
01 Feb 2023
A Policy Optimization Method Towards Optimal-time Stability
Shengjie Wang
Lan Fengb
Xiang Zheng
Yu-wen Cao
Oluwatosin Oseni
Haotian Xu
Tao Zhang
Yang Gao
39
1
0
02 Jan 2023
Don't do it: Safer Reinforcement Learning With Rule-based Guidance
Ekaterina Nikonova
Cheng Xue
Jochen Renz
32
0
0
28 Dec 2022
Online Shielding for Reinforcement Learning
Bettina Könighofer
Julian Rudolf
Alexander Palmisano
Martin Tappler
Roderick Bloem
OffRL
14
21
0
04 Dec 2022
Quantile Constrained Reinforcement Learning: A Reinforcement Learning Framework Constraining Outage Probability
Whiyoung Jung
Myungsik Cho
Jongeui Park
Young-Jin Sung
38
4
0
28 Nov 2022
A Transfer Learning Approach for UAV Path Design with Connectivity Outage Constraint
G. Fontanesi
Anding Zhu
M. Arvaneh
Hamed Ahmadi
19
16
0
07 Nov 2022
Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems
Eric Yang Yu
Zhizhen Qin
Min Kyung Lee
Sicun Gao
OffRL
37
15
0
22 Oct 2022
Robotic Table Wiping via Reinforcement Learning and Whole-body Trajectory Optimization
T. Lew
Sumeet Singh
M. Prats
Jeffrey Bingham
Jonathan Weisz
...
Fei Xia
Peng Xu
Tingnan Zhang
Jie Tan
Montserrat Gonzalez
35
15
0
19 Oct 2022
Near-Optimal Multi-Agent Learning for Safe Coverage Control
Manish Prajapat
M. Turchetta
Melanie Zeilinger
Andreas Krause
35
14
0
12 Oct 2022
Learning Control Policies for Stochastic Systems with Reach-avoid Guarantees
Dorde Zikelic
Mathias Lechner
T. Henzinger
K. Chatterjee
26
22
0
11 Oct 2022
Neurosymbolic Motion and Task Planning for Linear Temporal Logic Tasks
Xiaowu Sun
Yasser Shoukry
50
11
0
11 Oct 2022
Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning
Zih-Yun Chiu
Yi-Lin Tuan
William Yang Wang
Michael C. Yip
OffRL
32
3
0
07 Oct 2022
Guiding Safe Exploration with Weakest Preconditions
Greg Anderson
Swarat Chaudhuri
Işıl Dillig
46
6
0
28 Sep 2022
Constrained Update Projection Approach to Safe Policy Optimization
Long Yang
Jiaming Ji
Juntao Dai
Linrui Zhang
Binbin Zhou
Pengfei Li
Yaodong Yang
Gang Pan
41
43
0
15 Sep 2022
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
117
241
0
20 May 2022
Bridging Model-based Safety and Model-free Reinforcement Learning through System Identification of Low Dimensional Linear Models
Zhongyu Li
Jun Zeng
A. Thirugnanam
Koushil Sreenath
29
16
0
11 May 2022
Safe Reinforcement Learning Using Black-Box Reachability Analysis
Mahmoud Selim
Amr Alanwar
Shreyas Kousik
Grace Gao
Marco Pavone
Karl H. Johansson
29
32
0
15 Apr 2022
Safe Reinforcement Learning for Legged Locomotion
Tsung-Yen Yang
Tingnan Zhang
Linda Luu
Sehoon Ha
Jie Tan
Wenhao Yu
29
40
0
05 Mar 2022
Safe Control with Learned Certificates: A Survey of Neural Lyapunov, Barrier, and Contraction methods
Charles Dawson
Sicun Gao
Chuchu Fan
46
232
0
23 Feb 2022
Accelerating Primal-dual Methods for Regularized Markov Decision Processes
Haoya Li
Hsiang-Fu Yu
Lexing Ying
Inderjit Dhillon
34
4
0
21 Feb 2022
Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake
Shahaf S. Shperberg
Bo Liu
Peter Stone
34
7
0
19 Feb 2022
1
2
3
Next