ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.14603
  4. Cited By
Learning to be Safe: Deep RL with a Safety Critic

Learning to be Safe: Deep RL with a Safety Critic

27 October 2020
K. Srinivasan
Benjamin Eysenbach
Sehoon Ha
Jie Tan
Chelsea Finn
    OffRL
ArXivPDFHTML

Papers citing "Learning to be Safe: Deep RL with a Safety Critic"

42 / 42 papers shown
Title
Barrier Function Overrides For Non-Convex Fixed Wing Flight Control and Self-Driving Cars
Barrier Function Overrides For Non-Convex Fixed Wing Flight Control and Self-Driving Cars
Eric Squires
Phillip Odom
Z. Kira
31
0
0
08 May 2025
A Domain-Agnostic Scalable AI Safety Ensuring Framework
A Domain-Agnostic Scalable AI Safety Ensuring Framework
Beomjun Kim
Kangyeon Kim
Sunwoo Kim
Heejin Ahn
57
0
0
29 Apr 2025
Cooptimizing Safety and Performance with a Control-Constrained
  Formulation
Cooptimizing Safety and Performance with a Control-Constrained Formulation
Hao Wang
Adityaya Dhande
Somil Bansal
28
1
0
10 Sep 2024
Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding
Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding
Daniel Bethell
Simos Gerasimou
R. Calinescu
Calum Imrie
OffRL
OnRL
39
0
0
28 May 2024
Counterexample-Guided Repair of Reinforcement Learning Systems Using
  Safety Critics
Counterexample-Guided Repair of Reinforcement Learning Systems Using Safety Critics
David Boetius
Stefan Leue
28
0
0
24 May 2024
Preparing for Black Swans: The Antifragility Imperative for Machine
  Learning
Preparing for Black Swans: The Antifragility Imperative for Machine Learning
Ming Jin
38
2
0
18 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer
  Crashes
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Kyle Stachowicz
Sergey Levine
17
6
0
07 May 2024
Myopically Verifiable Probabilistic Certificates for Safe Control and
  Learning
Myopically Verifiable Probabilistic Certificates for Safe Control and Learning
Zhuoyuan Wang
Haoming Jing
Christian Kurniawan
Albert Chern
Yorie Nakahira
39
1
0
23 Apr 2024
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement
  Learning
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
Dohyeong Kim
Songhwai Oh
19
19
0
01 Dec 2023
Learning to Recover for Safe Reinforcement Learning
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
34
0
0
21 Sep 2023
Reinforcement Learning by Guided Safe Exploration
Reinforcement Learning by Guided Safe Exploration
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
34
5
0
26 Jul 2023
C-MCTS: Safe Planning with Monte Carlo Tree Search
C-MCTS: Safe Planning with Monte Carlo Tree Search
Dinesh Parthasarathy
G. Kontes
Axel Plinge
Christopher Mutschler
40
3
0
25 May 2023
Reinforcement Learning for Safe Robot Control using Control Lyapunov
  Barrier Functions
Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Desong Du
Shao-Fu Han
Naiming Qi
Haitham Bou-Ammar
Jun Wang
Wei Pan
34
15
0
16 May 2023
When Learning Is Out of Reach, Reset: Generalization in Autonomous
  Visuomotor Reinforcement Learning
When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning
Zichen Zhang
Luca Weihs
OffRL
24
5
0
30 Mar 2023
A Multiplicative Value Function for Safe and Efficient Reinforcement
  Learning
A Multiplicative Value Function for Safe and Efficient Reinforcement Learning
Nick Bührer
Zhejun Zhang
Alexander Liniger
Feng Yu
Luc Van Gool
24
1
0
07 Mar 2023
Efficient Exploration Using Extra Safety Budget in Constrained Policy
  Optimization
Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization
Haotian Xu
Shengjie Wang
Zhaolei Wang
Yunzhe Zhang
Qing Zhuo
Yang Gao
Tao Zhang
18
0
0
28 Feb 2023
Failure-aware Policy Learning for Self-assessable Robotics Tasks
Failure-aware Policy Learning for Self-assessable Robotics Tasks
Kechun Xu
Runjian Chen
Shuqing Zhao
Zizhang Li
Hongxiang Yu
Ci Chen
Yue Wang
R. Xiong
20
1
0
25 Feb 2023
Optimal Transport Perturbations for Safe Reinforcement Learning with
  Robustness Guarantees
Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees
James Queeney
E. C. Ozcan
I. Paschalidis
Christos G. Cassandras
OOD
OffRL
31
5
0
31 Jan 2023
Safe Reinforcement Learning for an Energy-Efficient Driver Assistance
  System
Safe Reinforcement Learning for an Energy-Efficient Driver Assistance System
Habtamu Hailemichael
B. Ayalew
Lindsey Kerbel
Andrej Ivanco
K. Loiselle
19
4
0
03 Jan 2023
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep
  Guidance
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance
Kelvin Xu
Zheyuan Hu
Ria Doshi
Aaron Rovinsky
Vikash Kumar
Abhishek Gupta
Sergey Levine
32
19
0
19 Dec 2022
Evaluating Model-free Reinforcement Learning toward Safety-critical
  Tasks
Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks
Linrui Zhang
Q. Zhang
Li Shen
Bo Yuan
Xueqian Wang
Dacheng Tao
OffRL
40
26
0
12 Dec 2022
Prediction-aware and Reinforcement Learning based Altruistic Cooperative
  Driving
Prediction-aware and Reinforcement Learning based Altruistic Cooperative Driving
Rodolfo Valiente
Mahdi Razzaghpour
Behrad Toghi
Ghayoor Shah
Y. P. Fallah
28
14
0
19 Nov 2022
Characterising the Robustness of Reinforcement Learning for Continuous
  Control using Disturbance Injection
Characterising the Robustness of Reinforcement Learning for Continuous Control using Disturbance Injection
Catherine R. Glossop
Jacopo Panerati
A. Krishnan
Zhaocong Yuan
Angela P. Schoellig
22
6
0
27 Oct 2022
VIMA: General Robot Manipulation with Multimodal Prompts
VIMA: General Robot Manipulation with Multimodal Prompts
Yunfan Jiang
Agrim Gupta
Zichen Zhang
Guanzhi Wang
Yongqiang Dou
Yanjun Chen
Li Fei-Fei
Anima Anandkumar
Yuke Zhu
Linxi Fan
LM&Ro
28
335
0
06 Oct 2022
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities:
  Robustness, Safety, and Generalizability
Trustworthy Reinforcement Learning Against Intrinsic Vulnerabilities: Robustness, Safety, and Generalizability
Mengdi Xu
Zuxin Liu
Peide Huang
Wenhao Ding
Zhepeng Cen
Bo-wen Li
Ding Zhao
74
45
0
16 Sep 2022
Learning to Rearrange with Physics-Inspired Risk Awareness
Learning to Rearrange with Physics-Inspired Risk Awareness
Meng Song
Yuhan Liu
Zhengqin Li
Manmohan Chandraker
26
0
0
26 Jun 2022
On the Robustness of Safe Reinforcement Learning under Observational
  Perturbations
On the Robustness of Safe Reinforcement Learning under Observational Perturbations
Zuxin Liu
Zijian Guo
Zhepeng Cen
Huan Zhang
Jie Tan
Bo-wen Li
Ding Zhao
OOD
OffRL
42
35
0
29 May 2022
Safe Reinforcement Learning for Legged Locomotion
Safe Reinforcement Learning for Legged Locomotion
Tsung-Yen Yang
Tingnan Zhang
Linda Luu
Sehoon Ha
Jie Tan
Wenhao Yu
21
40
0
05 Mar 2022
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill
  Acquisition
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition
Dylan Slack
Yinlam Chow
Bo Dai
Nevan Wichers
OffRL
24
7
0
10 Feb 2022
GoSafeOpt: Scalable Safe Exploration for Global Optimization of
  Dynamical Systems
GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems
Bhavya Sukhija
M. Turchetta
David Lindner
Andreas Krause
Sebastian Trimpe
Dominik Baumann
31
17
0
24 Jan 2022
Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and
  Generalization Guarantees
Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees
Kai Hsu
Allen Z. Ren
D. Nguyen
Anirudha Majumdar
J. F. Fisac
OffRL
26
41
0
20 Jan 2022
Safe Deep RL in 3D Environments using Human Feedback
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
32
4
0
20 Jan 2022
Physical Derivatives: Computing policy gradients by physical
  forward-propagation
Physical Derivatives: Computing policy gradients by physical forward-propagation
Arash Mehrjou
Ashkan Soleymani
Stefan Bauer
Bernhard Schölkopf
38
0
0
15 Jan 2022
Model-Based Safe Reinforcement Learning with Time-Varying State and
  Control Constraints: An Application to Intelligent Vehicles
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles
Xinglong Zhang
Yaoqian Peng
Biao Luo
Wei Pan
Xin Xu
Haibin Xie
27
11
0
18 Dec 2021
Learning to Be Cautious
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael Bowling
18
3
0
29 Oct 2021
Safe Autonomous Racing via Approximate Reachability on Ego-vision
Safe Autonomous Racing via Approximate Reachability on Ego-vision
Bingqing Chen
Jonathan M Francis
Jean Oh
Eric Nyberg
Sylvia L. Herbert
56
14
0
14 Oct 2021
Improving Safety in Deep Reinforcement Learning using Unsupervised
  Action Planning
Improving Safety in Deep Reinforcement Learning using Unsupervised Action Planning
Hao-Lun Hsu
Qiuhua Huang
Sehoon Ha
OffRL
42
11
0
29 Sep 2021
Safe Reinforcement Learning Using Advantage-Based Intervention
Safe Reinforcement Learning Using Advantage-Based Intervention
Nolan Wagener
Byron Boots
Ching-An Cheng
29
52
0
16 Jun 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
38
175
0
10 Mar 2021
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've
  Learned
How to Train Your Robot with Deep Reinforcement Learning; Lessons We've Learned
Julian Ibarz
Jie Tan
Chelsea Finn
Mrinal Kalakrishnan
P. Pastor
Sergey Levine
OffRL
16
516
0
04 Feb 2021
Deep Dynamics Models for Learning Dexterous Manipulation
Deep Dynamics Models for Learning Dexterous Manipulation
Anusha Nagabandi
K. Konolige
Sergey Levine
Vikash Kumar
157
408
0
25 Sep 2019
Safe Exploration in Markov Decision Processes
Safe Exploration in Markov Decision Processes
T. Moldovan
Pieter Abbeel
78
308
0
22 May 2012
1