ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.18180
  4. Cited By
Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding
v1v2v3 (latest)

Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding

28 May 2024
Daniel Bethell
Simos Gerasimou
R. Calinescu
Calum Imrie
    OffRLOnRL
ArXiv (abs)PDFHTML

Papers citing "Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding"

28 / 28 papers shown
Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation
Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation
Daniel Bethell
Simos Gerasimou
R. Calinescu
Calum Imrie
96
0
0
21 Oct 2025
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Safety-Gymnasium: A Unified Safe Reinforcement Learning BenchmarkNeural Information Processing Systems (NeurIPS), 2023
Jiaming Ji
Borong Zhang
Jiayi Zhou
Xuehai Pan
Weidong Huang
Ruiyang Sun
Yiran Geng
Yifan Zhong
Juntao Dai
Yaodong Yang
OffRL
336
113
0
19 Oct 2023
Safe Reinforcement Learning via Probabilistic Logic Shields
Safe Reinforcement Learning via Probabilistic Logic ShieldsInternational Workshop on Neural-Symbolic Learning and Reasoning (NeSy), 2023
Wen-Chi Yang
G. Marra
Gavin Rens
Luc de Raedt
OffRL
179
42
0
06 Mar 2023
Online Shielding for Reinforcement Learning
Online Shielding for Reinforcement LearningInnovations in Systems and Software Engineering (ISSE), 2022
Bettina Könighofer
Julian Rudolf
Alexander Palmisano
Martin Tappler
Roderick Bloem
OffRL
139
30
0
04 Dec 2022
Automata Learning meets Shielding
Automata Learning meets ShieldingLeveraging Applications of Formal Methods (ISoLA), 2022
Martin Tappler
Stefan Pranger
Bettina Könighofer
Edi Muškardin
Roderick Bloem
Kim G. Larsen
208
7
0
04 Dec 2022
Dynamic Shielding for Reinforcement Learning in Black-Box Environments
Dynamic Shielding for Reinforcement Learning in Black-Box EnvironmentsAutomated Technology for Verification and Analysis (ATVA), 2022
Masaki Waga
Ezequiel Castellano
Sasinee Pruekprasert
Stefan Klikovits
Toru Takisaka
I. Hasuo
153
11
0
27 Jul 2022
Safe Reinforcement Learning via Shielding under Partial Observability
Safe Reinforcement Learning via Shielding under Partial ObservabilityAAAI Conference on Artificial Intelligence (AAAI), 2022
Steven Carr
N. Jansen
Sebastian Junges
Ufuk Topcu
191
61
0
02 Apr 2022
Learning a Shield from Catastrophic Action Effects: Never Repeat the
  Same Mistake
Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake
Shahaf S. Shperberg
Bo Liu
Peter Stone
236
8
0
19 Feb 2022
Constrained Variational Policy Optimization for Safe Reinforcement
  Learning
Constrained Variational Policy Optimization for Safe Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Zuxin Liu
Zhepeng Cen
Vladislav Isenbaev
Wei Liu
Zhiwei Steven Wu
Yue Liu
Ding Zhao
283
94
0
28 Jan 2022
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement
  Learning
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement LearningConference on Robot Learning (CoRL), 2021
Nikita Rudin
David Hoeller
Philipp Reist
Marco Hutter
887
772
0
24 Sep 2021
Reinforcement Learning with External Knowledge by using Logical Neural
  Networks
Reinforcement Learning with External Knowledge by using Logical Neural Networks
Daiki Kimura
Subhajit Chaudhury
Akifumi Wachi
Ryosuke Kohita
Asim Munawar
Michiaki Tatsubori
Alexander G. Gray
105
12
0
03 Mar 2021
Safe Multi-Agent Reinforcement Learning via Shielding
Safe Multi-Agent Reinforcement Learning via ShieldingAdaptive Agents and Multi-Agent Systems (AAMAS), 2021
Ingy Elsayed-Aly
Suda Bharadwaj
Chris Amato
Rüdiger Ehlers
Ufuk Topcu
Lu Feng
181
109
0
27 Jan 2021
Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones
Recovery RL: Safe Reinforcement Learning with Learned Recovery ZonesIEEE Robotics and Automation Letters (RA-L), 2020
Brijen Thananjeyan
Ashwin Balakrishna
Suraj Nair
Michael Luo
K. Srinivasan
M. Hwang
Joseph E. Gonzalez
Julian Ibarz
Chelsea Finn
Ken Goldberg
OffRL
290
267
0
29 Oct 2020
Learning to be Safe: Deep RL with a Safety Critic
Learning to be Safe: Deep RL with a Safety Critic
K. Srinivasan
Benjamin Eysenbach
Sehoon Ha
Jie Tan
Chelsea Finn
OffRL
197
167
0
27 Oct 2020
Conservative Safety Critics for Exploration
Conservative Safety Critics for ExplorationInternational Conference on Learning Representations (ICLR), 2020
Homanga Bharadhwaj
Aviral Kumar
Nicholas Rhinehart
Sergey Levine
Florian Shkurti
Animesh Garg
OffRL
346
153
0
27 Oct 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Responsive Safety in Reinforcement Learning by PID Lagrangian MethodsInternational Conference on Machine Learning (ICML), 2020
Adam Stooke
Joshua Achiam
Pieter Abbeel
271
351
0
08 Jul 2020
Safe Reinforcement Learning via Curriculum Induction
Safe Reinforcement Learning via Curriculum Induction
M. Turchetta
Andrey Kolobov
S. Shah
Andreas Krause
Alekh Agarwal
231
98
0
22 Jun 2020
Meta-Learning in Neural Networks: A Survey
Meta-Learning in Neural Networks: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
735
2,376
0
11 Apr 2020
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNNVLMCLLAI4CELRM
400
2,031
0
13 Dec 2019
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Safe Option-Critic: Learning Safety in the Option-Critic Architecture
Arushi Jain
Khimya Khetarpal
Doina Precup
242
28
0
21 Jul 2018
Reward Constrained Policy Optimization
Reward Constrained Policy Optimization
Chen Tessler
D. Mankowitz
Shie Mannor
400
603
0
28 May 2018
Safe Exploration in Continuous Action Spaces
Safe Exploration in Continuous Action Spaces
Gal Dalal
Krishnamurthy Dvijotham
Matej Vecerík
Todd Hester
Cosmin Paduraru
Yuval Tassa
165
476
0
26 Jan 2018
Safe Reinforcement Learning via Shielding
Safe Reinforcement Learning via Shielding
Mohammed Alshiekh
Roderick Bloem
Rüdiger Ehlers
Bettina Könighofer
S. Niekum
Ufuk Topcu
1.1K
777
0
29 Aug 2017
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
470
976
0
07 Jul 2017
Constrained Policy Optimization
Constrained Policy OptimizationInternational Conference on Machine Learning (ICML), 2017
Joshua Achiam
David Held
Aviv Tamar
Pieter Abbeel
1.3K
1,560
0
30 May 2017
Uncertainty-Aware Reinforcement Learning for Collision Avoidance
Uncertainty-Aware Reinforcement Learning for Collision Avoidance
G. Kahn
Adam R. Villaflor
Vitchyr H. Pong
Pieter Abbeel
Sergey Levine
205
326
0
03 Feb 2017
Concrete Problems in AI Safety
Concrete Problems in AI Safety
Dario Amodei
C. Olah
Jacob Steinhardt
Paul Christiano
John Schulman
Dandelion Mané
1.3K
2,758
0
21 Jun 2016
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
962
14,671
0
09 Sep 2015
1