v1v2v3 (latest)

Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding

28 May 2024

Papers citing "Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding"

28 / 28 papers shown

Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation

21 Oct 2025

Safety-Gymnasium: A Unified Safe Reinforcement Learning BenchmarkNeural Information Processing Systems (NeurIPS), 2023

Jiaming Ji

Juntao Dai

336

113

19 Oct 2023

Safe Reinforcement Learning via Probabilistic Logic ShieldsInternational Workshop on Neural-Symbolic Learning and Reasoning (NeSy), 2023

179

06 Mar 2023

Online Shielding for Reinforcement LearningInnovations in Systems and Software Engineering (ISSE), 2022

139

04 Dec 2022

Automata Learning meets ShieldingLeveraging Applications of Formal Methods (ISoLA), 2022

208

04 Dec 2022

Dynamic Shielding for Reinforcement Learning in Black-Box EnvironmentsAutomated Technology for Verification and Analysis (ATVA), 2022

153

27 Jul 2022

Safe Reinforcement Learning via Shielding under Partial ObservabilityAAAI Conference on Artificial Intelligence (AAAI), 2022

Steven Carr

N. Jansen

Sebastian Junges

Ufuk Topcu

191

02 Apr 2022

Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake

Shahaf S. Shperberg

Bo Liu

Peter Stone

236

19 Feb 2022

Constrained Variational Policy Optimization for Safe Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022

Wei Liu

Ding Zhao

283

28 Jan 2022

Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement LearningConference on Robot Learning (CoRL), 2021

Nikita Rudin

David Hoeller

Philipp Reist

Marco Hutter

887

772

24 Sep 2021

Reinforcement Learning with External Knowledge by using Logical Neural Networks

105

03 Mar 2021

Safe Multi-Agent Reinforcement Learning via ShieldingAdaptive Agents and Multi-Agent Systems (AAMAS), 2021

Ufuk Topcu

181

109

27 Jan 2021

Recovery RL: Safe Reinforcement Learning with Learned Recovery ZonesIEEE Robotics and Automation Letters (RA-L), 2020

290

267

29 Oct 2020

Learning to be Safe: Deep RL with a Safety Critic

Jie Tan

197

167

27 Oct 2020

Conservative Safety Critics for ExplorationInternational Conference on Learning Representations (ICLR), 2020

Homanga Bharadhwaj

346

153

27 Oct 2020

Responsive Safety in Reinforcement Learning by PID Lagrangian MethodsInternational Conference on Machine Learning (ICML), 2020

Adam Stooke

Joshua Achiam

Pieter Abbeel

271

351

08 Jul 2020

Safe Reinforcement Learning via Curriculum Induction

231

22 Jun 2020

Meta-Learning in Neural Networks: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

Timothy M. Hospedales

Antreas Antoniou

P. Micaelli

Amos Storkey

OOD

735

2,376

11 Apr 2020

Dota 2 with Large Scale Deep Reinforcement Learning

...

GNN VLM CLL AI4CE LRM

400

2,031

13 Dec 2019

Safe Option-Critic: Learning Safety in the Option-Critic Architecture

Arushi Jain

Khimya Khetarpal

Doina Precup

242

21 Jul 2018

Reward Constrained Policy Optimization

Chen Tessler

D. Mankowitz

Shie Mannor

400

603

28 May 2018

Safe Exploration in Continuous Action Spaces

Gal Dalal

Krishnamurthy Dvijotham

165

476

26 Jan 2018

Safe Reinforcement Learning via Shielding

Ufuk Topcu

1.1K

777

29 Aug 2017

Emergence of Locomotion Behaviours in Rich Environments

...

Martin Riedmiller

David Silver

470

976

07 Jul 2017

Constrained Policy OptimizationInternational Conference on Machine Learning (ICML), 2017

Joshua Achiam

David Held

Aviv Tamar

Pieter Abbeel

1.3K

1,560

30 May 2017

Uncertainty-Aware Reinforcement Learning for Collision Avoidance

Pieter Abbeel

205

326

03 Feb 2017

Concrete Problems in AI Safety

1.3K

2,758

21 Jun 2016

Continuous control with deep reinforcement learning

Alexander Pritzel

David Silver

962

14,671

09 Sep 2015