
![]() Safety-Gymnasium: A Unified Safe Reinforcement Learning BenchmarkNeural Information Processing Systems (NeurIPS), 2023 |
![]() Safe Reinforcement Learning via Probabilistic Logic ShieldsInternational Workshop on Neural-Symbolic Learning and Reasoning (NeSy), 2023 |
![]() Online Shielding for Reinforcement LearningInnovations in Systems and Software Engineering (ISSE), 2022 |
![]() Automata Learning meets ShieldingLeveraging Applications of Formal Methods (ISoLA), 2022 |
![]() Dynamic Shielding for Reinforcement Learning in Black-Box EnvironmentsAutomated Technology for Verification and Analysis (ATVA), 2022 |
![]() Safe Reinforcement Learning via Shielding under Partial ObservabilityAAAI Conference on Artificial Intelligence (AAAI), 2022 |
![]() Constrained Variational Policy Optimization for Safe Reinforcement
LearningInternational Conference on Machine Learning (ICML), 2022 |
![]() Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement
LearningConference on Robot Learning (CoRL), 2021 |
![]() Safe Multi-Agent Reinforcement Learning via ShieldingAdaptive Agents and Multi-Agent Systems (AAMAS), 2021 |
![]() Conservative Safety Critics for ExplorationInternational Conference on Learning Representations (ICLR), 2020 |
![]() Responsive Safety in Reinforcement Learning by PID Lagrangian MethodsInternational Conference on Machine Learning (ICML), 2020 |
![]() Meta-Learning in Neural Networks: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020 |
![]() Constrained Policy OptimizationInternational Conference on Machine Learning (ICML), 2017 |