Formal Verification of Noisy Quantum Reinforcement Learning Policies

1 December 2025

Dennis Gross

ArXiv (abs)PDF HTML Github (13★)

Main:14 Pages

8 Figures

Bibliography:5 Pages

2 Tables

Abstract

Quantum reinforcement learning (QRL) aims to use quantum effects to create sequential decision-making policies that achieve tasks more effectively than their classical counterparts. However, QRL policies face uncertainty from quantum measurements and hardware noise, such as bit-flip, phase-flip, and depolarizing errors, which can lead to unsafe behavior. Existing work offers no systematic way to verify whether trained QRL policies meet safety requirements under specific noise conditions.

View on arXiv

Comments on this paper