Formal Verification of Noisy Quantum Reinforcement Learning Policies
Main:14 Pages
8 Figures
Bibliography:5 Pages
2 Tables
Abstract
Quantum reinforcement learning (QRL) aims to use quantum effects to create sequential decision-making policies that achieve tasks more effectively than their classical counterparts. However, QRL policies face uncertainty from quantum measurements and hardware noise, such as bit-flip, phase-flip, and depolarizing errors, which can lead to unsafe behavior. Existing work offers no systematic way to verify whether trained QRL policies meet safety requirements under specific noise conditions.
View on arXivComments on this paper
