470
v1v2v3v4v5 (latest)

VeriX: Towards Verified Explainability of Deep Neural Networks

Neural Information Processing Systems (NeurIPS), 2022
Abstract

We present VeriX (Verified eXplainability), a system for producing optimal robust explanations and generating counterfactuals along decision boundaries of machine learning models. We build such explanations and counterfactuals iteratively using constraint solving techniques and a heuristic based on feature-level sensitivity ranking. We evaluate our method on image recognition benchmarks and a real-world scenario of autonomous aircraft taxiing.

View on arXiv
Comments on this paper