DetoxAI: a Python Toolkit for Debiasing Deep Learning Models in Computer Vision

While machine learning fairness has made significant progress in recent years, most existing solutions focus on tabular data and are poorly suited for vision-based classification tasks, which rely heavily on deep learning. To bridge this gap, we introduce DetoxAI, an open-source Python library for improving fairness in deep learning vision classifiers through post-hoc debiasing. DetoxAI implements state-of-the-art debiasing algorithms, fairness metrics, and visualization tools. It supports debiasing via interventions in internal representations and includes attribution-based visualization tools and quantitative algorithmic fairness metrics to show how bias is mitigated. This paper presents the motivation, design, and use cases of DetoxAI, demonstrating its tangible value to engineers and researchers.
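To make the "quantitative algorithmic fairness metrics" mentioned above concrete, here is a minimal sketch of two common group-fairness measures for a binary classifier: the demographic parity difference and the equal opportunity difference. It is not DetoxAI's API. The function names, the binary encoding of the protected attribute, and the toy data are assumptions made purely for illustration.

```python
# Illustrative sketch only: these helpers are hypothetical and are NOT part of DetoxAI.
# Assumes binary labels/predictions (0 or 1) and a binary protected attribute (0 or 1).

def _rate(values):
    """Mean of a non-empty list of 0/1 values."""
    if not values:
        raise ValueError("a group has no samples for this metric")
    return sum(values) / len(values)


def demographic_parity_difference(y_pred, group):
    """Absolute gap in positive-prediction rate between group 0 and group 1."""
    rates = [
        _rate([p for p, a in zip(y_pred, group) if a == g])
        for g in (0, 1)
    ]
    return abs(rates[0] - rates[1])


def equal_opportunity_difference(y_true, y_pred, group):
    """Absolute gap in true-positive rate between group 0 and group 1."""
    tprs = [
        _rate([p for t, p, a in zip(y_true, y_pred, group) if a == g and t == 1])
        for g in (0, 1)
    ]
    return abs(tprs[0] - tprs[1])


# Toy data (made up for illustration): eight samples, four per group.
y_true = [1, 1, 0, 0, 1, 1, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]
group = [0, 0, 0, 0, 1, 1, 1, 1]

dp_gap = demographic_parity_difference(y_pred, group)
eo_gap = equal_opportunity_difference(y_true, y_pred, group)
print(f"demographic parity difference: {dp_gap:.2f}")
print(f"equal opportunity difference:  {eo_gap:.2f}")
```

A value of 0 on either measure means the two groups are treated identically by that criterion. Comparing such gaps before and after a debiasing step is one way to quantify how much bias was mitigated.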
@article{stępka2025_2505.05492,
  title={DetoxAI: a Python Toolkit for Debiasing Deep Learning Models in Computer Vision},
  author={Ignacy Stępka and Lukasz Sztukiewicz and Michał Wiliński and Jerzy Stefanowski},
  journal={arXiv preprint arXiv:2505.05492},
  year={2025}
}