Gradient Reversal Against Discrimination

International Conference on Data Science and Advanced Analytics (DSAA), 2018

1 July 2018

Abstract

No methods currently exist for making arbitrary neural networks fair. In this work we introduce GRAD, a new and simplified method for producing fair neural networks that can be used for auto-encoding fair representations or directly with predictive networks. It is easy to implement and add to existing architectures, has only one (insensitive) hyper-parameter, and provides improved individual and group fairness. We use the flexibility of GRAD to demonstrate multi-attribute protection.
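
The abstract gives no implementation details, so the following is only a minimal sketch of the generic gradient reversal idea that the title refers to, not the authors' code. It assumes a shared encoder feeding both a task head and an adversary that tries to predict the protected attribute; the names `GradientReversal`, `encoder_gradient`, and `lam` (standing in for the single hyper-parameter) are hypothetical. The layer is the identity in the forward pass and multiplies the incoming gradient by `-lam` in the backward pass.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class GradientReversal:
    """Identity going forward; negates and scales the gradient going back."""
    lam: float = 1.0  # hypothetical name for the reversal strength

    def forward(self, features: List[float]) -> List[float]:
        # The forward pass leaves the shared representation unchanged.
        return list(features)

    def backward(self, grad_from_adversary: List[float]) -> List[float]:
        # The backward pass flips the adversary's gradient, so the layers
        # below are pushed to make the protected attribute harder to predict.
        return [-self.lam * g for g in grad_from_adversary]


def encoder_gradient(grad_task: List[float],
                     grad_adversary: List[float],
                     layer: GradientReversal) -> List[float]:
    """Sum the task gradient and the reversed adversary gradient (assumed setup)."""
    reversed_grad = layer.backward(grad_adversary)
    return [t + r for t, r in zip(grad_task, reversed_grad)]


# Toy numbers, chosen only to show the sign flip.
layer = GradientReversal(lam=0.5)
hidden = [0.2, -1.0, 3.0]
out = layer.forward(hidden)
combined = encoder_gradient([1.0, 2.0, 3.0], [4.0, -2.0, 0.0], layer)
```

In an autodiff framework the same behaviour would normally be expressed as a custom gradient rule attached to an identity function, which is what makes a layer of this kind easy to add to an existing architecture.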
