59
0

Deep Fair Learning: A Unified Framework for Fine-tuning Representations with Sufficient Networks

Abstract

Ensuring fairness in machine learning is a critical and challenging task, as biased data representations often lead to unfair predictions. To address this, we propose Deep Fair Learning, a framework that integrates nonlinear sufficient dimension reduction with deep learning to construct fair and informative representations. By introducing a novel penalty term during fine-tuning, our method enforces conditional independence between sensitive attributes and learned representations, addressing bias at its source while preserving predictive performance. Unlike prior methods, it supports diverse sensitive attributes, including continuous, discrete, binary, or multi-group types. Experiments on various types of data structure show that our approach achieves a superior balance between fairness and utility, significantly outperforming state-of-the-art baselines.

View on arXiv
@article{shi2025_2504.06470,
  title={ Deep Fair Learning: A Unified Framework for Fine-tuning Representations with Sufficient Networks },
  author={ Enze Shi and Linglong Kong and Bei Jiang },
  journal={arXiv preprint arXiv:2504.06470},
  year={ 2025 }
}
Comments on this paper