Position: Adopt Constraints Over Penalties in Deep Learning

27 May 2025

Juan Ramirez

Meraj Hashemizadeh

Damien Scieur

ArXiv (abs)PDF HTML Github (35182★)

Main:10 Pages

4 Figures

Bibliography:4 Pages

6 Tables

Appendix:8 Pages

Abstract

Recent efforts toward developing trustworthy AI systems with accountability guarantees have led to a growing reliance on machine learning formulations that incorporate external requirements, or constraints. These requirements are often enforced through penalization--adding fixed-weight terms to the task loss. We argue that this approach is ill-suited, and that tailored constrained optimization methods should be adopted instead. In particular, no penalty coefficient may yield a solution that both satisfies the constraints and achieves good performance--i.e., one solving the constrained problem. Moreover, tuning these coefficients is costly, incurring significant time and computational overhead. In contrast, tailored constrained methods--such as the Lagrangian approach, which optimizes the penalization "coefficients" (the Lagrange multipliers) alongside the model--(i) truly solve the constrained problem and add accountability, (ii) eliminate the need for extensive penalty tuning, and (iii) integrate seamlessly with modern deep learning pipelines.

View on arXiv

Comments on this paper