COPA: Comparing the incomparable in multi-objective model evaluation

18 March 2025

Adrián Javaloy

Antonio Vergari

Isabel Valera

Main: 1 page

23 figures

Bibliography: 1 page

2 tables

Appendix: 27 pages

Abstract

In machine learning (ML), we often need to choose one among hundreds of trained ML models at hand, based on various objectives such as accuracy, robustness, fairness, or scalability. However, it is often unclear how to compare, aggregate and, ultimately, trade off these objectives, which makes model selection a time-consuming task that requires expert knowledge, as objectives may be measured in different units and scales. In this work, we investigate how objectives can be automatically normalized and aggregated to systematically help the user navigate the Pareto front. To this end, we make incomparable objectives comparable using their cumulative distribution functions (CDFs), approximated by their relative rankings. As a result, our proposed approach, COPA, can aggregate objectives while matching user-specific preferences, allowing practitioners to meaningfully navigate and search for models in the Pareto front. We demonstrate the potential impact of COPA in both model selection and benchmarking tasks across diverse ML areas such as fair ML, domain generalization, AutoML and foundation models, where classical ways to normalize and aggregate objectives fall short.
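The core idea of the abstract, normalizing each objective through its relative ranking and then aggregating under user preferences, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `rank_normalize` and `aggregate`, the weighted-sum aggregation, the "lower is better" convention, and the toy numbers are all assumptions made for illustration.

```python
from bisect import bisect_right


def rank_normalize(values):
    """Approximate the empirical CDF of one objective via relative ranks.

    Each model receives the fraction of models whose value is less than
    or equal to its own, so scores lie in (0, 1] regardless of units.
    """
    n = len(values)
    sorted_vals = sorted(values)
    # bisect_right counts how many entries are <= v (ties share a score).
    return [bisect_right(sorted_vals, v) / n for v in values]


def aggregate(objectives, weights):
    """Weighted average of rank-normalized objectives (lower is better).

    `objectives` is a list of columns, one per objective. The weighted
    sum is an assumed aggregation, chosen only for simplicity.
    """
    normalized = [rank_normalize(col) for col in objectives]
    total = sum(weights)
    n_models = len(objectives[0])
    return [
        sum(w * col[i] for w, col in zip(weights, normalized)) / total
        for i in range(n_models)
    ]


# Hypothetical toy data: three models, two objectives in different units.
error_rate = [0.10, 0.05, 0.20]     # fraction, lower is better
latency_ms = [120.0, 300.0, 80.0]   # milliseconds, lower is better

# A user who cares mostly about error, and one who cares mostly about latency.
scores_error_first = aggregate([error_rate, latency_ms], weights=[0.75, 0.25])
scores_latency_first = aggregate([error_rate, latency_ms], weights=[0.25, 0.75])

best_error_first = min(range(3), key=scores_error_first.__getitem__)
best_latency_first = min(range(3), key=scores_latency_first.__getitem__)
print(best_error_first, best_latency_first)
```

Because both objectives are mapped to the same rank-based scale before weighting, the 300 ms latency does not dominate the 0.05 error rate merely because its raw magnitude is larger; changing the weights moves the selection between different models on the Pareto front.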
