Contextual Learning for Stochastic Optimization

ACM Conference on Economics and Computation (EC), 2025

22 May 2025

Anna Heuser

Thomas Kesselheim

ArXiv (abs)PDF HTML Github

Main:17 Pages

Bibliography:3 Pages

Appendix:2 Pages

Abstract

Motivated by stochastic optimization, we introduce the problem of learning from samples of contextual value distributions. A contextual value distribution can be understood as a family of real-valued distributions, where each sample consists of a context $x$ and a random variable drawn from the corresponding real-valued distribution $D_x$ . By minimizing a convex surrogate loss, we learn an empirical distribution $D'_x$ for each context, ensuring a small Lévy distance to $D_x$ . We apply this result to obtain the sample complexity bounds for the learning of an $\epsilon$ -optimal policy for stochastic optimization problems defined on an unknown contextual value distribution. The sample complexity is shown to be polynomial for the general case of strongly monotone and stable optimization problems, including Single-item Revenue Maximization, Pandora's Box and Optimal Stopping.

View on arXiv

Comments on this paper