237

Contextual Learning for Stochastic Optimization

ACM Conference on Economics and Computation (EC), 2025
Main:17 Pages
Bibliography:3 Pages
Appendix:2 Pages
Abstract

Motivated by stochastic optimization, we introduce the problem of learning from samples of contextual value distributions. A contextual value distribution can be understood as a family of real-valued distributions, where each sample consists of a context xx and a random variable drawn from the corresponding real-valued distribution DxD_x. By minimizing a convex surrogate loss, we learn an empirical distribution DxD'_x for each context, ensuring a small Lévy distance to DxD_x. We apply this result to obtain the sample complexity bounds for the learning of an ϵ\epsilon-optimal policy for stochastic optimization problems defined on an unknown contextual value distribution. The sample complexity is shown to be polynomial for the general case of strongly monotone and stable optimization problems, including Single-item Revenue Maximization, Pandora's Box and Optimal Stopping.

View on arXiv
Comments on this paper