548

Resourceful Contextual Bandits

Annual Conference Computational Learning Theory (COLT), 2014
Abstract

We study contextual bandits with ancillary constraints on resources, which are common in real-world applications such as choosing ads or dynamic pricing of items. We design the first algorithm for solving these problems, and prove a regret guarantee with near-optimal statistical properties.

View on arXiv
Comments on this paper