All Papers
Title |
---|
Title |
---|
We study an online learning problem with long-term budget constraints in the adversarial setting. In this problem, at each round , the learner selects an action from a convex decision set, after which the adversary reveals a cost function and a resource consumption function . The cost and consumption functions are assumed to be -approximately convex - a broad class that generalizes convexity and encompasses many common non-convex optimization problems, including DR-submodular maximization, Online Vertex Cover, and Regularized Phase Retrieval. The goal is to design an online algorithm that minimizes cumulative cost over a horizon of length while approximately satisfying a long-term budget constraint of . We propose an efficient first-order online algorithm that guarantees -regret against the optimal fixed feasible benchmark while consuming at most resources in both full-information and bandit feedback settings. In the bandit feedback setting, our approach yields an efficient solution for the problem with improved guarantees. We also prove matching lower bounds, demonstrating the tightness of our results. Finally, we characterize the class of -approximately convex functions and show that our results apply to a broad family of problems.
View on arXiv