Optimal Bounds for Adversarial Constrained Online Convex Optimization

Main:11 Pages
Bibliography:3 Pages
2 Tables
Appendix:6 Pages
Abstract

Constrained Online Convex Optimization (COCO) can be seen as a generalization of the standard Online Convex Optimization (OCO) framework. At each round, a cost function and a constraint function are revealed after the learner chooses an action. The goal is to minimize both the regret and the cumulative constraint violation (CCV) against an adaptive adversary. We show for the first time that it is possible to obtain the optimal $O(\sqrt{T})$ bound on both regret and CCV, improving the best known bounds of $O(\sqrt{T})$ and $\tilde{O}(\sqrt{T})$ for the regret and CCV, respectively. Based on a new surrogate loss function enforcing a minimum penalty on the constraint function, we demonstrate that both Follow-the-Regularized-Leader and Online Gradient Descent achieve the optimal bounds.
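The protocol described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the paper's surrogate loss, so the code below substitutes a generic penalty surrogate, $f_t(x) + \lambda \max(g_t(x), 0)$, purely as an assumption for illustration. The function names, the unit-ball action set, the step-size schedule, and the toy cost and constraint functions are all hypothetical and are not taken from the paper.

```python
import numpy as np

def project_ball(x, radius=1.0):
    """Euclidean projection onto the ball of the given radius (assumed action set)."""
    norm = np.linalg.norm(x)
    return x if norm <= radius else x * (radius / norm)

def run_coco_ogd(costs, constraints, dim, lam=1.0, radius=1.0):
    """Projected OGD on an illustrative surrogate f_t(x) + lam * max(g_t(x), 0).

    costs / constraints: lists of (value_fn, grad_fn) pairs, one per round.
    Returns the played actions, the cumulative cost, and the CCV.
    """
    x = np.zeros(dim)
    actions, total_cost, ccv = [], 0.0, 0.0
    for t in range(len(costs)):
        (f, grad_f), (g, grad_g) = costs[t], constraints[t]
        # The learner commits to x before f_t and g_t are revealed.
        actions.append(x.copy())
        total_cost += f(x)
        ccv += max(g(x), 0.0)
        # Subgradient of the surrogate at the played point.
        grad = grad_f(x) + (lam * grad_g(x) if g(x) > 0 else 0.0)
        eta = radius / np.sqrt(t + 1)  # assumed step-size schedule
        x = project_ball(x - eta * grad, radius)
    return actions, total_cost, ccv

# Toy run: random linear costs and a fixed constraint x[0] <= 0.5.
rng = np.random.default_rng(0)
T, dim = 200, 2
costs, constraints = [], []
for _ in range(T):
    c = rng.normal(size=dim)
    costs.append((lambda x, c=c: float(c @ x), lambda x, c=c: c))
    constraints.append((lambda x: float(x[0] - 0.5),
                        lambda x: np.array([1.0, 0.0])))

actions, total_cost, ccv = run_coco_ogd(costs, constraints, dim)
print(f"cumulative cost = {total_cost:.3f}, CCV = {ccv:.3f}")
```

Here CCV is accumulated as the sum of the positive parts of $g_t(x_t)$, which is the usual convention in the COCO literature. Regret would additionally require the cost of the best fixed action that satisfies every constraint, which this sketch does not compute.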
