Optimal Bounds for Adversarial Constrained Online Convex Optimization

17 March 2025

Abstract

Constrained Online Convex Optimization (COCO) can be seen as a generalization of the standard Online Convex Optimization (OCO) framework. At each round, a cost function and constraint function are revealed after a learner chooses an action. The goal is to minimize both the regret and cumulative constraint violation (CCV) against an adaptive adversary. We show for the first time that is possible to obtain the optimal $O(\sqrt{T})$ bound on both regret and CCV, improving the best known bounds of $O \left( \sqrt{T} \right)$ and $\tilde{O} \left( \sqrt{T} \right)$ for the regret and CCV, respectively. Based on a new surrogate loss function enforcing a minimum penalty on the constraint function, we demonstrate that both the Follow-the-Regularized-Leader and the Online Gradient Descent achieve the optimal bounds.

View on arXiv

@article{ferreira2025_2503.13366,
  title={ Optimal Bounds for Adversarial Constrained Online Convex Optimization },
  author={ Ricardo N. Ferreira and Cláudia Soares },
  journal={arXiv preprint arXiv:2503.13366},
  year={ 2025 }
}

Comments on this paper