We show that any memory-constrained, first-order algorithm which minimizes -dimensional, -Lipschitz convex functions over the unit ball to accuracy using at most bits of memory must make at least first-order queries (for any constant ). Consequently, the performance of such memory-constrained algorithms are a polynomial factor worse than the optimal query bound for this problem obtained by cutting plane methods that use memory. This resolves a COLT 2019 open problem of Woodworth and Srebro.
View on arXiv