97

Power Homotopy for Zeroth-Order Non-Convex Optimizations

Main:8 Pages
2 Figures
Bibliography:3 Pages
9 Tables
Appendix:5 Pages
Abstract

We introduce GS-PowerHP, a novel zeroth-order method for non-convex optimization problems of the form maxxRdf(x)\max_{x \in \mathbb{R}^d} f(x). Our approach leverages two key components: a power-transformed Gaussian-smoothed surrogate FN,σ(μ)=ExN(μ,σ2Id)[eNf(x)]F_{N,\sigma}(\mu) = \mathbb{E}_{x\sim\mathcal{N}(\mu,\sigma^2 I_d)}[e^{N f(x)}] whose stationary points cluster near the global maximizer xx^* of ff for sufficiently large NN, and an incrementally decaying σ\sigma for enhanced data efficiency. Under mild assumptions, we prove convergence in expectation to a small neighborhood of xx^* with the iteration complexity of O(d2ε2)O(d^2 \varepsilon^{-2}). Empirical results show our approach consistently ranks among the top three across a suite of competing algorithms. Its robustness is underscored by the final experiment on a substantially high-dimensional problem (d=150,528d=150,528), where it achieved first place on least-likely targeted black-box attacks against images from ImageNet, surpassing all competing methods.

View on arXiv
Comments on this paper