In stochastic optimization, particularly in evolutionary computation and reinforcement learning, the optimization of a function is often addressed through optimizing a so-called relaxation of , where resembles the parameters of a family of probability measures on . We investigate the structure of such relaxations by means of measure theory and Fourier analysis, enabling us to shed light on the success of many associated stochastic optimization methods. The main structural traits we derive and that allow fast and reliable optimization of relaxations are the consistency of optimal values of , Lipschitzness of gradients, and convexity. We emphasize settings where itself is not differentiable or convex, e.g., in the presence of (stochastic) disturbance.
View on arXiv