Most Neural Networks Are Almost Learnable

Abstract
We present a PTAS for learning random constant-depth networks. We show that for any fixed $\epsilon > 0$ and depth $i$, there is a poly-time algorithm that for any distribution on $\sqrt{d} \cdot \mathbb{S}^{d-1}$ learns random Xavier networks of depth $i$, up to an additive error of $\epsilon$. The algorithm runs in time and sample complexity of $(\bar{d})^{\mathrm{poly}(\epsilon^{-1})}$, where $\bar{d}$ is the size of the network. For some cases of sigmoid and ReLU-like activations, the bound can be improved to $(\bar{d})^{\mathrm{polylog}(\epsilon^{-1})}$, resulting in a quasi-poly-time algorithm for learning constant-depth random networks.
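To make the setting concrete, below is a minimal sketch (not the paper's algorithm) of the target object being learned: a random constant-depth fully connected network with Xavier-initialized weights, evaluated on inputs drawn from the scaled sphere $\sqrt{d} \cdot \mathbb{S}^{d-1}$. The width, depth, activation choice, and the $1/\text{fan-in}$ variance convention for Xavier initialization are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def xavier_layer(fan_in, fan_out):
    """One Xavier-initialized weight matrix: entries ~ N(0, 1/fan_in).
    (Some conventions use 2/(fan_in + fan_out); this is an assumption.)"""
    return rng.normal(0.0, 1.0 / np.sqrt(fan_in), size=(fan_out, fan_in))

def random_network(d, width, depth):
    """A random depth-`depth` fully connected network mapping R^d -> R."""
    dims = [d] + [width] * (depth - 1) + [1]
    return [xavier_layer(dims[k], dims[k + 1]) for k in range(depth)]

def forward(weights, x, activation=np.tanh):
    """Evaluate the network; tanh stands in for a sigmoid-like activation."""
    h = x
    for W in weights[:-1]:
        h = activation(W @ h)
    return (weights[-1] @ h).item()

d, width, depth = 32, 64, 3  # illustrative sizes, not from the paper
net = random_network(d, width, depth)

# An input on the scaled sphere sqrt(d) * S^{d-1}: sample a Gaussian
# direction and rescale it to have norm exactly sqrt(d).
x = rng.normal(size=d)
x *= np.sqrt(d) / np.linalg.norm(x)
print(forward(net, x))
```

The learner's goal, in this framing, is to output a predictor whose expected loss is within an additive $\epsilon$ of the random target network's, for an arbitrary input distribution supported on the scaled sphere.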