Let be a function of the form for . We give a simple proof that shows that poly-size depth two neural networks with (exponentially) bounded weights cannot approximate whenever cannot be approximated by a low degree polynomial. Moreover, for many 's, such as , the number of neurons must be . Furthermore, the result holds w.r.t.\ the uniform distribution on . As many functions of the above form can be well approximated by poly-size depth three networks with poly-bounded weights, this establishes a separation between depth two and depth three networks w.r.t.\ the uniform distribution on .
View on arXiv