MLPs at the EOC: Spectrum of the NTK
We study the properties of the Neural Tangent Kernel (NTK) corresponding to infinitely wide -layer Multilayer Perceptrons (MLPs) taking inputs from to outputs in equipped with activation functions for some and initialized at the Edge Of Chaos (EOC). We find that the entries can be approximated by the inverses of the cosine distances of the activations corresponding to and increasingly better as the depth increases. By quantifying these inverse cosine distances and the spectrum of the matrix containing them, we obtain tight spectral bounds for the NTK matrix over a dataset , transferred from the inverse cosine distance matrix via our approximation result. Our results show that determines the rate at which the condition number of the NTK matrix converges to its limit as depth increases, implying in particular that the absolute value () is better than the ReLU () in this regard.
View on arXiv