133
v1v2 (latest)

Global law of conjugate kernel random matrices with heavy-tailed weights

Main:-1 Pages
7 Figures
Bibliography:1 Pages
Appendix:48 Pages
Abstract

We study the asymptotic spectral distribution of the conjugate kernel random matrix YYYY^\top, where Y=f(WX)Y= f(WX) arises from a two-layer neural network model. We consider the setting where WW and XX are random rectangular matrices with i.i.d.\ entries, where the entries of WW follow a heavy-tailed distribution, while those of XX have light tails. Our assumptions on WW include a broad class of heavy-tailed distributions, such as symmetric α\alpha-stable laws with α]0,2[\alpha \in ]0,2[ and sparse matrices with O(1)\mathcal{O}(1) nonzero entries per row. The activation function ff, applied entrywise, is bounded, smooth, odd, and nonlinear. We compute the limiting eigenvalue distribution of YYYY^\top through its moments and show that heavy-tailed weights induce strong correlations between the entries of YY, resulting in richer and fundamentally different spectral behavior compared to the light-tailed case.

View on arXiv
Comments on this paper