31
0

How to explain grokking

Abstract

Explanation of grokking (delayed generalization) in learning is given by modeling grokking by the stochastic gradient Langevin dynamics (Brownian motion) and applying the ideas of thermodynamics.

View on arXiv
Comments on this paper