Entropy, concentration, and learning: a statistical mechanics primer
Akshay Balsubramani
Main: 31 pages
1 figure
Bibliography: 7 pages
Abstract
Artificial intelligence models trained through loss minimization have demonstrated significant success, grounded in principles from fields like information theory and statistical physics. This work explores these established connections through the lens of statistical mechanics, starting from first-principles sample concentration behaviors that underpin AI and machine learning. Our development of statistical mechanics for modeling highlights the key role of exponential families and of quantities from statistics, physics, and information theory.
