Notes on the Mathematical Structure of GPT LLM Architectures
Spencer Becker-Kahn
Main:9 Pages
Bibliography:1 Pages
Abstract
An exposition of the mathematics underpinning the neural network architecture of a GPT-3-style LLM.
View on arXivComments on this paper
