24
1

Notes on the Mathematical Structure of GPT LLM Architectures

Spencer Becker-Kahn
Abstract

An exposition of the mathematics underpinning the neural network architecture of a GPT-3-style LLM.

View on arXiv
Comments on this paper