128

Notes on the Mathematical Structure of GPT LLM Architectures

Spencer Becker-Kahn
Main:9 Pages
Bibliography:1 Pages
Abstract

An exposition of the mathematics underpinning the neural network architecture of a GPT-3-style LLM.

View on arXiv
Comments on this paper