Graded Transformers: A Symbolic-Geometric Approach to Structured Learning

27 July 2025

Main:2 Pages

3 Figures

Appendix:35 Pages

Abstract

We introduce the Graded Transformer framework, a novel class of sequence models that embeds algebraic inductive biases through grading transformations on vector spaces. Extending the theory of Graded Neural Networks (GNNs), we propose two architectures: the Linearly Graded Transformer (LGT) and the Exponentially Graded Transformer (EGT). These models apply parameterized scaling operators-governed by fixed or learnable grading tuples and, for EGT, exponential factors to infuse hierarchical structure into attention and representation layers, enhancing efficiency for structured data.

View on arXiv

Comments on this paper