Logic and the 2-Simplicial Transformer

Abstract

We introduce the 2-simplicial Transformer, an extension of the Transformer which includes a form of higher-dimensional attention generalising the dot-product attention, and uses this attention to update entity representations with tensor products of value vectors. We show that this architecture is a useful inductive bias for logical reasoning in the context of deep reinforcement learning.
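
For intuition, here is a minimal sketch of what such a higher-dimensional attention update could look like: logits given by a learned trilinear form on (query, key, key) triples, and updates built from flattened tensor products of value vectors. Everything in the sketch is an illustrative assumption (the function name, the generic form tensor B, the output projection W, and the softmax over ordered pairs), not the paper's exact construction.

```python
import torch
import torch.nn.functional as F

def two_simplicial_attention(q, k1, k2, v1, v2, B, W):
    """Sketch of one 2-simplicial attention update over n entities.

    q, k1, k2, v1, v2: (n, d) queries, two key families and two value
        families, each derived from the entity representations.
    B: (d, d, d) tensor defining a learned trilinear form <q, k1, k2>.
    W: (d*d, d) map sending a flattened tensor product of two value
        vectors back down to the entity dimension.
    """
    n, d = q.shape
    # Trilinear logits over 2-simplices (i, j, k):
    # logits[i, j, k] = sum_{a,b,c} B[a,b,c] * q[i,a] * k1[j,b] * k2[k,c]
    logits = torch.einsum('ia,jb,kc,abc->ijk', q, k1, k2, B)
    # Normalise each query's attention over all ordered pairs (j, k).
    attn = F.softmax(logits.reshape(n, -1), dim=-1).reshape(n, n, n)
    # The value attached to a pair (j, k) is the tensor product
    # v1[j] (x) v2[k], flattened to a vector of length d * d.
    pairs = torch.einsum('jb,kc->jkbc', v1, v2).reshape(n, n, d * d)
    # Attention-weighted sum of pair values, projected back to dim d.
    return torch.einsum('ijk,jkp->ip', attn, pairs) @ W

# Usage on random data: n = 8 entities, d = 16 channels.
n, d = 8, 16
q, k1, k2, v1, v2 = (torch.randn(n, d) for _ in range(5))
B, W = torch.randn(d, d, d), torch.randn(d * d, d)
print(two_simplicial_attention(q, k1, k2, v1, v2, B, W).shape)  # (8, 16)
```

Since each query attends over pairs of source entities rather than single entities, the logit tensor is cubic in the number of entities; any practical instantiation has to control this cost, for example by restricting the pairwise attention to a small set of entities.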
