Ehrenfeucht-Haussler Rank and Chain of Thought

The notion of rank of a Boolean function has been a cornerstone in the theory of PAC learning, enabling quasipolynomial-time learning algorithms for polynomial-size decision trees. We present a novel characterization of rank, grounded in the well-known Transformer architecture. We show that the rank of a function corresponds to the minimum number of Chain of Thought (CoT) steps required by a single-layer transformer decoder with hard attention to compute . Based on this characterization we establish tight bounds on the number of CoT steps required for specific problems, showing that -fold function composition necessitates exactly CoT steps. Furthermore, we analyze the problem of identifying the position of the -th occurrence of 1 in a Boolean sequence, proving that it requires CoT steps.
View on arXiv