Finding Manifolds With Bilinear Autoencoders

19 October 2025

Thomas Dooms

Ward Gauderis

ArXiv (abs)PDF HTML Github (2★)

Main:8 Pages

17 Figures

Bibliography:3 Pages

Appendix:7 Pages

Abstract

Sparse autoencoders are a standard tool for uncovering interpretable latent representations in neural networks. Yet, their interpretation depends on the inputs, making their isolated study incomplete. Polynomials offer a solution; they serve as algebraic primitives that can be analysed without reference to input and can describe structures ranging from linear concepts to complicated manifolds. This work uses bilinear autoencoders to efficiently decompose representations into quadratic polynomials. We discuss improvements that induce importance ordering, clustering, and activation sparsity. This is an initial step toward nonlinear yet analysable latents through their algebraic properties.

View on arXiv

Comments on this paper