LIME: Link-based user-item Interaction Modeling with decoupled xor attention for Efficient test time scaling

Main: 9 pages; Bibliography: 2 pages; Appendix: 8 pages; 7 figures; 6 tables

Abstract

Scaling large recommendation systems requires advancing three major frontiers: processing longer user histories, expanding candidate sets, and increasing model capacity. Transformers are promising on all three, but their computational cost scales quadratically with the user sequence length and linearly with the number of candidates. This cost makes it prohibitively expensive to expand candidate sets or increase sequence length at inference, despite the significant performance improvements that doing so would bring.
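
To make the scaling claim concrete, here is a minimal sketch of a toy cost model. It assumes, purely for illustration, that cost is proportional to the product of the number of candidates, the squared sequence length, and a hidden dimension; the function name `attention_cost` and the `dim` parameter are hypothetical and are not taken from the paper.

```python
def attention_cost(seq_len: int, num_candidates: int, dim: int = 64) -> int:
    """Toy cost model (an assumption, not the paper's formula):
    quadratic in history length, linear in the number of candidates."""
    return num_candidates * seq_len * seq_len * dim


base = attention_cost(seq_len=1_000, num_candidates=100)
longer_history = attention_cost(seq_len=2_000, num_candidates=100)
more_candidates = attention_cost(seq_len=1_000, num_candidates=200)

# Doubling the history quadruples the cost; doubling the candidates doubles it.
print(longer_history / base)   # 4.0
print(more_candidates / base)  # 2.0
```

Under this assumed model, growing both the history and the candidate set at inference time multiplies the two effects, which is the expense the abstract describes.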
