v1v2 (latest)

Scalable LinUCB: Low-Rank Design Matrix Updates for Recommenders with Large Action Spaces

22 October 2025

Main:15 Pages

17 Figures

Bibliography:2 Pages

6 Tables

Appendix:10 Pages

Abstract

In this paper, we introduce PSI-LinUCB, a scalable variant of LinUCB that enables efficient training, inference, and memory usage by representing the inverse regularized design matrix as a sum of a diagonal matrix and low-rank correction. We derive numerically stable rank-1 and batched updates that maintain the inverse without explicitly forming the matrix. To control memory growth, we employ a projector-splitting integrator for dynamical low-rank approximation, yielding an average per-step update cost and memory usage of $O(dr)$ for approximation rank $r$ . The inference complexity of the proposed algorithm is $O(dr)$ per action evaluation. Experiments on recommender system datasets demonstrate the effectiveness of our algorithm.

View on arXiv

Comments on this paper