v1v2 (latest)
Scalable LinUCB: Low-Rank Design Matrix Updates for Recommenders with Large Action Spaces
- BDL
Main:15 Pages
17 Figures
Bibliography:2 Pages
6 Tables
Appendix:10 Pages
Abstract
In this paper, we introduce PSI-LinUCB, a scalable variant of LinUCB that enables efficient training, inference, and memory usage by representing the inverse regularized design matrix as a sum of a diagonal matrix and low-rank correction. We derive numerically stable rank-1 and batched updates that maintain the inverse without explicitly forming the matrix. To control memory growth, we employ a projector-splitting integrator for dynamical low-rank approximation, yielding an average per-step update cost and memory usage of for approximation rank . The inference complexity of the proposed algorithm is per action evaluation. Experiments on recommender system datasets demonstrate the effectiveness of our algorithm.
View on arXivComments on this paper
