SARM: LLM-Augmented Semantic Anchor for End-to-End Live-Streaming Ranking

10 February 2026

Ruochen Yang

Yueyang Liu

Zijie Zhuang

Changxin Lao

Yuhui Zhang

Jiangxia Cao

Jia Xu

Xiang Chen

Haoke Xiao

Xiangyu Wu

Xiaoyou Zhou

Xiao Lv

Shuang Yang

Tingwen Liu

Zhaojie Liu

Han Li

Kun Gai

AI4TS

ArXiv (abs)PDF HTML Github

Main:8 Pages

8 Figures

Bibliography:2 Pages

7 Tables

Appendix:1 Pages

Abstract

Large-scale live-streaming recommendation requires precise modeling of non-stationary content semantics under strict real-time serving constraints. In industrial deployment, two common approaches exhibit fundamental limitations: discrete semantic abstractions sacrifice descriptive precision through clustering, while dense multimodal embeddings are extracted independently and remain weakly aligned with ranking optimization, limiting fine-grained content-aware ranking. To address these limitations, we propose \textbf{SARM}, an end-to-end ranking architecture that integrates natural-language semantic anchors directly into ranking optimization, enabling fine-grained author representations conditioned on multimodal content. Each semantic anchor is represented as learnable text tokens jointly optimized with ranking features, allowing the model to adapt content descriptions to ranking objectives. A lightweight dual-token gated design captures domain-specific live-streaming semantics, while an asymmetric deployment strategy preserves low-latency online training and serving. Extensive offline evaluation and large-scale A/B tests show consistent improvements over production baselines. SARM is fully deployed and serves over 400 million users daily.

View on arXiv

Comments on this paper