16

SARM: LLM-Augmented Semantic Anchor for End-to-End Live-Streaming Ranking

Ruochen Yang
Yueyang Liu
Zijie Zhuang
Changxin Lao
Yuhui Zhang
Jiangxia Cao
Jia Xu
Xiang Chen
Haoke Xiao
Xiangyu Wu
Xiaoyou Zhou
Xiao Lv
Shuang Yang
Tingwen Liu
Zhaojie Liu
Han Li
Kun Gai
Main:8 Pages
8 Figures
Bibliography:2 Pages
7 Tables
Appendix:1 Pages
Abstract

Large-scale live-streaming recommendation requires precise modeling of non-stationary content semantics under strict real-time serving constraints. In industrial deployment, two common approaches exhibit fundamental limitations: discrete semantic abstractions sacrifice descriptive precision through clustering, while dense multimodal embeddings are extracted independently and remain weakly aligned with ranking optimization, limiting fine-grained content-aware ranking. To address these limitations, we propose \textbf{SARM}, an end-to-end ranking architecture that integrates natural-language semantic anchors directly into ranking optimization, enabling fine-grained author representations conditioned on multimodal content. Each semantic anchor is represented as learnable text tokens jointly optimized with ranking features, allowing the model to adapt content descriptions to ranking objectives. A lightweight dual-token gated design captures domain-specific live-streaming semantics, while an asymmetric deployment strategy preserves low-latency online training and serving. Extensive offline evaluation and large-scale A/B tests show consistent improvements over production baselines. SARM is fully deployed and serves over 400 million users daily.

View on arXiv
Comments on this paper