RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems

27 December 2023

Papers citing "RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems"

5 / 5 papers shown

Title
ID policy (with reassignment) is asymptotically optimal for heterogeneous weakly-coupled MDPs Xiangcheng Zhang Yige Hong Weina Wang 35 0 0 09 Feb 2025
RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender Systems Shuo Su Xiaoshuang Chen Yao Wang Yulin Wu Ziqiang Zhang Kaiqiao Zhan Ben Wang Kun Gai AI4TS 24 1 0 20 Sep 2024
Weakly Coupled Deep Q-Networks Ibrahim El Shar Daniel R. Jiang 19 2 0 28 Oct 2023
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed Guogang Liao Zewen Wang Xiaoxu Wu Xiaowen Shi Chuheng Zhang Yongkang Wang Xingxing Wang Dong Wang 33 36 0 09 Sep 2021
COMBO: Conservative Offline Model-Based Policy Optimization Tianhe Yu Aviral Kumar Rafael Rafailov Aravind Rajeswaran Sergey Levine Chelsea Finn OffRL 214 413 0 16 Feb 2021