arXiv: 2401.01369
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems

27 December 2023
Jiahong Zhou
Shunhui Mao
Guoliang Yang
Bo Tang
Qianlong Xie
Lebin Lin
Xingxing Wang
Dong Wang

Papers citing "RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems"

ID policy (with reassignment) is asymptotically optimal for heterogeneous weakly-coupled MDPs
Xiangcheng Zhang
Yige Hong
Weina Wang
35
0
0
09 Feb 2025
RPAF: A Reinforcement Prediction-Allocation Framework for Cache Allocation in Large-Scale Recommender Systems
Shuo Su
Xiaoshuang Chen
Yao Wang
Yulin Wu
Ziqiang Zhang
Kaiqiao Zhan
Ben Wang
Kun Gai
AI4TS
24
1
0
20 Sep 2024
Weakly Coupled Deep Q-Networks
Ibrahim El Shar
Daniel R. Jiang
19
2
0
28 Oct 2023
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed
Guogang Liao
Zewen Wang
Xiaoxu Wu
Xiaowen Shi
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
33
36
0
09 Sep 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
214
413
0
16 Feb 2021