ResearchTrend.AI

Fast Offline Policy Optimization for Large Scale Recommendation

8 August 2022
Otmane Sakhi
D. Rohde
Alexandre Gilotte
    OffRL
arXiv: 2208.05327

Papers citing "Fast Offline Policy Optimization for Large Scale Recommendation"

4 / 4 papers shown
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Otmane Sakhi
Imad Aouali
Pierre Alquier
Nicolas Chopin
OffRL
41
1
0
23 May 2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
Yunpeng Qing
Shunyu Liu
Jingyuan Cong
Kaixuan Chen
Yihe Zhou
Mingli Song
OffRL
27
1
0
12 Mar 2024
Fast Slate Policy Optimization: Going Beyond Plackett-Luce
Otmane Sakhi
D. Rohde
Nicolas Chopin
OffRL
18
3
0
03 Aug 2023
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
31,244
0
16 Jan 2013