Fast Offline Policy Optimization for Large Scale Recommendation

8 August 2022

Papers citing "Fast Offline Policy Optimization for Large Scale Recommendation"

4 / 4 papers shown

Title
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning Otmane Sakhi Imad Aouali Pierre Alquier Nicolas Chopin OffRL 41 1 0 23 May 2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective Yunpeng Qing Shunyu Liu Jingyuan Cong Kaixuan Chen Yihe Zhou Mingli Song OffRL 27 1 0 12 Mar 2024
Fast Slate Policy Optimization: Going Beyond Plackett-Luce Otmane Sakhi D. Rohde Nicolas Chopin OffRL 18 3 0 03 Aug 2023
Efficient Estimation of Word Representations in Vector Space Tomáš Mikolov Kai Chen G. Corrado J. Dean 3DV 228 31,244 0 16 Jan 2013