Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2208.05327
Cited By
Fast Offline Policy Optimization for Large Scale Recommendation
8 August 2022
Otmane Sakhi
D. Rohde
Alexandre Gilotte
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fast Offline Policy Optimization for Large Scale Recommendation"
4 / 4 papers shown
Title
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Otmane Sakhi
Imad Aouali
Pierre Alquier
Nicolas Chopin
OffRL
38
1
0
23 May 2024
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
Yunpeng Qing
Shunyu Liu
Jingyuan Cong
Kaixuan Chen
Yihe Zhou
Mingli Song
OffRL
27
1
0
12 Mar 2024
Fast Slate Policy Optimization: Going Beyond Plackett-Luce
Otmane Sakhi
D. Rohde
Nicolas Chopin
OffRL
10
3
0
03 Aug 2023
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
31,244
0
16 Jan 2013
1