Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.01566
Cited By
Fast Slate Policy Optimization: Going Beyond Plackett-Luce
3 August 2023
Otmane Sakhi
D. Rohde
Nicolas Chopin
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Fast Slate Policy Optimization: Going Beyond Plackett-Luce"
2 / 2 papers shown
Title
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Otmane Sakhi
Imad Aouali
Pierre Alquier
Nicolas Chopin
OffRL
41
1
0
23 May 2024
Variational Optimization
J. Staines
David Barber
DRL
57
53
0
18 Dec 2012
1