2009.07346
Reinforcement Learning for Strategic Recommendations
15 September 2020
Georgios Theocharous
Yash Chandak
Philip S. Thomas
F. D. Nijs
OffRL
Papers citing
"Reinforcement Learning for Strategic Recommendations"
9 / 9 papers shown
Title
Bi-Level Offline Policy Optimization with Limited Exploration
Wenzhuo Zhou
OffRL
90
5
0
10 Oct 2023
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
69
1
0
28 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
69
5
0
23 Sep 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskill
Philip S. Thomas
OffRL
89
6
0
24 Jan 2023
Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning
Tong Mu
Georgios Theocharous
David Arbour
Emma Brunskill
66
6
0
30 Dec 2021
Edge-Compatible Reinforcement Learning for Recommendations
James E. Kostas
Philip S. Thomas
Georgios Theocharous
OffRL
120
0
0
10 Dec 2021
SOPE: Spectrum of Off-Policy Estimators
C. J. Yuan
Yash Chandak
S. Giguere
Philip S. Thomas
S. Niekum
OffRL
93
5
0
06 Nov 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
101
53
0
26 Apr 2021
Towards Safe Policy Improvement for Non-Stationary MDPs
Yash Chandak
Scott M. Jordan
Georgios Theocharous
Martha White
Philip S. Thomas
OffRL
132
34
0
23 Oct 2020