Reinforcement Learning for Strategic Recommendations

15 September 2020

Papers citing "Reinforcement Learning for Strategic Recommendations"

9 / 9 papers shown

Title
Bi-Level Offline Policy Optimization with Limited Exploration Wenzhuo Zhou OffRL 90 5 0 10 Oct 2023
Stackelberg Batch Policy Learning Wenzhuo Zhou Annie Qu OffRL 69 1 0 28 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework Wenzhuo Zhou Yuhan Li Ruoqing Zhu Annie Qu OffRL 69 5 0 23 Sep 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments Yash Chandak Shiv Shankar Nathaniel D. Bastian Bruno Castro da Silva Emma Brunskil Philip S. Thomas OffRL 89 6 0 24 Jan 2023
Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning Tong Mu Georgios Theocharous David Arbour Emma Brunskill 66 6 0 30 Dec 2021
Edge-Compatible Reinforcement Learning for Recommendations James E. Kostas Philip S. Thomas Georgios Theocharous OffRL 120 0 0 10 Dec 2021
SOPE: Spectrum of Off-Policy Estimators C. J. Yuan Yash Chandak S. Giguere Philip S. Thomas S. Niekum OffRL 89 5 0 06 Nov 2021
Universal Off-Policy Evaluation Yash Chandak S. Niekum Bruno C. da Silva Erik Learned-Miller Emma Brunskill Philip S. Thomas OffRL ELM 101 53 0 26 Apr 2021
Towards Safe Policy Improvement for Non-Stationary MDPs Yash Chandak Scott M. Jordan Georgios Theocharous Martha White Philip S. Thomas OffRL 132 34 0 23 Oct 2020