ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2009.07346
  4. Cited By
Reinforcement Learning for Strategic Recommendations

Reinforcement Learning for Strategic Recommendations

15 September 2020
Georgios Theocharous
Yash Chandak
Philip S. Thomas
F. D. Nijs
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Reinforcement Learning for Strategic Recommendations"

9 / 9 papers shown
Title
Bi-Level Offline Policy Optimization with Limited Exploration
Bi-Level Offline Policy Optimization with Limited Exploration
Wenzhuo Zhou
OffRL
90
5
0
10 Oct 2023
Stackelberg Batch Policy Learning
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
69
1
0
28 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified
  Error Quantification Framework
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
69
5
0
23 Sep 2023
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Off-Policy Evaluation for Action-Dependent Non-Stationary Environments
Yash Chandak
Shiv Shankar
Nathaniel D. Bastian
Bruno Castro da Silva
Emma Brunskil
Philip S. Thomas
OffRL
89
6
0
24 Jan 2023
Constraint Sampling Reinforcement Learning: Incorporating Expertise For
  Faster Learning
Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning
Tong Mu
Georgios Theocharous
David Arbour
Emma Brunskill
66
6
0
30 Dec 2021
Edge-Compatible Reinforcement Learning for Recommendations
Edge-Compatible Reinforcement Learning for Recommendations
James E. Kostas
Philip S. Thomas
Georgios Theocharous
OffRL
120
0
0
10 Dec 2021
SOPE: Spectrum of Off-Policy Estimators
SOPE: Spectrum of Off-Policy Estimators
C. J. Yuan
Yash Chandak
S. Giguere
Philip S. Thomas
S. Niekum
OffRL
89
5
0
06 Nov 2021
Universal Off-Policy Evaluation
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRLELM
101
53
0
26 Apr 2021
Towards Safe Policy Improvement for Non-Stationary MDPs
Towards Safe Policy Improvement for Non-Stationary MDPs
Yash Chandak
Scott M. Jordan
Georgios Theocharous
Martha White
Philip S. Thomas
OffRL
132
34
0
23 Oct 2020
1