1802.09714
Cited By
Robust Actor-Critic Contextual Bandit for Mobile Health (mHealth) Interventions
27 February 2018
Feiyun Zhu
Jun Guo
Ruoyu Li
Junzhou Huang
OffRL
Papers citing "Robust Actor-Critic Contextual Bandit for Mobile Health (mHealth) Interventions"
5 / 5 papers shown
Title
Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks
Andrew Starnes
Anton Dereventsov
Clayton Webster
11
0
0
09 Oct 2023
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks
Anton Dereventsov
Andrew Starnes
Clayton Webster
11
4
0
21 Nov 2022
Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets
Anton Dereventsov
A. Bibin
11
1
0
12 Oct 2022
Robust Tests in Online Decision-Making
Gi-Soo Kim
Hyun-Joon Yang
J. P. Kim
OffRL
11
0
0
21 Aug 2022
A Batch, Off-Policy, Actor-Critic Algorithm for Optimizing the Average Reward
S. Murphy
Yanzhen Deng
Eric B. Laber
H. Maei
R. Sutton
K. Witkiewitz
OffRL
25
22
0
18 Jul 2016