ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.09090
  4. Cited By
An Actor-Critic Contextual Bandit Algorithm for Personalized Mobile
  Health Interventions
v1v2 (latest)

An Actor-Critic Contextual Bandit Algorithm for Personalized Mobile Health Interventions

28 June 2017
H. Lei
Yangyi Lu
Ambuj Tewari
Susan Murphy
ArXiv (abs)PDFHTML

Papers citing "An Actor-Critic Contextual Bandit Algorithm for Personalized Mobile Health Interventions"

18 / 18 papers shown
Diabetes Lifestyle Medicine Treatment Assistance Using Reinforcement Learning
Diabetes Lifestyle Medicine Treatment Assistance Using Reinforcement Learning
Yuhan Tang
OffRL
101
0
0
19 Oct 2025
Improving Reward-Conditioned Policies for Multi-Armed Bandits using
  Normalized Weight Functions
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
269
0
0
16 Jun 2024
Increasing Entropy to Boost Policy Gradient Performance on
  Personalization Tasks
Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks
Andrew Starnes
Anton Dereventsov
Clayton Webster
216
1
0
09 Oct 2023
Inference for relative sparsity
Inference for relative sparsity
Samuel J. Weisenthal
Sally W. Thurston
Ashkan Ertefaie
CML
310
0
0
25 Jun 2023
Policy Optimization for Personalized Interventions in Behavioral Health
Policy Optimization for Personalized Interventions in Behavioral HealthManufacturing & Service Operations Management (MSOM), 2023
Jackie Baek
J. Boutilier
Vivek F. Farias
J. Jónasson
Erez Yoeli
OffRL
231
10
0
21 Mar 2023
Examining Policy Entropy of Reinforcement Learning Agents for
  Personalization Tasks
Examining Policy Entropy of Reinforcement Learning Agents for Personalization TasksInternational Conferences on Pattern Recognition and Artificial Intelligence (ICCPRAI), 2022
Anton Dereventsov
Andrew Starnes
Clayton Webster
388
4
0
21 Nov 2022
Simulated Contextual Bandits for Personalization Tasks from
  Recommendation Datasets
Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets
Anton Dereventsov
A. Bibin
192
2
0
12 Oct 2022
Robust Tests in Online Decision-Making
Robust Tests in Online Decision-MakingAAAI Conference on Artificial Intelligence (AAAI), 2022
Gi-Soo Kim
Hyun-Joon Yang
J. P. Kim
OffRL
170
0
0
21 Aug 2022
Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy
  Logarithmic Regrets
Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic RegretsAAAI Conference on Artificial Intelligence (AAAI), 2022
Zongqi Wan
Zhijie Zhang
Tongyang Li
Jialin Zhang
Xiaoming Sun
295
29
0
30 May 2022
Selectively Contextual Bandits
Selectively Contextual Bandits
Claudia V. Roberts
Maria Dimakopoulou
Qifeng Qiao
Ashok Chandrashekar
Tony Jebara
168
1
0
09 May 2022
Bounded Memory Adversarial Bandits with Composite Anonymous Delayed
  Feedback
Bounded Memory Adversarial Bandits with Composite Anonymous Delayed FeedbackInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Zongqi Wan
Xiaoming Sun
Jialin Zhang
221
1
0
27 Apr 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal
  Adaptive Interventions
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive InterventionsInternational Statistical Review (ISR), 2022
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
295
20
0
04 Mar 2022
Learning Neural Contextual Bandits Through Perturbed Rewards
Learning Neural Contextual Bandits Through Perturbed RewardsInternational Conference on Learning Representations (ICLR), 2022
Yiling Jia
Weitong Zhang
Dongruo Zhou
Quanquan Gu
Hongning Wang
376
20
0
24 Jan 2022
From Personalized Medicine to Population Health: A Survey of mHealth
  Sensing Techniques
From Personalized Medicine to Population Health: A Survey of mHealth Sensing Techniques
Zhiyuan Wang
Haoyi Xiong
Jie Zhang
Sijia Yang
M. Boukhechba
Laura E. Barnes
Daqing Zhang
Dejing Dou
270
38
0
02 Jul 2021
Fatigue-Aware Ad Creative Selection
Fatigue-Aware Ad Creative Selection
Daisuke Moriwaki
Komei Fujita
Shota Yasui
T. Hoshino
185
8
0
21 Aug 2019
Parameterized Exploration
Parameterized Exploration
Jesse Clifton
Lili Wu
E. Laber
211
0
0
13 Jul 2019
Balanced Linear Contextual Bandits
Balanced Linear Contextual Bandits
Maria Dimakopoulou
Zhengyuan Zhou
Susan Athey
Guido Imbens
312
72
0
15 Dec 2018
Estimation Considerations in Contextual Bandits
Estimation Considerations in Contextual Bandits
Maria Dimakopoulou
Zhengyuan Zhou
Susan Athey
Guido Imbens
527
74
0
19 Nov 2017
1
Page 1 of 1