Non-Stationary Bandits with Habituation and Recovery Dynamics

v1v2v3 (latest)

Non-Stationary Bandits with Habituation and Recovery Dynamics

26 July 2017

Yonatan Dov Mintz

Philip M. Kaminsky

Yoshimi Fukuoka

ArXiv (abs)PDF HTML

Papers citing "Non-Stationary Bandits with Habituation and Recovery Dynamics"

15 / 15 papers shown

Title
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback Nan Lu Ethan X. Fang Junwei Lu 420 0 0 27 Apr 2025
Adaptive Interventions with User-Defined Goals for Health Behavior Change Aishwarya Mandyam Matthew Joerke William Denton Barbara E. Engelhardt Emma Brunskill 42 1 0 16 Nov 2023
Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms Khashayar Khosravi R. Leme Chara Podimata Apostolis Tsorvantzis 88 1 0 21 Jul 2023
An Adaptive Optimization Approach to Personalized Financial Incentives in Mobile Behavioral Weight Loss Interventions Qiaomei Li Kara L. Gavin Corrine L. Voils Yonatan Dov Mintz 28 1 0 01 Jul 2023
A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits Liu Leqi Giulio Zhou Fatma Kilincc-Karzan Zachary Chase Lipton A. Montgomery 72 2 0 16 Apr 2023
Policy Optimization for Personalized Interventions in Behavioral Health Jackie Baek J. Boutilier Vivek F. Farias J. Jónasson Erez Yoeli OffRL 58 8 0 21 Mar 2023
Stochastic Rising Bandits Alberto Maria Metelli F. Trovò Matteo Pirola Marcello Restelli 51 18 0 07 Dec 2022
Non-Stationary Bandit Learning via Predictive Sampling Yueyang Liu Kuang Xu Benjamin Van Roy 124 18 0 04 May 2022
Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health Aditya Mate Lovish Madaan Aparna Taneja N. Madhiwalla Shresth Verma Gargi Singh Aparna Hegde Pradeep Varakantham Milind Tambe 81 54 0 16 Sep 2021
Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function Ilgin Dogan Z. Shen A. Aswani 42 12 0 04 Aug 2021
Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits Zhimei Ren Zhengyuan Zhou 106 31 0 27 Aug 2020
Recovering Bandits Ciara Pike-Burke Steffen Grunewalder 140 41 0 31 Oct 2019
Weighted Linear Bandits for Non-Stationary Environments Yoan Russac Claire Vernade Olivier Cappé 159 108 0 19 Sep 2019
Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity Peng Liao Kristjan Greenewald P. Klasnja Susan Murphy 64 85 0 08 Sep 2019
Mostly Exploration-Free Algorithms for Contextual Bandits Hamsa Bastani Mohsen Bayati Khashayar Khosravi 397 159 0 28 Apr 2017