1707.08423
v3 (latest)
Non-Stationary Bandits with Habituation and Recovery Dynamics
26 July 2017
Yonatan Dov Mintz
A. Aswani
Philip M. Kaminsky
E. Flowers
Yoshimi Fukuoka
Papers citing "Non-Stationary Bandits with Habituation and Recovery Dynamics"
15 / 15 papers shown
Title
Contextual Online Uncertainty-Aware Preference Learning for Human Feedback
Nan Lu
Ethan X. Fang
Junwei Lu
420
0
0
27 Apr 2025
Adaptive Interventions with User-Defined Goals for Health Behavior Change
Aishwarya Mandyam
Matthew Joerke
William Denton
Barbara E. Engelhardt
Emma Brunskill
42
1
0
16 Nov 2023
Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms
Khashayar Khosravi
R. Leme
Chara Podimata
Apostolis Tsorvantzis
88
1
0
21 Jul 2023
An Adaptive Optimization Approach to Personalized Financial Incentives in Mobile Behavioral Weight Loss Interventions
Qiaomei Li
Kara L. Gavin
Corrine L. Voils
Yonatan Dov Mintz
28
1
0
01 Jul 2023
A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits
Liu Leqi
Giulio Zhou
Fatma Kılınç-Karzan
Zachary Chase Lipton
A. Montgomery
72
2
0
16 Apr 2023
Policy Optimization for Personalized Interventions in Behavioral Health
Jackie Baek
J. Boutilier
Vivek F. Farias
J. Jónasson
Erez Yoeli
OffRL
58
8
0
21 Mar 2023
Stochastic Rising Bandits
Alberto Maria Metelli
F. Trovò
Matteo Pirola
Marcello Restelli
51
18
0
07 Dec 2022
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
124
18
0
04 May 2022
Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health
Aditya Mate
Lovish Madaan
Aparna Taneja
N. Madhiwalla
Shresth Verma
Gargi Singh
Aparna Hegde
Pradeep Varakantham
Milind Tambe
81
54
0
16 Sep 2021
Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function
Ilgin Dogan
Z. Shen
A. Aswani
42
12
0
04 Aug 2021
Dynamic Batch Learning in High-Dimensional Sparse Linear Contextual Bandits
Zhimei Ren
Zhengyuan Zhou
106
31
0
27 Aug 2020
Recovering Bandits
Ciara Pike-Burke
Steffen Grunewalder
140
41
0
31 Oct 2019
Weighted Linear Bandits for Non-Stationary Environments
Yoan Russac
Claire Vernade
Olivier Cappé
159
108
0
19 Sep 2019
Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity
Peng Liao
Kristjan Greenewald
P. Klasnja
Susan Murphy
64
85
0
08 Sep 2019
Mostly Exploration-Free Algorithms for Contextual Bandits
Hamsa Bastani
Mohsen Bayati
Khashayar Khosravi
397
159
0
28 Apr 2017