A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit

2 October 2015

Papers citing "A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit"

30 / 30 papers shown

Title
Information maximization for a broad variety of multi-armed bandit games Alex Barbier-Chebbah Christian L. Vestergaard Jean-Baptiste Masson 54 0 0 20 Mar 2025
The Digital Transformation in Health: How AI Can Improve the Performance of Health Systems África Periánez Ana Fernández del Río Ivan Nazarov Enric Jané Moiz Hassan Aditya Rastogi Dexian Tang 34 9 0 24 Sep 2024
A Green Multi-Attribute Client Selection for Over-The-Air Federated Learning: A Grey-Wolf-Optimizer Approach Maryam Ben Driss Essaid Sabir H. Elbiaze Abdoulaye Baniré Diallo M. Sadik 20 0 0 16 Sep 2024
Adaptive User Journeys in Pharma E-Commerce with Reinforcement Learning: Insights from SwipeRx Ana Fernández del Río Michael Brennan Leong Paulo Saraiva Ivan Nazarov Aditya Rastogi Moiz Hassan Dexian Tang África Periánez OffRL OnRL 23 2 0 15 Aug 2024
Adaptive Behavioral AI: Reinforcement Learning to Enhance Pharmacy Services Ana Fernández del Río Michael Brennan Leong Paulo Saraiva Ivan Nazarov Aditya Rastogi Moiz Hassan Dexian Tang África Periánez OffRL 18 3 0 14 Aug 2024
Optimizing HIV Patient Engagement with Reinforcement Learning in Resource-Limited Settings África Periánez Kathrin Schmitz Lazola Makhupula Moiz Hassan Moeti Moleko Ana Fernández del Río Ivan Nazarov Aditya Rastogi Dexian Tang OffRL 22 0 0 14 Aug 2024
Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation Do June Min Verónica Pérez-Rosas Kenneth Resnicow Rada Mihalcea OffRL 35 2 0 20 Mar 2024
GROS: A General Robust Aggregation Strategy A. Cholaquidis Emilien Joly L. Moreno 19 2 0 23 Feb 2024
Evaluating Online Bandit Exploration In Large-Scale Recommender System Hongbo Guo Ruben Naeff Alex Nikulkov Zheqing Zhu OffRL 9 6 0 05 Apr 2023
Adaptive Interventions for Global Health: A Case Study of Malaria África Periánez A. Trister Madhav Nekkar Ana Fernández del Río P. Alonso 22 1 0 03 Mar 2023
Multi-Armed Bandits in Brain-Computer Interfaces Frida Heskebeck Carolina Bergeling Bo Bernhardsson 11 4 0 19 May 2022
Existence conditions for hidden feedback loops in online recommender systems A. Khritankov Anton A. Pilkevich 13 1 0 11 Sep 2021
Debiasing Samples from Online Learning Using Bootstrap Ningyuan Chen Xuefeng Gao Yi Xiong OffRL OnRL 9 4 0 31 Jul 2021
Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits Gourab Ghatak Hardhik Mohanty Aniq Ur Rahman TTA 21 9 0 30 May 2021
TSEC: a framework for online experimentation under experimental constraints Simon Mak Yuanshuo Zhou Lavonne Hoang C. F. J. Wu 13 1 0 17 Jan 2021
DORB: Dynamically Optimizing Multiple Rewards with Bandits Ramakanth Pasunuru Han Guo Mohit Bansal OffRL 14 6 0 15 Nov 2020
Asymptotic Randomised Control with applications to bandits Samuel N. Cohen Tanut Treetanthiploet 10 5 0 14 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization Mack Sweeney M. Adelsberg Kathryn B. Laskey C. Domeniconi 13 1 0 07 Oct 2020
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network Xing Wang A. Vinel 8 0 0 29 Sep 2020
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization Yimin Huang Yujun Li Hanrong Ye Zhenguo Li Zhihua Zhang 14 7 0 11 Jul 2020
Bandit Samplers for Training Graph Neural Networks Ziqi Liu Zhengwei Wu Zhiqiang Zhang Jun Zhou Shuang Yang Le Song Yuan Qi 17 47 0 10 Jun 2020
Odds-Ratio Thompson Sampling to Control for Time-Varying Effect Sulgi Kim Kyungmin Kim 6 0 0 04 Mar 2020
Gittins' theorem under uncertainty Samuel N. Cohen Tanut Treetanthiploet 11 3 0 12 Jul 2019
Productization Challenges of Contextual Multi-Armed Bandits D. Abensur Ivan Balashov S. Bar R. Lempel Nurit Moscovici I. Orlov Danny Rosenstein Ido Tamir 6 3 0 10 Jul 2019
Multi-Armed Bandits with Fairness Constraints for Distributing Resources to Human Teammates Houston Claure Yifang Chen Jignesh Modi Malte Jung S. Nikolaidis 14 22 0 30 Jun 2019
Adapting multi-armed bandits policies to contextual bandits scenarios David Cortes 16 32 0 11 Nov 2018
Cuttlefish: A Lightweight Primitive for Adaptive Query Processing Tomer Kaftan Magdalena Balazinska Alvin Cheung J. Gehrke 13 24 0 26 Feb 2018
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems Lai Wei Vaibhav Srivastava 21 37 0 23 Feb 2018
Taming Non-stationary Bandits: A Bayesian Approach Vishnu Raj Sheetal Kalyani 19 76 0 31 Jul 2017
The Multi-Armed Bandit Problem: An Efficient Non-Parametric Solution H. Chan 25 14 0 24 Mar 2017