Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00757
Cited By
A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit
2 October 2015
Giuseppe Burtini
Jason L. Loeppky
Ramon Lawrence
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit"
30 / 30 papers shown
Title
Information maximization for a broad variety of multi-armed bandit games
Alex Barbier-Chebbah
Christian L. Vestergaard
Jean-Baptiste Masson
54
0
0
20 Mar 2025
The Digital Transformation in Health: How AI Can Improve the Performance of Health Systems
África Periánez
Ana Fernández del Río
Ivan Nazarov
Enric Jané
Moiz Hassan
Aditya Rastogi
Dexian Tang
34
9
0
24 Sep 2024
A Green Multi-Attribute Client Selection for Over-The-Air Federated Learning: A Grey-Wolf-Optimizer Approach
Maryam Ben Driss
Essaid Sabir
H. Elbiaze
Abdoulaye Baniré Diallo
M. Sadik
20
0
0
16 Sep 2024
Adaptive User Journeys in Pharma E-Commerce with Reinforcement Learning: Insights from SwipeRx
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRL
OnRL
23
2
0
15 Aug 2024
Adaptive Behavioral AI: Reinforcement Learning to Enhance Pharmacy Services
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRL
18
3
0
14 Aug 2024
Optimizing HIV Patient Engagement with Reinforcement Learning in Resource-Limited Settings
África Periánez
Kathrin Schmitz
Lazola Makhupula
Moiz Hassan
Moeti Moleko
Ana Fernández del Río
Ivan Nazarov
Aditya Rastogi
Dexian Tang
OffRL
22
0
0
14 Aug 2024
Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation
Do June Min
Verónica Pérez-Rosas
Kenneth Resnicow
Rada Mihalcea
OffRL
35
2
0
20 Mar 2024
GROS: A General Robust Aggregation Strategy
A. Cholaquidis
Emilien Joly
L. Moreno
19
2
0
23 Feb 2024
Evaluating Online Bandit Exploration In Large-Scale Recommender System
Hongbo Guo
Ruben Naeff
Alex Nikulkov
Zheqing Zhu
OffRL
9
6
0
05 Apr 2023
Adaptive Interventions for Global Health: A Case Study of Malaria
África Periánez
A. Trister
Madhav Nekkar
Ana Fernández del Río
P. Alonso
22
1
0
03 Mar 2023
Multi-Armed Bandits in Brain-Computer Interfaces
Frida Heskebeck
Carolina Bergeling
Bo Bernhardsson
11
4
0
19 May 2022
Existence conditions for hidden feedback loops in online recommender systems
A. Khritankov
Anton A. Pilkevich
13
1
0
11 Sep 2021
Debiasing Samples from Online Learning Using Bootstrap
Ningyuan Chen
Xuefeng Gao
Yi Xiong
OffRL
OnRL
9
4
0
31 Jul 2021
Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits
Gourab Ghatak
Hardhik Mohanty
Aniq Ur Rahman
TTA
21
9
0
30 May 2021
TSEC: a framework for online experimentation under experimental constraints
Simon Mak
Yuanshuo Zhou
Lavonne Hoang
C. F. J. Wu
13
1
0
17 Jan 2021
DORB: Dynamically Optimizing Multiple Rewards with Bandits
Ramakanth Pasunuru
Han Guo
Mohit Bansal
OffRL
14
6
0
15 Nov 2020
Asymptotic Randomised Control with applications to bandits
Samuel N. Cohen
Tanut Treetanthiploet
10
5
0
14 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization
Mack Sweeney
M. Adelsberg
Kathryn B. Laskey
C. Domeniconi
13
1
0
07 Oct 2020
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network
Xing Wang
A. Vinel
8
0
0
29 Sep 2020
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization
Yimin Huang
Yujun Li
Hanrong Ye
Zhenguo Li
Zhihua Zhang
14
7
0
11 Jul 2020
Bandit Samplers for Training Graph Neural Networks
Ziqi Liu
Zhengwei Wu
Zhiqiang Zhang
Jun Zhou
Shuang Yang
Le Song
Yuan Qi
17
47
0
10 Jun 2020
Odds-Ratio Thompson Sampling to Control for Time-Varying Effect
Sulgi Kim
Kyungmin Kim
6
0
0
04 Mar 2020
Gittins' theorem under uncertainty
Samuel N. Cohen
Tanut Treetanthiploet
11
3
0
12 Jul 2019
Productization Challenges of Contextual Multi-Armed Bandits
D. Abensur
Ivan Balashov
S. Bar
R. Lempel
Nurit Moscovici
I. Orlov
Danny Rosenstein
Ido Tamir
6
3
0
10 Jul 2019
Multi-Armed Bandits with Fairness Constraints for Distributing Resources to Human Teammates
Houston Claure
Yifang Chen
Jignesh Modi
Malte Jung
S. Nikolaidis
14
22
0
30 Jun 2019
Adapting multi-armed bandits policies to contextual bandits scenarios
David Cortes
16
32
0
11 Nov 2018
Cuttlefish: A Lightweight Primitive for Adaptive Query Processing
Tomer Kaftan
Magdalena Balazinska
Alvin Cheung
J. Gehrke
13
24
0
26 Feb 2018
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems
Lai Wei
Vaibhav Srivastava
21
37
0
23 Feb 2018
Taming Non-stationary Bandits: A Bayesian Approach
Vishnu Raj
Sheetal Kalyani
19
76
0
31 Jul 2017
The Multi-Armed Bandit Problem: An Efficient Non-Parametric Solution
H. Chan
25
14
0
24 Mar 2017
1