ResearchTrend.AI
A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit

2 October 2015
Giuseppe Burtini
Jason L. Loeppky
Ramon Lawrence

Papers citing "A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit"

30 / 30 papers shown
Information maximization for a broad variety of multi-armed bandit games
Alex Barbier-Chebbah
Christian L. Vestergaard
Jean-Baptiste Masson
54
0
0
20 Mar 2025
The Digital Transformation in Health: How AI Can Improve the Performance of Health Systems
África Periánez
Ana Fernández del Río
Ivan Nazarov
Enric Jané
Moiz Hassan
Aditya Rastogi
Dexian Tang
34
9
0
24 Sep 2024
A Green Multi-Attribute Client Selection for Over-The-Air Federated Learning: A Grey-Wolf-Optimizer Approach
Maryam Ben Driss
Essaid Sabir
H. Elbiaze
Abdoulaye Baniré Diallo
M. Sadik
20
0
0
16 Sep 2024
Adaptive User Journeys in Pharma E-Commerce with Reinforcement Learning: Insights from SwipeRx
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRL
OnRL
23
2
0
15 Aug 2024
Adaptive Behavioral AI: Reinforcement Learning to Enhance Pharmacy Services
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRL
18
3
0
14 Aug 2024
Optimizing HIV Patient Engagement with Reinforcement Learning in Resource-Limited Settings
África Periánez
Kathrin Schmitz
Lazola Makhupula
Moiz Hassan
Moeti Moleko
Ana Fernández del Río
Ivan Nazarov
Aditya Rastogi
Dexian Tang
OffRL
22
0
0
14 Aug 2024
Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation
Do June Min
Verónica Pérez-Rosas
Kenneth Resnicow
Rada Mihalcea
OffRL
35
2
0
20 Mar 2024
GROS: A General Robust Aggregation Strategy
A. Cholaquidis
Emilien Joly
L. Moreno
19
2
0
23 Feb 2024
Evaluating Online Bandit Exploration In Large-Scale Recommender System
Hongbo Guo
Ruben Naeff
Alex Nikulkov
Zheqing Zhu
OffRL
9
6
0
05 Apr 2023
Adaptive Interventions for Global Health: A Case Study of Malaria
África Periánez
A. Trister
Madhav Nekkar
Ana Fernández del Río
P. Alonso
22
1
0
03 Mar 2023
Multi-Armed Bandits in Brain-Computer Interfaces
Frida Heskebeck
Carolina Bergeling
Bo Bernhardsson
11
4
0
19 May 2022
Existence conditions for hidden feedback loops in online recommender systems
A. Khritankov
Anton A. Pilkevich
13
1
0
11 Sep 2021
Debiasing Samples from Online Learning Using Bootstrap
Ningyuan Chen
Xuefeng Gao
Yi Xiong
OffRL
OnRL
9
4
0
31 Jul 2021
Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits
Gourab Ghatak
Hardhik Mohanty
Aniq Ur Rahman
TTA
21
9
0
30 May 2021
TSEC: a framework for online experimentation under experimental constraints
Simon Mak
Yuanshuo Zhou
Lavonne Hoang
C. F. J. Wu
13
1
0
17 Jan 2021
DORB: Dynamically Optimizing Multiple Rewards with Bandits
Ramakanth Pasunuru
Han Guo
Mohit Bansal
OffRL
14
6
0
15 Nov 2020
Asymptotic Randomised Control with applications to bandits
Samuel N. Cohen
Tanut Treetanthiploet
10
5
0
14 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization
Mack Sweeney
M. Adelsberg
Kathryn B. Laskey
C. Domeniconi
13
1
0
07 Oct 2020
Reannealing of Decaying Exploration Based On Heuristic Measure in Deep Q-Network
Xing Wang
A. Vinel
8
0
0
29 Sep 2020
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization
Yimin Huang
Yujun Li
Hanrong Ye
Zhenguo Li
Zhihua Zhang
14
7
0
11 Jul 2020
Bandit Samplers for Training Graph Neural Networks
Ziqi Liu
Zhengwei Wu
Zhiqiang Zhang
Jun Zhou
Shuang Yang
Le Song
Yuan Qi
17
47
0
10 Jun 2020
Odds-Ratio Thompson Sampling to Control for Time-Varying Effect
Sulgi Kim
Kyungmin Kim
6
0
0
04 Mar 2020
Gittins' theorem under uncertainty
Samuel N. Cohen
Tanut Treetanthiploet
11
3
0
12 Jul 2019
Productization Challenges of Contextual Multi-Armed Bandits
D. Abensur
Ivan Balashov
S. Bar
R. Lempel
Nurit Moscovici
I. Orlov
Danny Rosenstein
Ido Tamir
6
3
0
10 Jul 2019
Multi-Armed Bandits with Fairness Constraints for Distributing Resources to Human Teammates
Houston Claure
Yifang Chen
Jignesh Modi
Malte Jung
S. Nikolaidis
14
22
0
30 Jun 2019
Adapting multi-armed bandits policies to contextual bandits scenarios
David Cortes
16
32
0
11 Nov 2018
Cuttlefish: A Lightweight Primitive for Adaptive Query Processing
Tomer Kaftan
Magdalena Balazinska
Alvin Cheung
J. Gehrke
13
24
0
26 Feb 2018
On Abruptly-Changing and Slowly-Varying Multiarmed Bandit Problems
Lai Wei
Vaibhav Srivastava
21
37
0
23 Feb 2018
Taming Non-stationary Bandits: A Bayesian Approach
Vishnu Raj
Sheetal Kalyani
19
76
0
31 Jul 2017
The Multi-Armed Bandit Problem: An Efficient Non-Parametric Solution
H. Chan
25
14
0
24 Mar 2017