Batch Ensemble for Variance Dependent Regret in Stochastic BanditsAAAI Conference on Artificial Intelligence (AAAI), 2024 |
Forced Exploration in Bandit ProblemsAAAI Conference on Artificial Intelligence (AAAI), 2023 |
Did we personalize? Assessing personalization by an online reinforcement
learning algorithm using resamplingMachine-mediated learning (ML), 2023 Susobhan Ghosh Raphael Kim Prasidh Chhabria Raaz Dwivedi Predrag Klasjna Peng Liao Kelly Zhang Susan Murphy |
Multiplier Bootstrap-based ExplorationInternational Conference on Machine Learning (ICML), 2023 |
Residual Bootstrap Exploration for Stochastic Linear BanditConference on Uncertainty in Artificial Intelligence (UAI), 2022 |
GuideBoot: Guided Bootstrap for Deep Contextual BanditsThe Web Conference (WWW), 2021 |
Sub-sampling for Efficient Non-Parametric Bandit ExplorationNeural Information Processing Systems (NeurIPS), 2020 |
BanditPAM: Almost Linear Time -Medoids Clustering via Multi-Armed
BanditsNeural Information Processing Systems (NeurIPS), 2020 |