Syndicated Bandits: A Framework for Auto Tuning Hyper-parameters in Contextual Bandit Algorithms. Neural Information Processing Systems (NeurIPS), 2021 |
DORB: Dynamically Optimizing Multiple Rewards with Bandits. Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020 |
MAME: Model-Agnostic Meta-Exploration. Conference on Robot Learning (CoRL), 2019 |
Meta Dynamic Pricing: Transfer Learning Across Experiments. Management Science (MS), 2019 |