Experimental Design for Semiparametric Bandits. Annual Conference on Computational Learning Theory (COLT), 2025
A Doubly Robust Approach to Sparse Reinforcement Learning. International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Learning the Pareto Front Using Bootstrapped Observation Samples. International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback. International Conference on Machine Learning (ICML), 2023
Risk-aware linear bandits with convex loss. International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits. AAAI Conference on Artificial Intelligence (AAAI), 2022
Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual Bandits. International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits. Neural Information Processing Systems (NeurIPS), 2022