v1v2v3 (latest)

Corralling Stochastic Bandit Algorithms

16 June 2020

Papers citing "Corralling Stochastic Bandit Algorithms"

27 / 27 papers shown

Title
Offline-to-online hyperparameter transfer for stochastic bandits Dravyansh Sharma Arun Sai Suggala OffRL 103 4 0 06 Jan 2025
A Model Selection Approach for Corruption Robust Reinforcement Learning Chen-Yu Wei Christoph Dann Julian Zimmert 193 45 0 31 Dec 2024
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals Ziyi Liu Idan Attias Daniel M. Roy CML 51 1 0 01 Jul 2024
Data-Driven Online Model Selection With Regret Guarantees Aldo Pacchiano Christoph Dann Claudio Gentile OffRL 113 3 0 05 Jun 2023
Adaptation to Misspecified Kernel Regularity in Kernelised Bandits Yusha Liu Aarti Singh 65 2 0 26 Apr 2023
A Blackbox Approach to Best of Both Worlds in Bandits and Beyond Christoph Dann Chen-Yu Wei Julian Zimmert 73 24 0 20 Feb 2023
Stochastic Rising Bandits Alberto Maria Metelli F. Trovò Matteo Pirola Marcello Restelli 51 18 0 07 Dec 2022
Model Selection in Reinforcement Learning with General Function Approximations Avishek Ghosh Sayak Ray Chowdhury 45 3 0 06 Jul 2022
Best of Both Worlds Model Selection Aldo Pacchiano Christoph Dann Claudio Gentile 79 10 0 29 Jun 2022
Leveraging Initial Hints for Free in Stochastic Linear Bandits Ashok Cutkosky Christoph Dann Abhimanyu Das Qiuyi Qiuyi Zhang 51 5 0 08 Mar 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits Haipeng Luo Mengxiao Zhang Peng Zhao Zhi Zhou 84 17 0 12 Feb 2022
Universal and data-adaptive algorithms for model selection in linear contextual bandits Vidya Muthukumar A. Krishnamurthy 71 5 0 08 Nov 2021
Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure Hsu Kao Chen-Yu Wei V. Subramanian 123 12 0 01 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits T. V. Marinov Julian Zimmert 98 22 0 25 Oct 2021
Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess Gregory Clark 84 9 0 05 Oct 2021
Model Selection for Generic Reinforcement Learning Avishek Ghosh Sayak Ray Chowdhury Kannan Ramchandran 47 1 0 13 Jul 2021
Model Selection for Generic Contextual Bandits Avishek Ghosh Abishek Sankararaman Kannan Ramchandran 73 6 0 07 Jul 2021
Towards Costless Model Selection in Contextual Bandits: A Bias-Variance Perspective Sanath Kumar Krishnamurthy Adrienne Margaret Propp Susan Athey 60 3 0 11 Jun 2021
Thompson Sampling with a Mixture Prior Joey Hong Branislav Kveton Manzil Zaheer Mohammad Ghavamzadeh Craig Boutilier 50 12 0 10 Jun 2021
Leveraging Good Representations in Linear Contextual Bandits Matteo Papini Andrea Tirinzoni Marcello Restelli A. Lazaric Matteo Pirotta 73 27 0 08 Apr 2021
Pareto Optimal Model Selection in Linear Bandits Yinglun Zhu Robert D. Nowak 43 14 0 12 Feb 2021
Upper Confidence Bounds for Combining Stochastic Bandits Ashok Cutkosky Abhimanyu Das Manish Purohit 41 9 0 24 Dec 2020
Regret Bound Balancing and Elimination for Model Selection in Bandits and RL Aldo Pacchiano Christoph Dann Claudio Gentile Peter L. Bartlett 90 49 0 24 Dec 2020
Smooth Bandit Optimization: Generalization to Hölder Space Yusha Liu Yining Wang Aarti Singh 60 10 0 11 Dec 2020
Online Model Selection for Reinforcement Learning with Function Approximation Jonathan Lee Aldo Pacchiano Vidya Muthukumar Weihao Kong Emma Brunskill OffRL 52 37 0 19 Nov 2020
Multitask Bandit Learning Through Heterogeneous Feedback Aggregation Zhi Wang Chicheng Zhang Manish Singh L. Riek Kamalika Chaudhuri 111 23 0 29 Oct 2020
Model Selection in Contextual Stochastic Bandit Problems Aldo Pacchiano My Phan Yasin Abbasi-Yadkori Anup B. Rao Julian Zimmert Tor Lattimore Csaba Szepesvári 203 94 0 03 Mar 2020