v1v2v3 (latest)

Corralling Stochastic Bandit Algorithms

16 June 2020

Papers citing "Corralling Stochastic Bandit Algorithms"

27 / 27 papers shown

Offline-to-online hyperparameter transfer for stochastic banditsAAAI Conference on Artificial Intelligence (AAAI), 2025

Dravyansh Sharma

Arun Sai Suggala

OffRL

362

06 Jan 2025

A Model Selection Approach for Corruption Robust Reinforcement LearningInternational Conference on Algorithmic Learning Theory (ALT), 2021

Chen-Yu Wei

Christoph Dann

Julian Zimmert

390

31 Dec 2024

Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals

254

01 Jul 2024

Data-Driven Online Model Selection With Regret GuaranteesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

453

05 Jun 2023

Adaptation to Misspecified Kernel Regularity in Kernelised BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Yusha Liu

Aarti Singh

356

26 Apr 2023

A Blackbox Approach to Best of Both Worlds in Bandits and BeyondAnnual Conference Computational Learning Theory (COLT), 2023

Christoph Dann

Chen-Yu Wei

Julian Zimmert

283

20 Feb 2023

Stochastic Rising BanditsInternational Conference on Machine Learning (ICML), 2022

Alberto Maria Metelli

F. Trovò

Matteo Pirola

Marcello Restelli

199

07 Dec 2022

Model Selection in Reinforcement Learning with General Function Approximations

Avishek Ghosh

Sayak Ray Chowdhury

193

06 Jul 2022

Best of Both Worlds Model SelectionNeural Information Processing Systems (NeurIPS), 2022

Aldo Pacchiano

Christoph Dann

Claudio Gentile

248

29 Jun 2022

Leveraging Initial Hints for Free in Stochastic Linear BanditsInternational Conference on Algorithmic Learning Theory (ALT), 2022

184

08 Mar 2022

Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear BanditsAnnual Conference Computational Learning Theory (COLT), 2022

261

12 Feb 2022

Universal and data-adaptive algorithms for model selection in linear contextual bandits

Vidya Muthukumar

A. Krishnamurthy

313

08 Nov 2021

Decentralized Cooperative Reinforcement Learning with Hierarchical Information StructureInternational Conference on Algorithmic Learning Theory (ALT), 2021

Hsu Kao

Chen-Yu Wei

V. Subramanian

395

01 Nov 2021

The Pareto Frontier of model selection for general Contextual Bandits

T. V. Marinov

Julian Zimmert

254

25 Oct 2021

Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess

Gregory Clark

282

05 Oct 2021

Model Selection for Generic Reinforcement Learning

Avishek Ghosh

Sayak Ray Chowdhury

Kannan Ramchandran

267

13 Jul 2021

Model Selection for Generic Contextual BanditsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2021

Avishek Ghosh

Abishek Sankararaman

Kannan Ramchandran

307

07 Jul 2021

Towards Costless Model Selection in Contextual Bandits: A Bias-Variance PerspectiveInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021

Sanath Kumar Krishnamurthy

Adrienne Margaret Propp

Susan Athey

336

11 Jun 2021

Thompson Sampling with a Mixture PriorInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021

339

10 Jun 2021

Leveraging Good Representations in Linear Contextual BanditsInternational Conference on Machine Learning (ICML), 2021

207

08 Apr 2021

Pareto Optimal Model Selection in Linear BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021

Yinglun Zhu

Robert D. Nowak

273

12 Feb 2021

Upper Confidence Bounds for Combining Stochastic Bandits

Ashok Cutkosky

Abhimanyu Das

Manish Purohit

288

24 Dec 2020

Regret Bound Balancing and Elimination for Model Selection in Bandits and RL

372

24 Dec 2020

Smooth Bandit Optimization: Generalization to Hölder SpaceInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020

Yusha Liu

Yining Wang

Aarti Singh

253

11 Dec 2020

Online Model Selection for Reinforcement Learning with Function ApproximationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020

254

19 Nov 2020

Multitask Bandit Learning Through Heterogeneous Feedback AggregationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020

465

29 Oct 2020

Model Selection in Contextual Stochastic Bandit ProblemsNeural Information Processing Systems (NeurIPS), 2020

590

103

03 Mar 2020