v1v2v3 (latest)

Model Selection in Contextual Stochastic Bandit Problems

Neural Information Processing Systems (NeurIPS), 2020

3 March 2020

Papers citing "Model Selection in Contextual Stochastic Bandit Problems"

50 / 92 papers shown

Improved Training Mechanism for Reinforcement Learning via Online Model Selection

Aida Afshar

Aldo Pacchiano

01 Dec 2025

UCB-type Algorithm for Budget-Constrained Expert Learning

149

26 Oct 2025

Offline-to-online hyperparameter transfer for stochastic banditsAAAI Conference on Artificial Intelligence (AAAI), 2025

Dravyansh Sharma

Arun Sai Suggala

OffRL

371

06 Jan 2025

A Model Selection Approach for Corruption Robust Reinforcement LearningInternational Conference on Algorithmic Learning Theory (ALT), 2021

Chen-Yu Wei

Christoph Dann

Julian Zimmert

402

31 Dec 2024

State-free Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024

Mingyu Chen

Aldo Pacchiano

Xuezhou Zhang

368

27 Sep 2024

Stochastic Bandits Robust to Adversarial Attacks

John C. S. Lui

183

16 Aug 2024

Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives

Aida Afshar

Aldo Pacchiano

234

07 Aug 2024

Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals

255

01 Jul 2024

Efficient Sequential Decision Making with Large Language Models

464

17 Jun 2024

Bootstrapping Expectiles in Reinforcement Learning

Matthieu Geist

330

06 Jun 2024

Sparsity-Agnostic Linear Bandits with Adaptive Adversaries

Tianyuan Jin

Kyoungseok Jang

Nicolò Cesa-Bianchi

273

03 Jun 2024

Online Bandits with (Biased) Offline Data: Adaptive Learning under Distribution MismatchInternational Conference on Machine Learning (ICML), 2024

Wang Chi Cheung

Lixing Lyu

OffRL

477

04 May 2024

Neural Active Learning Beyond Bandits

Ziwei Wu

303

18 Apr 2024

The SMART approach to instance-optimal online learning

Siddhartha Banerjee

Alankrita Bhatt

Chao Yu

238

27 Feb 2024

Understanding Model Selection For Learning In Strategic EnvironmentsNeural Information Processing Systems (NeurIPS), 2024

Tinashe Handina

Eric Mazumdar

272

12 Feb 2024

Budgeted Online Model Selection and Fine-Tuning via Federated Learning

P. M. Ghari

Yanning Shen

FedML

373

19 Jan 2024

Experiment Planning with Function ApproximationNeural Information Processing Systems (NeurIPS), 2024

239

10 Jan 2024

Best-of-Both-Worlds Algorithms for Linear Contextual Bandits

Fabio Vitale

338

24 Dec 2023

Online Clustering of Bandits with Misspecified User ModelsNeural Information Processing Systems (NeurIPS), 2023

Shuai Li

394

04 Oct 2023

Anytime Model Selection in Linear BanditsNeural Information Processing Systems (NeurIPS), 2023

407

24 Jul 2023

Active Policy Improvement from Multiple Black-box OraclesInternational Conference on Machine Learning (ICML), 2023

450

17 Jun 2023

Data-Driven Online Model Selection With Regret GuaranteesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

453

05 Jun 2023

Robust Lipschitz Bandits to Adversarial CorruptionsNeural Information Processing Systems (NeurIPS), 2023

283

29 May 2023

Adaptation to Misspecified Kernel Regularity in Kernelised BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023

Yusha Liu

Aarti Singh

356

26 Apr 2023

Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret BoundsAnnual Conference Computational Learning Theory (COLT), 2023

Shinji Ito

Kei Takemura

192

24 Feb 2023

Estimating Optimal Policy Value in General Linear Contextual Bandits

283

19 Feb 2023

Linear Bandits with Memory: from Rotting to Rising

Giulia Clerici

Pierre Laforgue

Nicolò Cesa-Bianchi

254

16 Feb 2023

Learning Complementary Policies for Human-AI Teams

Ruijiang Gao

M. Saar-Tsechansky

Maria De-Arteaga

364

06 Feb 2023

On the Complexity of Representation Learning in Contextual Linear BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022

Andrea Tirinzoni

Matteo Pirotta

A. Lazaric

258

19 Dec 2022

Stochastic Rising BanditsInternational Conference on Machine Learning (ICML), 2022

Alberto Maria Metelli

F. Trovò

Matteo Pirola

Marcello Restelli

200

07 Dec 2022

Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit AlgorithmsAnnual Conference Computational Learning Theory (COLT), 2022

Osama A. Hanna

Lin F. Yang

Christina Fragouli

380

08 Nov 2022

Oracle Inequalities for Model Selection in Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

391

03 Nov 2022

Lifelong Bandit Optimization: No Prior and No RegretConference on Uncertainty in Artificial Intelligence (UAI), 2022

397

27 Oct 2022

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret GuaranteesNeural Information Processing Systems (NeurIPS), 2022

306

24 Oct 2022

Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample ComplexityNeural Information Processing Systems (NeurIPS), 2022

Abhishek Gupta

261

100

18 Oct 2022

Unsupervised Model Selection for Time-series Anomaly DetectionInternational Conference on Learning Representations (ICLR), 2022

Cristian Challu

499

03 Oct 2022

Neural Design for Genetic Perturbation ExperimentsInternational Conference on Learning Representations (ICLR), 2022

327

26 Jul 2022

Model Selection in Reinforcement Learning with General Function Approximations

Avishek Ghosh

Sayak Ray Chowdhury

193

06 Jul 2022

Best of Both Worlds Model SelectionNeural Information Processing Systems (NeurIPS), 2022

Aldo Pacchiano

Christoph Dann

Claudio Gentile

250

29 Jun 2022

Adversarial Bandits against Arbitrary Strategies

Jung-hun Kim

Se-Young Yun

454

30 May 2022

Leveraging Initial Hints for Free in Stochastic Linear BanditsInternational Conference on Algorithmic Learning Theory (ALT), 2022

184

08 Mar 2022

Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear BanditsAnnual Conference Computational Learning Theory (COLT), 2022

261

12 Feb 2022

Model Selection in Batch Policy OptimizationInternational Conference on Machine Learning (ICML), 2021

252

23 Dec 2021

Neural Pseudo-Label Optimism for the Bank Loan ProblemNeural Information Processing Systems (NeurIPS), 2021

180

03 Dec 2021

Misspecified Gaussian Process Bandit OptimizationNeural Information Processing Systems (NeurIPS), 2021

Ilija Bogunovic

Andreas Krause

266

09 Nov 2021

Universal and data-adaptive algorithms for model selection in linear contextual bandits

Vidya Muthukumar

A. Krishnamurthy

316

08 Nov 2021

Dealing With Misspecification In Fixed-Confidence Linear Top-m IdentificationNeural Information Processing Systems (NeurIPS), 2021

Clémence Réda

Andrea Tirinzoni

Rémy Degenne

241

02 Nov 2021

The Pareto Frontier of model selection for general Contextual Bandits

T. V. Marinov

Julian Zimmert

258

25 Oct 2021

Linear Contextual Bandits with Adversarial Corruptions

Heyang Zhao

Dongruo Zhou

Quanquan Gu

AAML

257

25 Oct 2021

Distribution-free Contextual Dynamic Pricing

Yiyun Luo

W. Sun

Yufeng Liu

471

15 Sep 2021