Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.13045
Cited By
Regret Bound Balancing and Elimination for Model Selection in Bandits and RL
24 December 2020
Aldo Pacchiano
Christoph Dann
Claudio Gentile
Peter L. Bartlett
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Regret Bound Balancing and Elimination for Model Selection in Bandits and RL"
41 / 41 papers shown
Title
A Model Selection Approach for Corruption Robust Reinforcement Learning
Chen-Yu Wei
Christoph Dann
Julian Zimmert
193
45
0
31 Dec 2024
Model Selection for Average Reward RL with Application to Utility Maximization in Repeated Games
Alireza Masoumian
James R. Wright
142
1
0
09 Nov 2024
Bayesian Optimisation with Unknown Hyperparameters: Regret Bounds Logarithmically Closer to Optimal
Juliusz Ziomek
Masaki Adachi
Michael A. Osborne
93
1
0
14 Oct 2024
Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives
Aida Afshar
Aldo Pacchiano
62
0
0
07 Aug 2024
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals
Ziyi Liu
Idan Attias
Daniel M. Roy
CML
51
1
0
01 Jul 2024
Sparsity-Agnostic Linear Bandits with Adaptive Adversaries
Tianyuan Jin
Kyoungseok Jang
Nicolò Cesa-Bianchi
85
1
0
03 Jun 2024
Symmetric Linear Bandits with Hidden Symmetry
Nam-Phuong Tran
T. Ta
Debmalya Mandal
Long Tran-Thanh
109
0
0
22 May 2024
Experiment Planning with Function Approximation
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
70
4
0
10 Jan 2024
Multitask Learning with No Regret: from Improved Confidence Bounds to Active Learning
Pier Giuseppe Sessa
Pierre Laforgue
Nicolò Cesa-Bianchi
Andreas Krause
65
2
0
03 Aug 2023
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Yuwei Luo
Mohsen Bayati
58
1
0
26 Jun 2023
Data-Driven Online Model Selection With Regret Guarantees
Aldo Pacchiano
Christoph Dann
Claudio Gentile
OffRL
116
3
0
05 Jun 2023
Adaptation to Misspecified Kernel Regularity in Kernelised Bandits
Yusha Liu
Aarti Singh
65
2
0
26 Apr 2023
Data-Efficient Policy Selection for Navigation in Partial Maps via Subgoal-Based Abstraction
Abhishek Paudel
Gregory J. Stein
63
2
0
03 Apr 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits
Jonathan Lee
Weihao Kong
Aldo Pacchiano
Vidya Muthukumar
Emma Brunskill
49
0
0
19 Feb 2023
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits
Yue Kang
Cho-Jui Hsieh
T. C. Lee
61
1
0
18 Feb 2023
Stochastic Rising Bandits
Alberto Maria Metelli
F. Trovò
Matteo Pirola
Marcello Restelli
51
18
0
07 Dec 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham Kakade
Sergey Levine
OffRL
105
67
0
18 Oct 2022
Neural Design for Genetic Perturbation Experiments
Aldo Pacchiano
Drausin Wulsin
Robert A. Barton
L. Voloch
80
5
0
26 Jul 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee
Avishek Ghosh
Sayak Ray Chowdhury
Aditya Gopalan
68
10
0
23 Jul 2022
Model Selection in Reinforcement Learning with General Function Approximations
Avishek Ghosh
Sayak Ray Chowdhury
45
3
0
06 Jul 2022
Best of Both Worlds Model Selection
Aldo Pacchiano
Christoph Dann
Claudio Gentile
79
10
0
29 Jun 2022
Joint Representation Training in Sequential Tasks with Shared Structure
Aldo Pacchiano
Ofir Nachum
Nilseh Tripuraneni
Peter L. Bartlett
116
5
0
24 Jun 2022
Provable Benefits of Representational Transfer in Reinforcement Learning
Alekh Agarwal
Yuda Song
Wen Sun
Kaiwen Wang
Mengdi Wang
Xuezhou Zhang
OffRL
102
35
0
29 May 2022
Breaking the
T
\sqrt{T}
T
Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits
Avishek Ghosh
Abishek Sankararaman
52
4
0
19 May 2022
Neural Pseudo-Label Optimism for the Bank Loan Problem
Aldo Pacchiano
Shaun Singh
Edward Chou
Alexander C. Berg
Jakob N. Foerster
36
7
0
03 Dec 2021
Misspecified Gaussian Process Bandit Optimization
Ilija Bogunovic
Andreas Krause
86
45
0
09 Nov 2021
Universal and data-adaptive algorithms for model selection in linear contextual bandits
Vidya Muthukumar
A. Krishnamurthy
71
5
0
08 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
98
22
0
25 Oct 2021
Improved Algorithms for Misspecified Linear Markov Decision Processes
Daniel Vial
Advait Parulekar
Sanjay Shakkottai
R. Srikant
61
6
0
12 Sep 2021
Model Selection for Generic Reinforcement Learning
Avishek Ghosh
Sayak Ray Chowdhury
Kannan Ramchandran
47
1
0
13 Jul 2021
Model Selection for Generic Contextual Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
76
6
0
07 Jul 2021
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL
Weitong Zhang
Jiafan He
Dongruo Zhou
Amy Zhang
Quanquan Gu
OffRL
65
11
0
22 Jun 2021
Towards Costless Model Selection in Contextual Bandits: A Bias-Variance Perspective
Sanath Kumar Krishnamurthy
Adrienne Margaret Propp
Susan Athey
60
3
0
11 Jun 2021
Feature and Parameter Selection in Stochastic Linear Bandits
Ahmadreza Moradipari
Berkay Turan
Yasin Abbasi-Yadkori
M. Alizadeh
Mohammad Ghavamzadeh
151
5
0
09 Jun 2021
Neural Active Learning with Performance Guarantees
Pranjal Awasthi
Christoph Dann
Claudio Gentile
Ayush Sekhari
Zhilei Wang
56
22
0
06 Jun 2021
Leveraging Good Representations in Linear Contextual Bandits
Matteo Papini
Andrea Tirinzoni
Marcello Restelli
A. Lazaric
Matteo Pirotta
73
27
0
08 Apr 2021
Model-free Representation Learning and Exploration in Low-rank MDPs
Aditya Modi
Jinglin Chen
A. Krishnamurthy
Nan Jiang
Alekh Agarwal
OffRL
169
81
0
14 Feb 2021
Pareto Optimal Model Selection in Linear Bandits
Yinglun Zhu
Robert D. Nowak
43
14
0
12 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach
Chen-Yu Wei
Haipeng Luo
OffRL
183
107
0
10 Feb 2021
Tactical Optimism and Pessimism for Deep Reinforcement Learning
Theodore H. Moskovitz
Jack Parker-Holder
Aldo Pacchiano
Michael Arbel
Michael I. Jordan
92
59
0
07 Feb 2021
Model Selection in Contextual Stochastic Bandit Problems
Aldo Pacchiano
My Phan
Yasin Abbasi-Yadkori
Anup B. Rao
Julian Zimmert
Tor Lattimore
Csaba Szepesvári
203
94
0
03 Mar 2020
1