Regret Bound Balancing and Elimination for Model Selection in Bandits and RL

24 December 2020

Papers citing "Regret Bound Balancing and Elimination for Model Selection in Bandits and RL"

Title
A Model Selection Approach for Corruption Robust Reinforcement Learning Chen-Yu Wei Christoph Dann Julian Zimmert 193 45 0 31 Dec 2024
Model Selection for Average Reward RL with Application to Utility Maximization in Repeated Games Alireza Masoumian James R. Wright 142 1 0 09 Nov 2024
Bayesian Optimisation with Unknown Hyperparameters: Regret Bounds Logarithmically Closer to Optimal Juliusz Ziomek Masaki Adachi Michael A. Osborne 93 1 0 14 Oct 2024
Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives Aida Afshar Aldo Pacchiano 62 0 0 07 Aug 2024
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals Ziyi Liu Idan Attias Daniel M. Roy CML 51 1 0 01 Jul 2024
Sparsity-Agnostic Linear Bandits with Adaptive Adversaries Tianyuan Jin Kyoungseok Jang Nicolò Cesa-Bianchi 85 1 0 03 Jun 2024
Symmetric Linear Bandits with Hidden Symmetry Nam-Phuong Tran T. Ta Debmalya Mandal Long Tran-Thanh 109 0 0 22 May 2024
Experiment Planning with Function Approximation Aldo Pacchiano Jonathan Lee Emma Brunskill OffRL 70 4 0 10 Jan 2024
Multitask Learning with No Regret: from Improved Confidence Bounds to Active Learning Pier Giuseppe Sessa Pierre Laforgue Nicolò Cesa-Bianchi Andreas Krause 65 2 0 03 Aug 2023
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Yuwei Luo Mohsen Bayati 58 1 0 26 Jun 2023
Data-Driven Online Model Selection With Regret Guarantees Aldo Pacchiano Christoph Dann Claudio Gentile OffRL 116 3 0 05 Jun 2023
Adaptation to Misspecified Kernel Regularity in Kernelised Bandits Yusha Liu Aarti Singh 65 2 0 26 Apr 2023
Data-Efficient Policy Selection for Navigation in Partial Maps via Subgoal-Based Abstraction Abhishek Paudel Gregory J. Stein 63 2 0 03 Apr 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits Jonathan Lee Weihao Kong Aldo Pacchiano Vidya Muthukumar Emma Brunskill 49 0 0 19 Feb 2023
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits Yue Kang Cho-Jui Hsieh T. C. Lee 61 1 0 18 Feb 2023
Stochastic Rising Bandits Alberto Maria Metelli F. Trovò Matteo Pirola Marcello Restelli 51 18 0 07 Dec 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity Abhishek Gupta Aldo Pacchiano Yuexiang Zhai Sham Kakade Sergey Levine OffRL 105 67 0 18 Oct 2022
Neural Design for Genetic Perturbation Experiments Aldo Pacchiano Drausin Wulsin Robert A. Barton L. Voloch 80 5 0 26 Jul 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference Debangshu Banerjee Avishek Ghosh Sayak Ray Chowdhury Aditya Gopalan 68 10 0 23 Jul 2022
Model Selection in Reinforcement Learning with General Function Approximations Avishek Ghosh Sayak Ray Chowdhury 45 3 0 06 Jul 2022
Best of Both Worlds Model Selection Aldo Pacchiano Christoph Dann Claudio Gentile 79 10 0 29 Jun 2022
Joint Representation Training in Sequential Tasks with Shared Structure Aldo Pacchiano Ofir Nachum Nilesh Tripuraneni Peter L. Bartlett 116 5 0 24 Jun 2022
Provable Benefits of Representational Transfer in Reinforcement Learning Alekh Agarwal Yuda Song Wen Sun Kaiwen Wang Mengdi Wang Xuezhou Zhang OffRL 102 35 0 29 May 2022
Breaking the $\sqrt{T}$ Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits Avishek Ghosh Abishek Sankararaman 52 4 0 19 May 2022
Neural Pseudo-Label Optimism for the Bank Loan Problem Aldo Pacchiano Shaun Singh Edward Chou Alexander C. Berg Jakob N. Foerster 36 7 0 03 Dec 2021
Misspecified Gaussian Process Bandit Optimization Ilija Bogunovic Andreas Krause 86 45 0 09 Nov 2021
Universal and data-adaptive algorithms for model selection in linear contextual bandits Vidya Muthukumar A. Krishnamurthy 71 5 0 08 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits T. V. Marinov Julian Zimmert 98 22 0 25 Oct 2021
Improved Algorithms for Misspecified Linear Markov Decision Processes Daniel Vial Advait Parulekar Sanjay Shakkottai R. Srikant 61 6 0 12 Sep 2021
Model Selection for Generic Reinforcement Learning Avishek Ghosh Sayak Ray Chowdhury Kannan Ramchandran 47 1 0 13 Jul 2021
Model Selection for Generic Contextual Bandits Avishek Ghosh Abishek Sankararaman Kannan Ramchandran 76 6 0 07 Jul 2021
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL Weitong Zhang Jiafan He Dongruo Zhou Amy Zhang Quanquan Gu OffRL 65 11 0 22 Jun 2021
Towards Costless Model Selection in Contextual Bandits: A Bias-Variance Perspective Sanath Kumar Krishnamurthy Adrienne Margaret Propp Susan Athey 60 3 0 11 Jun 2021
Feature and Parameter Selection in Stochastic Linear Bandits Ahmadreza Moradipari Berkay Turan Yasin Abbasi-Yadkori M. Alizadeh Mohammad Ghavamzadeh 151 5 0 09 Jun 2021
Neural Active Learning with Performance Guarantees Pranjal Awasthi Christoph Dann Claudio Gentile Ayush Sekhari Zhilei Wang 56 22 0 06 Jun 2021
Leveraging Good Representations in Linear Contextual Bandits Matteo Papini Andrea Tirinzoni Marcello Restelli A. Lazaric Matteo Pirotta 73 27 0 08 Apr 2021
Model-free Representation Learning and Exploration in Low-rank MDPs Aditya Modi Jinglin Chen A. Krishnamurthy Nan Jiang Alekh Agarwal OffRL 169 81 0 14 Feb 2021
Pareto Optimal Model Selection in Linear Bandits Yinglun Zhu Robert D. Nowak 43 14 0 12 Feb 2021
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach Chen-Yu Wei Haipeng Luo OffRL 183 107 0 10 Feb 2021
Tactical Optimism and Pessimism for Deep Reinforcement Learning Theodore H. Moskovitz Jack Parker-Holder Aldo Pacchiano Michael Arbel Michael I. Jordan 92 59 0 07 Feb 2021
Model Selection in Contextual Stochastic Bandit Problems Aldo Pacchiano My Phan Yasin Abbasi-Yadkori Anup B. Rao Julian Zimmert Tor Lattimore Csaba Szepesvári 203 94 0 03 Mar 2020