Adaptive Exploration in Linear Contextual Bandit

15 October 2019

Papers citing "Adaptive Exploration in Linear Contextual Bandit"

47 / 47 papers shown

Title
A Classification View on Meta Learning Bandits Mirco Mutti Jeongyeol Kwon Shie Mannor Aviv Tamar 28 0 0 06 Apr 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System J. Gornet Yilin Mo Bruno Sinopoli 30 0 0 04 Apr 2025
Sparse Nonparametric Contextual Bandits Hamish Flynn Julia Olkhovskaya Paul Rognon-Vael 53 0 0 20 Mar 2025
Information maximization for a broad variety of multi-armed bandit games Alex Barbier-Chebbah Christian L. Vestergaard Jean-Baptiste Masson 61 0 0 20 Mar 2025
Sequential Learning of the Pareto Front for Multi-objective Bandits Elise Crépon Aurélien Garivier Wouter M. Koolen 47 1 0 29 Jan 2025
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts Zhuohua Li Maoli Liu Xiangxiang Dai John C. S. Lui 36 0 0 03 Jan 2025
Optimal Batched Linear Bandits Xuanfei Ren Tianyuan Jin Pan Xu 34 2 0 06 Jun 2024
On Bits and Bandits: Quantifying the Regret-Information Trade-off Itai Shufaro Nadav Merlis Nir Weinberger Shie Mannor 38 0 0 26 May 2024
A Functional Model Method for Nonconvex Nonsmooth Conditional Stochastic Optimization Andrzej Ruszczyñski Shangzhe Yang 31 0 0 17 May 2024
$ε$ -Policy Gradient for Online Pricing Lukasz Szpruch Tanut Treetanthiploet Yufei Zhang OffRL 60 1 0 06 May 2024
Differentially Private High Dimensional Bandits Apurv Shukla 26 0 0 06 Feb 2024
Robustly Improving Bandit Algorithms with Confounded and Selection Biased Offline Data: A Causal Approach Wen Huang Xintao Wu OffRL CML 18 0 0 20 Dec 2023
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Yuwei Luo Mohsen Bayati 21 1 0 26 Jun 2023
Federated Linear Contextual Bandits with User-level Differential Privacy Ruiquan Huang Huanyu Zhang Luca Melis Milan Shen Meisam Hajzinia J. Yang FedML 23 11 0 08 Jun 2023
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits Weitong Zhang Jiafan He Zhiyuan Fan Q. Gu 102 5 0 16 Mar 2023
Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications Johannes Kirschner Tor Lattimore Andreas Krause 36 8 0 07 Feb 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle Hyunwook Kang P. R. Kumar OffRL 33 1 0 29 Jan 2023
On the Complexity of Representation Learning in Contextual Linear Bandits Andrea Tirinzoni Matteo Pirotta A. Lazaric 32 1 0 19 Dec 2022
On the Sample Complexity of Representation Learning in Multi-task Bandits with Global and Local structure Alessio Russo Alexandre Proutière 28 2 0 28 Nov 2022
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees Andrea Tirinzoni Matteo Papini Ahmed Touati A. Lazaric Matteo Pirotta 28 4 0 24 Oct 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference Debangshu Banerjee Avishek Ghosh Sayak Ray Chowdhury Aditya Gopalan 27 9 0 23 Jul 2022
Instance-optimal PAC Algorithms for Contextual Bandits Zhao Li Lillian J. Ratliff Houssam Nassif Kevin G. Jamieson Lalit P. Jain 12 17 0 05 Jul 2022
Asymptotic Instance-Optimal Algorithms for Interactive Decision Making Kefan Dong Tengyu Ma 18 9 0 06 Jun 2022
Residual Bootstrap Exploration for Stochastic Linear Bandit Shuang Wu ChiHua Wang Yuantong Li Guang Cheng 11 8 0 23 Feb 2022
Truncated LinUCB for Stochastic Linear Bandits Yanglei Song Meng zhou 49 0 0 23 Feb 2022
Learning Optimal Antenna Tilt Control Policies: A Contextual Linear Bandit Approach Filippo Vannella Alexandre Proutière Yassir Jedra Jaeseong Jeong 23 7 0 06 Jan 2022
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection Matteo Papini Andrea Tirinzoni Aldo Pacchiano Marcello Restelli A. Lazaric Matteo Pirotta 19 18 0 27 Oct 2021
Sequential Estimation under Multiple Resources: a Bandit Point of View Alireza Masoumian Shayan Kiyani Mohammad Hossein Yassaee 31 1 0 29 Sep 2021
A Fully Problem-Dependent Regret Lower Bound for Finite-Horizon MDPs Andrea Tirinzoni Matteo Pirotta A. Lazaric 13 16 0 24 Jun 2021
Fair Exploration via Axiomatic Bargaining Jackie Baek Vivek F. Farias FaML 16 28 0 04 Jun 2021
Information Directed Sampling for Sparse Linear Bandits Botao Hao Tor Lattimore Wei Deng 19 19 0 29 May 2021
Leveraging Good Representations in Linear Contextual Bandits Matteo Papini Andrea Tirinzoni Marcello Restelli A. Lazaric Matteo Pirotta 30 26 0 08 Apr 2021
Achieving Near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously Chung-Wei Lee Haipeng Luo Chen-Yu Wei Mengxiao Zhang Xiaojin Zhang 15 46 0 11 Feb 2021
Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature Kefan Dong Jiaqi Yang Tengyu Ma 24 32 0 08 Feb 2021
Asymptotically Optimal Information-Directed Sampling Johannes Kirschner Tor Lattimore Claire Vernade Csaba Szepesvári 11 33 0 11 Nov 2020
High-Dimensional Sparse Linear Bandits Botao Hao Tor Lattimore Mengdi Wang 6 60 0 08 Nov 2020
Experimental Design for Regret Minimization in Linear Bandits Andrew Wagenmaker Julian Katz-Samuels Kevin G. Jamieson 9 15 0 01 Nov 2020
Regret in Online Recommendation Systems Kaito Ariu Narae Ryu Seyoung Yun Alexandre Proutière 9 6 0 23 Oct 2020
An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits Andrea Tirinzoni Matteo Pirotta Marcello Restelli A. Lazaric 14 34 0 23 Oct 2020
Thresholded Lasso Bandit Kaito Ariu Kenshi Abe Alexandre Proutière 21 17 0 22 Oct 2020
Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective Dylan J. Foster Alexander Rakhlin D. Simchi-Levi Yunzong Xu 21 75 0 07 Oct 2020
Diversity-Preserving K-Armed Bandits, Revisited Hédi Hadiji Sébastien Gerchinovitz Jean-Michel Loubes Gilles Stoltz 26 2 0 05 Oct 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality Kwang-Sung Jun Chicheng Zhang 15 10 0 15 Jun 2020
A Novel Confidence-Based Algorithm for Structured Bandits Andrea Tirinzoni A. Lazaric Marcello Restelli 13 11 0 23 May 2020
Information Directed Sampling for Linear Partial Monitoring Johannes Kirschner Tor Lattimore Andreas Krause 21 46 0 25 Feb 2020
The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms Mohsen Bayati N. Hamidi Ramesh Johari Khashayar Khosravi 37 28 0 24 Feb 2020
A General Theory of the Stochastic Linear Bandit and Its Applications N. Hamidi Mohsen Bayati 10 3 0 12 Feb 2020