Adaptive Exploration in Linear Contextual Bandit


15 October 2019
Botao Hao
Tor Lattimore
Csaba Szepesvári

Papers citing "Adaptive Exploration in Linear Contextual Bandit"

47 / 47 papers shown
A Classification View on Meta Learning Bandits
Mirco Mutti
Jeongyeol Kwon
Shie Mannor
Aviv Tamar
28
0
0
06 Apr 2025
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System
J. Gornet
Yilin Mo
Bruno Sinopoli
30
0
0
04 Apr 2025
Sparse Nonparametric Contextual Bandits
Hamish Flynn
Julia Olkhovskaya
Paul Rognon-Vael
53
0
0
20 Mar 2025
Information maximization for a broad variety of multi-armed bandit games
Alex Barbier-Chebbah
Christian L. Vestergaard
Jean-Baptiste Masson
61
0
0
20 Mar 2025
Sequential Learning of the Pareto Front for Multi-objective Bandits
Elise Crépon
Aurélien Garivier
Wouter M. Koolen
47
1
0
29 Jan 2025
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
Zhuohua Li
Maoli Liu
Xiangxiang Dai
John C. S. Lui
36
0
0
03 Jan 2025
Optimal Batched Linear Bandits
Xuanfei Ren
Tianyuan Jin
Pan Xu
34
2
0
06 Jun 2024
On Bits and Bandits: Quantifying the Regret-Information Trade-off
Itai Shufaro
Nadav Merlis
Nir Weinberger
Shie Mannor
38
0
0
26 May 2024
A Functional Model Method for Nonconvex Nonsmooth Conditional Stochastic Optimization
Andrzej Ruszczyński
Shangzhe Yang
31
0
0
17 May 2024
$ε$-Policy Gradient for Online Pricing
Lukasz Szpruch
Tanut Treetanthiploet
Yufei Zhang
OffRL
60
1
0
06 May 2024
Differentially Private High Dimensional Bandits
Apurv Shukla
26
0
0
06 Feb 2024
Robustly Improving Bandit Algorithms with Confounded and Selection Biased Offline Data: A Causal Approach
Wen Huang
Xintao Wu
OffRL
CML
18
0
0
20 Dec 2023
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Yuwei Luo
Mohsen Bayati
21
1
0
26 Jun 2023
Federated Linear Contextual Bandits with User-level Differential Privacy
Ruiquan Huang
Huanyu Zhang
Luca Melis
Milan Shen
Meisam Hejazinia
J. Yang
FedML
23
11
0
08 Jun 2023
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
Weitong Zhang
Jiafan He
Zhiyuan Fan
Q. Gu
102
5
0
16 Mar 2023
Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications
Johannes Kirschner
Tor Lattimore
Andreas Krause
36
8
0
07 Feb 2023
Bounded (O(1)) Regret Recommendation Learning via Synthetic Controls Oracle
Hyunwook Kang
P. R. Kumar
OffRL
33
1
0
29 Jan 2023
On the Complexity of Representation Learning in Contextual Linear Bandits
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
32
1
0
19 Dec 2022
On the Sample Complexity of Representation Learning in Multi-task Bandits with Global and Local structure
Alessio Russo
Alexandre Proutière
28
2
0
28 Nov 2022
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Andrea Tirinzoni
Matteo Papini
Ahmed Touati
A. Lazaric
Matteo Pirotta
28
4
0
24 Oct 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee
Avishek Ghosh
Sayak Ray Chowdhury
Aditya Gopalan
27
9
0
23 Jul 2022
Instance-optimal PAC Algorithms for Contextual Bandits
Zhao Li
Lillian J. Ratliff
Houssam Nassif
Kevin G. Jamieson
Lalit P. Jain
12
17
0
05 Jul 2022
Asymptotic Instance-Optimal Algorithms for Interactive Decision Making
Kefan Dong
Tengyu Ma
18
9
0
06 Jun 2022
Residual Bootstrap Exploration for Stochastic Linear Bandit
Shuang Wu
ChiHua Wang
Yuantong Li
Guang Cheng
11
8
0
23 Feb 2022
Truncated LinUCB for Stochastic Linear Bandits
Yanglei Song
Meng Zhou
49
0
0
23 Feb 2022
Learning Optimal Antenna Tilt Control Policies: A Contextual Linear Bandit Approach
Filippo Vannella
Alexandre Proutière
Yassir Jedra
Jaeseong Jeong
23
7
0
06 Jan 2022
Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection
Matteo Papini
Andrea Tirinzoni
Aldo Pacchiano
Marcello Restelli
A. Lazaric
Matteo Pirotta
19
18
0
27 Oct 2021
Sequential Estimation under Multiple Resources: a Bandit Point of View
Alireza Masoumian
Shayan Kiyani
Mohammad Hossein Yassaee
31
1
0
29 Sep 2021
A Fully Problem-Dependent Regret Lower Bound for Finite-Horizon MDPs
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
13
16
0
24 Jun 2021
Fair Exploration via Axiomatic Bargaining
Jackie Baek
Vivek F. Farias
FaML
16
28
0
04 Jun 2021
Information Directed Sampling for Sparse Linear Bandits
Botao Hao
Tor Lattimore
Wei Deng
19
19
0
29 May 2021
Leveraging Good Representations in Linear Contextual Bandits
Matteo Papini
Andrea Tirinzoni
Marcello Restelli
A. Lazaric
Matteo Pirotta
30
26
0
08 Apr 2021
Achieving Near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously
Chung-Wei Lee
Haipeng Luo
Chen-Yu Wei
Mengxiao Zhang
Xiaojin Zhang
15
46
0
11 Feb 2021
Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature
Kefan Dong
Jiaqi Yang
Tengyu Ma
24
32
0
08 Feb 2021
Asymptotically Optimal Information-Directed Sampling
Johannes Kirschner
Tor Lattimore
Claire Vernade
Csaba Szepesvári
11
33
0
11 Nov 2020
High-Dimensional Sparse Linear Bandits
Botao Hao
Tor Lattimore
Mengdi Wang
6
60
0
08 Nov 2020
Experimental Design for Regret Minimization in Linear Bandits
Andrew Wagenmaker
Julian Katz-Samuels
Kevin G. Jamieson
9
15
0
01 Nov 2020
Regret in Online Recommendation Systems
Kaito Ariu
Narae Ryu
Seyoung Yun
Alexandre Proutière
9
6
0
23 Oct 2020
An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits
Andrea Tirinzoni
Matteo Pirotta
Marcello Restelli
A. Lazaric
14
34
0
23 Oct 2020
Thresholded Lasso Bandit
Kaito Ariu
Kenshi Abe
Alexandre Proutière
21
17
0
22 Oct 2020
Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective
Dylan J. Foster
Alexander Rakhlin
D. Simchi-Levi
Yunzong Xu
21
75
0
07 Oct 2020
Diversity-Preserving K-Armed Bandits, Revisited
Hédi Hadiji
Sébastien Gerchinovitz
Jean-Michel Loubes
Gilles Stoltz
26
2
0
05 Oct 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
15
10
0
15 Jun 2020
A Novel Confidence-Based Algorithm for Structured Bandits
Andrea Tirinzoni
A. Lazaric
Marcello Restelli
13
11
0
23 May 2020
Information Directed Sampling for Linear Partial Monitoring
Johannes Kirschner
Tor Lattimore
Andreas Krause
21
46
0
25 Feb 2020
The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms
Mohsen Bayati
N. Hamidi
Ramesh Johari
Khashayar Khosravi
37
28
0
24 Feb 2020
A General Theory of the Stochastic Linear Bandit and Its Applications
N. Hamidi
Mohsen Bayati
10
3
0
12 Feb 2020