ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2003.01704
  4. Cited By
Model Selection in Contextual Stochastic Bandit Problems
v1v2v3 (latest)

Model Selection in Contextual Stochastic Bandit Problems

Neural Information Processing Systems (NeurIPS), 2020
3 March 2020
Aldo Pacchiano
My Phan
Yasin Abbasi-Yadkori
Anup B. Rao
Julian Zimmert
Tor Lattimore
Csaba Szepesvári
ArXiv (abs)PDFHTML

Papers citing "Model Selection in Contextual Stochastic Bandit Problems"

50 / 92 papers shown
Improved Training Mechanism for Reinforcement Learning via Online Model Selection
Improved Training Mechanism for Reinforcement Learning via Online Model Selection
Aida Afshar
Aldo Pacchiano
88
0
0
01 Dec 2025
UCB-type Algorithm for Budget-Constrained Expert Learning
UCB-type Algorithm for Budget-Constrained Expert Learning
Ilgam Latypov
A. Suvorikova
Alexey Kroshnin
Alexander Gasnikov
Yuriy Dorn
149
0
0
26 Oct 2025
Offline-to-online hyperparameter transfer for stochastic bandits
Offline-to-online hyperparameter transfer for stochastic banditsAAAI Conference on Artificial Intelligence (AAAI), 2025
Dravyansh Sharma
Arun Sai Suggala
OffRL
371
8
0
06 Jan 2025
A Model Selection Approach for Corruption Robust Reinforcement Learning
A Model Selection Approach for Corruption Robust Reinforcement LearningInternational Conference on Algorithmic Learning Theory (ALT), 2021
Chen-Yu Wei
Christoph Dann
Julian Zimmert
402
51
0
31 Dec 2024
State-free Reinforcement Learning
State-free Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Mingyu Chen
Aldo Pacchiano
Xuezhou Zhang
368
0
0
27 Sep 2024
Stochastic Bandits Robust to Adversarial Attacks
Stochastic Bandits Robust to Adversarial Attacks
Xuchuang Wang
Jinhang Zuo
Xutong Liu
John C. S. Lui
Mohammad Hajiesmaili
AAML
183
2
0
16 Aug 2024
Learning Rate-Free Reinforcement Learning: A Case for Model Selection
  with Non-Stationary Objectives
Learning Rate-Free Reinforcement Learning: A Case for Model Selection with Non-Stationary Objectives
Aida Afshar
Aldo Pacchiano
234
0
0
07 Aug 2024
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction
  to Linear Bandits, and Limitations around Unknown Marginals
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals
Ziyi Liu
Idan Attias
Daniel M. Roy
CML
255
2
0
01 Jul 2024
Efficient Sequential Decision Making with Large Language Models
Efficient Sequential Decision Making with Large Language Models
Dingyang Chen
Qi Zhang
Yinglun Zhu
LRM
464
12
0
17 Jun 2024
Bootstrapping Expectiles in Reinforcement Learning
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
330
1
0
06 Jun 2024
Sparsity-Agnostic Linear Bandits with Adaptive Adversaries
Sparsity-Agnostic Linear Bandits with Adaptive Adversaries
Tianyuan Jin
Kyoungseok Jang
Nicolò Cesa-Bianchi
273
1
0
03 Jun 2024
Online Bandits with (Biased) Offline Data: Adaptive Learning under Distribution Mismatch
Online Bandits with (Biased) Offline Data: Adaptive Learning under Distribution MismatchInternational Conference on Machine Learning (ICML), 2024
Wang Chi Cheung
Lixing Lyu
OffRL
477
12
0
04 May 2024
Neural Active Learning Beyond Bandits
Neural Active Learning Beyond Bandits
Yikun Ban
Ishika Agarwal
Ziwei Wu
Yada Zhu
Kommy Weldemariam
Hanghang Tong
Jingrui He
303
14
0
18 Apr 2024
The SMART approach to instance-optimal online learning
The SMART approach to instance-optimal online learning
Siddhartha Banerjee
Alankrita Bhatt
Chao Yu
238
0
0
27 Feb 2024
Understanding Model Selection For Learning In Strategic Environments
Understanding Model Selection For Learning In Strategic EnvironmentsNeural Information Processing Systems (NeurIPS), 2024
Tinashe Handina
Eric Mazumdar
272
2
0
12 Feb 2024
Budgeted Online Model Selection and Fine-Tuning via Federated Learning
Budgeted Online Model Selection and Fine-Tuning via Federated Learning
P. M. Ghari
Yanning Shen
FedML
373
2
0
19 Jan 2024
Experiment Planning with Function Approximation
Experiment Planning with Function ApproximationNeural Information Processing Systems (NeurIPS), 2024
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
239
6
0
10 Jan 2024
Best-of-Both-Worlds Algorithms for Linear Contextual Bandits
Best-of-Both-Worlds Algorithms for Linear Contextual Bandits
Yuko Kuroki
Alberto Rumi
Taira Tsuchiya
Fabio Vitale
Nicolò Cesa-Bianchi
338
13
0
24 Dec 2023
Online Clustering of Bandits with Misspecified User Models
Online Clustering of Bandits with Misspecified User ModelsNeural Information Processing Systems (NeurIPS), 2023
Zhiyong Wang
Jize Xie
Xutong Liu
Shuai Li
J. C. Lui
394
15
0
04 Oct 2023
Anytime Model Selection in Linear Bandits
Anytime Model Selection in Linear BanditsNeural Information Processing Systems (NeurIPS), 2023
Parnian Kassraie
N. Emmenegger
Andreas Krause
Aldo Pacchiano
407
7
0
24 Jul 2023
Active Policy Improvement from Multiple Black-box Oracles
Active Policy Improvement from Multiple Black-box OraclesInternational Conference on Machine Learning (ICML), 2023
Xuefeng Liu
Takuma Yoneda
Simon Mahns
Matthew R. Walter
Yuxin Chen
450
13
0
17 Jun 2023
Data-Driven Online Model Selection With Regret Guarantees
Data-Driven Online Model Selection With Regret GuaranteesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Aldo Pacchiano
Christoph Dann
Claudio Gentile
OffRL
453
11
0
05 Jun 2023
Robust Lipschitz Bandits to Adversarial Corruptions
Robust Lipschitz Bandits to Adversarial CorruptionsNeural Information Processing Systems (NeurIPS), 2023
Yue Kang
Cho-Jui Hsieh
T. C. Lee
AAML
283
15
0
29 May 2023
Adaptation to Misspecified Kernel Regularity in Kernelised Bandits
Adaptation to Misspecified Kernel Regularity in Kernelised BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yusha Liu
Aarti Singh
356
3
0
26 Apr 2023
Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive
  Regret Bounds
Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret BoundsAnnual Conference Computational Learning Theory (COLT), 2023
Shinji Ito
Kei Takemura
192
15
0
24 Feb 2023
Estimating Optimal Policy Value in General Linear Contextual Bandits
Estimating Optimal Policy Value in General Linear Contextual Bandits
Jonathan Lee
Weihao Kong
Aldo Pacchiano
Vidya Muthukumar
Emma Brunskill
283
0
0
19 Feb 2023
Linear Bandits with Memory: from Rotting to Rising
Linear Bandits with Memory: from Rotting to Rising
Giulia Clerici
Pierre Laforgue
Nicolò Cesa-Bianchi
254
3
0
16 Feb 2023
Learning Complementary Policies for Human-AI Teams
Learning Complementary Policies for Human-AI Teams
Ruijiang Gao
M. Saar-Tsechansky
Maria De-Arteaga
364
11
0
06 Feb 2023
On the Complexity of Representation Learning in Contextual Linear
  Bandits
On the Complexity of Representation Learning in Contextual Linear BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
258
1
0
19 Dec 2022
Stochastic Rising Bandits
Stochastic Rising BanditsInternational Conference on Machine Learning (ICML), 2022
Alberto Maria Metelli
F. Trovò
Matteo Pirola
Marcello Restelli
200
19
0
07 Dec 2022
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear
  Bandit Algorithms
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit AlgorithmsAnnual Conference Computational Learning Theory (COLT), 2022
Osama A. Hanna
Lin F. Yang
Christina Fragouli
380
20
0
08 Nov 2022
Oracle Inequalities for Model Selection in Offline Reinforcement
  Learning
Oracle Inequalities for Model Selection in Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
Emma Brunskill
OffRL
391
14
0
03 Nov 2022
Lifelong Bandit Optimization: No Prior and No Regret
Lifelong Bandit Optimization: No Prior and No RegretConference on Uncertainty in Artificial Intelligence (UAI), 2022
Felix Schur
Parnian Kassraie
Jonas Rothfuss
Andreas Krause
397
3
0
27 Oct 2022
Scalable Representation Learning in Linear Contextual Bandits with
  Constant Regret Guarantees
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret GuaranteesNeural Information Processing Systems (NeurIPS), 2022
Andrea Tirinzoni
Matteo Papini
Ahmed Touati
A. Lazaric
Matteo Pirotta
306
6
0
24 Oct 2022
Unpacking Reward Shaping: Understanding the Benefits of Reward
  Engineering on Sample Complexity
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample ComplexityNeural Information Processing Systems (NeurIPS), 2022
Abhishek Gupta
Aldo Pacchiano
Yuexiang Zhai
Sham Kakade
Sergey Levine
OffRL
261
100
0
18 Oct 2022
Unsupervised Model Selection for Time-series Anomaly Detection
Unsupervised Model Selection for Time-series Anomaly DetectionInternational Conference on Learning Representations (ICLR), 2022
Mononito Goswami
Cristian Challu
Laurent Callot
Lenon Minorics
Andrey Kan
OOD
499
40
0
03 Oct 2022
Neural Design for Genetic Perturbation Experiments
Neural Design for Genetic Perturbation ExperimentsInternational Conference on Learning Representations (ICLR), 2022
Aldo Pacchiano
Drausin Wulsin
Robert A. Barton
L. Voloch
327
7
0
26 Jul 2022
Model Selection in Reinforcement Learning with General Function
  Approximations
Model Selection in Reinforcement Learning with General Function Approximations
Avishek Ghosh
Sayak Ray Chowdhury
193
3
0
06 Jul 2022
Best of Both Worlds Model Selection
Best of Both Worlds Model SelectionNeural Information Processing Systems (NeurIPS), 2022
Aldo Pacchiano
Christoph Dann
Claudio Gentile
250
11
0
29 Jun 2022
Adversarial Bandits against Arbitrary Strategies
Adversarial Bandits against Arbitrary Strategies
Jung-hun Kim
Se-Young Yun
454
1
0
30 May 2022
Leveraging Initial Hints for Free in Stochastic Linear Bandits
Leveraging Initial Hints for Free in Stochastic Linear BanditsInternational Conference on Algorithmic Learning Theory (ALT), 2022
Ashok Cutkosky
Christoph Dann
Abhimanyu Das
Qiuyi
Qiuyi Zhang
184
6
0
08 Mar 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret
  for Linear Bandits
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear BanditsAnnual Conference Computational Learning Theory (COLT), 2022
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi Zhou
261
22
0
12 Feb 2022
Model Selection in Batch Policy Optimization
Model Selection in Batch Policy OptimizationInternational Conference on Machine Learning (ICML), 2021
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
OffRL
252
12
0
23 Dec 2021
Neural Pseudo-Label Optimism for the Bank Loan Problem
Neural Pseudo-Label Optimism for the Bank Loan ProblemNeural Information Processing Systems (NeurIPS), 2021
Aldo Pacchiano
Shaun Singh
Edward Chou
Alexander C. Berg
Jakob N. Foerster
180
8
0
03 Dec 2021
Misspecified Gaussian Process Bandit Optimization
Misspecified Gaussian Process Bandit OptimizationNeural Information Processing Systems (NeurIPS), 2021
Ilija Bogunovic
Andreas Krause
266
57
0
09 Nov 2021
Universal and data-adaptive algorithms for model selection in linear
  contextual bandits
Universal and data-adaptive algorithms for model selection in linear contextual bandits
Vidya Muthukumar
A. Krishnamurthy
316
5
0
08 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m
  Identification
Dealing With Misspecification In Fixed-Confidence Linear Top-m IdentificationNeural Information Processing Systems (NeurIPS), 2021
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
241
11
0
02 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
258
28
0
25 Oct 2021
Linear Contextual Bandits with Adversarial Corruptions
Linear Contextual Bandits with Adversarial Corruptions
Heyang Zhao
Dongruo Zhou
Quanquan Gu
AAML
257
25
0
25 Oct 2021
Distribution-free Contextual Dynamic Pricing
Distribution-free Contextual Dynamic Pricing
Yiyun Luo
W. Sun
Yufeng Liu
471
44
0
15 Sep 2021
12
Next
Page 1 of 2