ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.09255
  4. Cited By
Corralling Stochastic Bandit Algorithms
v1v2v3 (latest)

Corralling Stochastic Bandit Algorithms

16 June 2020
R. Arora
T. V. Marinov
M. Mohri
ArXiv (abs)PDFHTML

Papers citing "Corralling Stochastic Bandit Algorithms"

27 / 27 papers shown
Offline-to-online hyperparameter transfer for stochastic bandits
Offline-to-online hyperparameter transfer for stochastic banditsAAAI Conference on Artificial Intelligence (AAAI), 2025
Dravyansh Sharma
Arun Sai Suggala
OffRL
362
8
0
06 Jan 2025
A Model Selection Approach for Corruption Robust Reinforcement Learning
A Model Selection Approach for Corruption Robust Reinforcement LearningInternational Conference on Algorithmic Learning Theory (ALT), 2021
Chen-Yu Wei
Christoph Dann
Julian Zimmert
390
51
0
31 Dec 2024
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction
  to Linear Bandits, and Limitations around Unknown Marginals
Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals
Ziyi Liu
Idan Attias
Daniel M. Roy
CML
254
2
0
01 Jul 2024
Data-Driven Online Model Selection With Regret Guarantees
Data-Driven Online Model Selection With Regret GuaranteesInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Aldo Pacchiano
Christoph Dann
Claudio Gentile
OffRL
453
11
0
05 Jun 2023
Adaptation to Misspecified Kernel Regularity in Kernelised Bandits
Adaptation to Misspecified Kernel Regularity in Kernelised BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yusha Liu
Aarti Singh
356
3
0
26 Apr 2023
A Blackbox Approach to Best of Both Worlds in Bandits and Beyond
A Blackbox Approach to Best of Both Worlds in Bandits and BeyondAnnual Conference Computational Learning Theory (COLT), 2023
Christoph Dann
Chen-Yu Wei
Julian Zimmert
283
29
0
20 Feb 2023
Stochastic Rising Bandits
Stochastic Rising BanditsInternational Conference on Machine Learning (ICML), 2022
Alberto Maria Metelli
F. Trovò
Matteo Pirola
Marcello Restelli
199
19
0
07 Dec 2022
Model Selection in Reinforcement Learning with General Function
  Approximations
Model Selection in Reinforcement Learning with General Function Approximations
Avishek Ghosh
Sayak Ray Chowdhury
193
3
0
06 Jul 2022
Best of Both Worlds Model Selection
Best of Both Worlds Model SelectionNeural Information Processing Systems (NeurIPS), 2022
Aldo Pacchiano
Christoph Dann
Claudio Gentile
248
11
0
29 Jun 2022
Leveraging Initial Hints for Free in Stochastic Linear Bandits
Leveraging Initial Hints for Free in Stochastic Linear BanditsInternational Conference on Algorithmic Learning Theory (ALT), 2022
Ashok Cutkosky
Christoph Dann
Abhimanyu Das
Qiuyi
Qiuyi Zhang
184
6
0
08 Mar 2022
Corralling a Larger Band of Bandits: A Case Study on Switching Regret
  for Linear Bandits
Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear BanditsAnnual Conference Computational Learning Theory (COLT), 2022
Haipeng Luo
Mengxiao Zhang
Peng Zhao
Zhi Zhou
261
22
0
12 Feb 2022
Universal and data-adaptive algorithms for model selection in linear
  contextual bandits
Universal and data-adaptive algorithms for model selection in linear contextual bandits
Vidya Muthukumar
A. Krishnamurthy
313
5
0
08 Nov 2021
Decentralized Cooperative Reinforcement Learning with Hierarchical
  Information Structure
Decentralized Cooperative Reinforcement Learning with Hierarchical Information StructureInternational Conference on Algorithmic Learning Theory (ALT), 2021
Hsu Kao
Chen-Yu Wei
V. Subramanian
395
17
0
01 Nov 2021
The Pareto Frontier of model selection for general Contextual Bandits
The Pareto Frontier of model selection for general Contextual Bandits
T. V. Marinov
Julian Zimmert
254
28
0
25 Oct 2021
Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess
Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess
Gregory Clark
282
9
0
05 Oct 2021
Model Selection for Generic Reinforcement Learning
Model Selection for Generic Reinforcement Learning
Avishek Ghosh
Sayak Ray Chowdhury
Kannan Ramchandran
267
1
0
13 Jul 2021
Model Selection for Generic Contextual Bandits
Model Selection for Generic Contextual BanditsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2021
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
307
7
0
07 Jul 2021
Towards Costless Model Selection in Contextual Bandits: A Bias-Variance
  Perspective
Towards Costless Model Selection in Contextual Bandits: A Bias-Variance PerspectiveInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Sanath Kumar Krishnamurthy
Adrienne Margaret Propp
Susan Athey
336
5
0
11 Jun 2021
Thompson Sampling with a Mixture Prior
Thompson Sampling with a Mixture PriorInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Joey Hong
Branislav Kveton
Manzil Zaheer
Mohammad Ghavamzadeh
Craig Boutilier
339
16
0
10 Jun 2021
Leveraging Good Representations in Linear Contextual Bandits
Leveraging Good Representations in Linear Contextual BanditsInternational Conference on Machine Learning (ICML), 2021
Matteo Papini
Andrea Tirinzoni
Marcello Restelli
A. Lazaric
Matteo Pirotta
207
33
0
08 Apr 2021
Pareto Optimal Model Selection in Linear Bandits
Pareto Optimal Model Selection in Linear BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Yinglun Zhu
Robert D. Nowak
273
15
0
12 Feb 2021
Upper Confidence Bounds for Combining Stochastic Bandits
Upper Confidence Bounds for Combining Stochastic Bandits
Ashok Cutkosky
Abhimanyu Das
Manish Purohit
288
9
0
24 Dec 2020
Regret Bound Balancing and Elimination for Model Selection in Bandits
  and RL
Regret Bound Balancing and Elimination for Model Selection in Bandits and RL
Aldo Pacchiano
Christoph Dann
Claudio Gentile
Peter L. Bartlett
372
55
0
24 Dec 2020
Smooth Bandit Optimization: Generalization to Hölder Space
Smooth Bandit Optimization: Generalization to Hölder SpaceInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Yusha Liu
Yining Wang
Aarti Singh
253
15
0
11 Dec 2020
Online Model Selection for Reinforcement Learning with Function
  Approximation
Online Model Selection for Reinforcement Learning with Function ApproximationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Jonathan Lee
Aldo Pacchiano
Vidya Muthukumar
Weihao Kong
Emma Brunskill
OffRL
254
39
0
19 Nov 2020
Multitask Bandit Learning Through Heterogeneous Feedback Aggregation
Multitask Bandit Learning Through Heterogeneous Feedback AggregationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2020
Zhi Wang
Chicheng Zhang
Manish Singh
L. Riek
Kamalika Chaudhuri
465
26
0
29 Oct 2020
Model Selection in Contextual Stochastic Bandit Problems
Model Selection in Contextual Stochastic Bandit ProblemsNeural Information Processing Systems (NeurIPS), 2020
Aldo Pacchiano
My Phan
Yasin Abbasi-Yadkori
Anup B. Rao
Julian Zimmert
Tor Lattimore
Csaba Szepesvári
590
103
0
03 Mar 2020
1
Page 1 of 1