ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.01980
  4. Cited By
Linear Bandits with Limited Adaptivity and Learning Distributional
  Optimal Design
v1v2v3 (latest)

Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design

4 July 2020
Yufei Ruan
Jiaqi Yang
Yuanshuo Zhou
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design"

41 / 41 papers shown
The Adaptivity Barrier in Batched Nonparametric Bandits: Sharp Characterization of the Price of Unknown Margin
The Adaptivity Barrier in Batched Nonparametric Bandits: Sharp Characterization of the Price of Unknown Margin
Rong Jiang
Cong Ma
197
0
0
05 Nov 2025
Robust Batched Bandits
Robust Batched Bandits
Yunwen Guo
Yunlun Shu
Gongyi Zhuo
Tianyu Wang
125
0
0
04 Oct 2025
Stochastic Matching Bandits with Rare Optimization Updates
Stochastic Matching Bandits with Rare Optimization Updates
Jung-hun Kim
Min-hwan Oh
201
0
0
04 Sep 2025
Achieving Limited Adaptivity for Multinomial Logistic Bandits
Achieving Limited Adaptivity for Multinomial Logistic Bandits
Sukruta Prakash Midigeshi
Tanmay Goyal
Gaurav Sinha
151
1
0
05 Aug 2025
Optimal and Practical Batched Linear Bandit Algorithm
Optimal and Practical Batched Linear Bandit Algorithm
Sanghoon Yu
Min-hwan Oh
356
1
0
11 Jul 2025
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function ApproximationConference on Uncertainty in Artificial Intelligence (UAI), 2025
Runze Zhao
Yue Yu
Adams Yiyue Zhu
Chen Yang
Dongruo Zhou
283
1
0
20 May 2025
Breaking the $\log(1/\Delta_2)$ Barrier: Better Batched Best Arm Identification with Adaptive Grids
Breaking the log⁡(1/Δ2)\log(1/\Delta_2)log(1/Δ2​) Barrier: Better Batched Best Arm Identification with Adaptive Grids
Tianyuan Jin
Qin Zhang
Dongruo Zhou
280
0
0
29 Jan 2025
Optimal Batched Linear Bandits
Optimal Batched Linear Bandits
Xuanfei Ren
Tianyuan Jin
Pan Xu
371
6
0
06 Jun 2024
Batched Stochastic Bandit for Nondegenerate Functions
Batched Stochastic Bandit for Nondegenerate FunctionsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2024
Yu Liu
Yunlu Shu
Tianyu Wang
607
2
0
09 May 2024
Generalized Linear Bandits with Limited Adaptivity
Generalized Linear Bandits with Limited AdaptivityNeural Information Processing Systems (NeurIPS), 2024
Ayush Sawarni
Nirjhar Das
Siddharth Barman
Gaurav Sinha
828
16
0
10 Apr 2024
Batched Nonparametric Contextual Bandits
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
529
4
0
27 Feb 2024
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity
  Constraints
Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
Dan Qiao
Yu Wang
OffRL
322
4
0
02 Feb 2024
Experiment Planning with Function Approximation
Experiment Planning with Function ApproximationNeural Information Processing Systems (NeurIPS), 2024
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
237
6
0
10 Jan 2024
Federated Linear Bandits with Finite Adversarial Actions
Federated Linear Bandits with Finite Adversarial ActionsNeural Information Processing Systems (NeurIPS), 2023
Li Fan
Ruida Zhou
Chao Tian
Cong Shen
FedML
373
3
0
02 Nov 2023
Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for
  Dimension-Dependent Adaptivity
Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent AdaptivityInternational Conference on Learning Representations (ICLR), 2023
Emmeran Johnson
Ciara Pike-Burke
Patrick Rebeschini
OffRL
371
2
0
02 Oct 2023
Policy Finetuning in Reinforcement Learning via Design of Experiments
  using Offline Data
Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline DataNeural Information Processing Systems (NeurIPS), 2023
Ruiqi Zhang
Andrea Zanette
OffRLOnRL
331
11
0
10 Jul 2023
From Random Search to Bandit Learning in Metric Measure Spaces
From Random Search to Bandit Learning in Metric Measure Spaces
Chuying Han
Yasong Feng
Tianyu Wang
488
3
0
19 May 2023
Cooperative Multi-Agent Reinforcement Learning: Asynchronous
  Communication and Linear Function Approximation
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function ApproximationInternational Conference on Machine Learning (ICML), 2023
Yifei Min
Jiafan He
Tianhao Wang
Quanquan Gu
397
13
0
10 May 2023
CO-BED: Information-Theoretic Contextual Optimization via Bayesian
  Experimental Design
CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental DesignInternational Conference on Machine Learning (ICML), 2023
Desi R. Ivanova
Joel Jennings
Tom Rainforth
Cheng Zhang
Adam Foster
351
4
0
27 Feb 2023
Active learning for data streams: a survey
Active learning for data streams: a surveyMachine-mediated learning (ML), 2023
Davide Cacciarelli
M. Kulahci
361
95
0
17 Feb 2023
A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization
A Lipschitz Bandits Approach for Continuous Hyperparameter Optimization
Yasong Feng
Weijian Luo
Yimin Huang
Tianyu Wang
410
10
0
03 Feb 2023
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear
  Bandit Algorithms
Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit AlgorithmsAnnual Conference Computational Learning Theory (COLT), 2022
Osama A. Hanna
Lin F. Yang
Christina Fragouli
380
20
0
08 Nov 2022
Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning
Near-Optimal Regret Bounds for Multi-batch Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Zihan Zhang
Yuhang Jiang
Yuanshuo Zhou
Xiangyang Ji
OffRL
257
14
0
15 Oct 2022
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning
  with Linear Function Approximation
Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function ApproximationInternational Conference on Learning Representations (ICLR), 2022
Dan Qiao
Yu Wang
OffRL
335
15
0
03 Oct 2022
Contextual Bandits with Large Action Spaces: Made Practical
Contextual Bandits with Large Action Spaces: Made PracticalInternational Conference on Machine Learning (ICML), 2022
Yinglun Zhu
Dylan J. Foster
John Langford
Paul Mineiro
376
34
0
12 Jul 2022
A Simple and Provably Efficient Algorithm for Asynchronous Federated
  Contextual Linear Bandits
A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear BanditsNeural Information Processing Systems (NeurIPS), 2022
Jiafan He
Tianhao Wang
Yifei Min
Quanquan Gu
FedML
313
41
0
07 Jul 2022
Distributed Contextual Linear Bandits with Minimax Optimal Communication
  Cost
Distributed Contextual Linear Bandits with Minimax Optimal Communication CostInternational Conference on Machine Learning (ICML), 2022
Sanae Amani
Tor Lattimore
András Gyorgy
Lin F. Yang
FedML
241
14
0
26 May 2022
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and
  Optimality
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and OptimalityInternational Conference on Learning Representations (ICLR), 2022
Jiawei Huang
Jinglin Chen
Li Zhao
Tao Qin
Nan Jiang
Tie-Yan Liu
OffRL
352
29
0
14 Feb 2022
A Benchmark for Low-Switching-Cost Reinforcement Learning
A Benchmark for Low-Switching-Cost Reinforcement Learning
Shusheng Xu
Yancheng Liang
Yunfei Li
S. Du
Yi Wu
OffRL
179
0
0
13 Dec 2021
Lipschitz Bandits with Batched Feedback
Lipschitz Bandits with Batched Feedback
Yasong Feng
Zengfeng Huang
Tianyu Wang
470
18
0
19 Oct 2021
Batched Thompson Sampling
Batched Thompson Sampling
Cem Kalkanli
Ayfer Özgür
OffRL
283
24
0
01 Oct 2021
Design of Experiments for Stochastic Contextual Linear Bandits
Design of Experiments for Stochastic Contextual Linear BanditsNeural Information Processing Systems (NeurIPS), 2021
Andrea Zanette
Kefan Dong
Jonathan Lee
Emma Brunskill
OffRL
225
20
0
21 Jul 2021
Parallelizing Thompson Sampling
Parallelizing Thompson SamplingNeural Information Processing Systems (NeurIPS), 2021
Amin Karbasi
Vahab Mirrokni
M. Shadravan
233
28
0
02 Jun 2021
Parallelizing Contextual Bandits
Parallelizing Contextual Bandits
Jeffrey Chan
Aldo Pacchiano
Nilesh Tripuraneni
Yun S. Song
Peter L. Bartlett
Michael I. Jordan
290
3
0
21 May 2021
An Exponential Lower Bound for Linearly-Realizable MDPs with Constant
  Suboptimality Gap
An Exponential Lower Bound for Linearly-Realizable MDPs with Constant Suboptimality GapNeural Information Processing Systems (NeurIPS), 2021
Yuanhao Wang
Ruosong Wang
Sham Kakade
OffRL
454
48
0
23 Mar 2021
Online Convex Optimization with Continuous Switching Constraint
Online Convex Optimization with Continuous Switching ConstraintNeural Information Processing Systems (NeurIPS), 2021
Guanghui Wang
Yuanyu Wan
Tianbao Yang
Lijun Zhang
176
11
0
21 Mar 2021
Encrypted Linear Contextual Bandit
Encrypted Linear Contextual BanditInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Evrard Garcelon
Vianney Perchet
Matteo Pirotta
FedML
234
2
0
17 Mar 2021
Batched Neural Bandits
Batched Neural BanditsACM / IMS Journal of Data Science (JIDS), 2021
Quanquan Gu
Amin Karbasi
Khashayar Khosravi
Vahab Mirrokni
Dongruo Zhou
BDLOffRL
140
28
0
25 Feb 2021
Provably Efficient Reinforcement Learning with Linear Function
  Approximation Under Adaptivity Constraints
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity ConstraintsNeural Information Processing Systems (NeurIPS), 2021
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
559
169
0
06 Jan 2021
Impact of Representation Learning in Linear Bandits
Impact of Representation Learning in Linear Bandits
Jiaqi Yang
Wei Hu
Jason D. Lee
S. Du
328
57
0
13 Oct 2020
Double Explore-then-Commit: Asymptotic Optimality and Beyond
Double Explore-then-Commit: Asymptotic Optimality and BeyondAnnual Conference Computational Learning Theory (COLT), 2020
Tianyuan Jin
Pan Xu
Xiaokui Xiao
Quanquan Gu
224
31
0
21 Feb 2020
1
Page 1 of 1