arXiv: 2007.00953
Gamification of Pure Exploration for Linear Bandits

2 July 2020
Rémy Degenne
Pierre Ménard
Xuedong Shang
Michal Valko

Papers citing "Gamification of Pure Exploration for Linear Bandits"

50 / 64 papers shown
Challenger-Based Combinatorial Bandits for Subcarrier Selection in OFDM Systems
Mohsen Amiri
V Venktesh
Sindri Magnússon
93
0
0
06 Oct 2025
Pure Exploration via Frank-Wolfe Self-Play
Xinyu Liu
Chao Qin
Wei You
197
0
0
24 Sep 2025
FraPPE: Fast and Efficient Preference-based Pure Exploration
Udvas Das
Apurv Shukla
Debabrota Basu
234
1
0
22 Aug 2025
Experimental Design for Semiparametric Bandits
Annual Conference Computational Learning Theory (COLT), 2025
Seok-Jin Kim
Gi-Soo Kim
Min-hwan Oh
248
1
0
16 Jun 2025
Adapting to Heterophilic Graph Data with Structure-Guided Neighbor Discovery
Victor M. Tenorio
Madeline Navarro
Samuel Rey
Santiago Segarra
Antonio G. Marques
176
1
0
10 Jun 2025
Sample Efficient Demonstration Selection for In-Context Learning
Kiran Purohit
Venktesh V
Sourangshu Bhattacharya
Avishek Anand
208
8
0
10 Jun 2025
Pure Exploration with Infinite Answers
Riccardo Poiani
Martino Bernasconi
A. Celli
211
2
0
28 May 2025
On the Problem of Best Arm Retention
Theoretical Computer Science (TCS), 2025
Houshuang Chen
Yuchen He
Chihao Zhang
250
0
0
16 Apr 2025
Sequential Learning of the Pareto Front for Multi-objective Bandits
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Elise Crépon
Aurélien Garivier
Wouter M. Koolen
183
10
0
29 Jan 2025
Enhancing Preference-based Linear Bandits via Human Response Time
Neural Information Processing Systems (NeurIPS), 2024
Shen Li
Yuyang Zhang
Tongzheng Ren
Claire Liang
Na Li
J. Shah
560
3
0
03 Jan 2025
Near Optimal Pure Exploration in Logistic Bandits
Eduardo Ochoa Rivera
Ambuj Tewari
449
1
0
28 Oct 2024
Optimal Design for Reward Modeling in RLHF
Antoine Scheid
Etienne Boursier
Alain Durmus
Michael I. Jordan
Pierre Ménard
Eric Moulines
Michal Valko
OffRL
559
21
0
22 Oct 2024
Linear Submodular Maximization with Bandit Feedback
Wenjing Chen
Victoria G. Crawford
175
3
0
02 Jul 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert D. Nowak
742
7
0
07 Jun 2024
Regret Minimization via Saddle Point Optimization
Neural Information Processing Systems (NeurIPS), 2024
Johannes Kirschner
Seyed Alireza Bakhtiari
Kushagra Chandak
Volodymyr Tkachuk
Csaba Szepesvári
240
2
0
15 Mar 2024
LinearAPT: An Adaptive Algorithm for the Fixed-Budget Thresholding Linear Bandit Problem
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2024
Yun-Ang Wu
Yun-Da Tsai
Shou-De Lin
274
2
0
10 Mar 2024
Optimal Thresholding Linear Bandit
Eduardo Ochoa Rivera
Ambuj Tewari
226
0
0
11 Feb 2024
Differentially Private High Dimensional Bandits
Apurv Shukla
250
0
0
06 Feb 2024
Adaptive Experiment Design with Synthetic Controls
Alihan Huyuk
Zhaozhi Qian
M. Schaar
281
3
0
30 Jan 2024
Experiment Planning with Function Approximation
Neural Information Processing Systems (NeurIPS), 2024
Aldo Pacchiano
Jonathan Lee
Emma Brunskill
OffRL
239
6
0
10 Jan 2024
Robust Best-arm Identification in Linear Bandits
Wei Wang
Sattar Vakili
Ilija Bogunovic
245
0
0
08 Nov 2023
Optimal Batched Best Arm Identification
Neural Information Processing Systems (NeurIPS), 2023
Tianyuan Jin
Yu Yang
Jing Tang
Xiaokui Xiao
Pan Xu
318
7
0
21 Oct 2023
Pure Exploration in Asynchronous Federated Bandits
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Zichen Wang
Chuanhao Li
Chenyu Song
Lianghui Wang
Quanquan Gu
Huazheng Wang
FedML
333
6
0
17 Oct 2023
Optimal Exploration is no harder than Thompson Sampling
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Zhaoqi Li
Kevin Jamieson
Lalit P. Jain
509
4
0
09 Oct 2023
Experimental Designs for Heteroskedastic Variance
Neural Information Processing Systems (NeurIPS), 2023
Justin Weltz
Tanner Fiez
Alex Volfovsky
Eric B. Laber
Blake Mason
Houssam Nassif
Lalit P. Jain
359
9
0
06 Oct 2023
Price of Safety in Linear Best Arm Identification
Xuedong Shang
Igor Colin
M. Barlier
Hamza Cherkaoui
LLMSV
309
5
0
15 Sep 2023
Pure Exploration under Mediators' Feedback
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
264
1
0
29 Aug 2023
Certified Multi-Fidelity Zeroth-Order Optimization
Étienne de Montbrun
Sébastien Gerchinovitz
294
2
0
02 Aug 2023
A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Zhihan Xiong
Romain Camilleri
Maryam Fazel
Lalit P. Jain
Kevin Jamieson
377
2
0
27 Jul 2023
Pure Exploration in Bandits with Linear Constraints
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Emil Carlsson
Debabrota Basu
Fredrik D. Johansson
Devdatt Dubhashi
377
12
0
22 Jun 2023
Cooperative Thresholded Lasso for Sparse Linear Bandit
European Conference on Artificial Intelligence (ECAI), 2023
Haniyeh Barghi
Xiaotong Cheng
S. Maghsudi
288
0
0
30 May 2023
Multi-task Representation Learning for Pure Exploration in Linear Bandits
International Conference on Machine Learning (ICML), 2023
Yihan Du
Longbo Huang
Wen Sun
412
6
0
09 Feb 2023
An Asymptotically Optimal Algorithm for the Convex Hull Membership Problem
Gang Qiao
Ambuj Tewari
372
0
0
03 Feb 2023
Best Arm Identification in Stochastic Bandits: Beyond $\beta$-optimality
IEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2023
Arpan Mukherjee
A. Tajer
308
4
0
10 Jan 2023
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Neural Information Processing Systems (NeurIPS), 2022
Andrea Tirinzoni
Matteo Papini
Ahmed Touati
A. Lazaric
Matteo Pirotta
309
7
0
24 Oct 2022
SPRT-based Efficient Best Arm Identification in Stochastic Bandits
IEEE Journal on Selected Areas in Information Theory (JSAIT), 2022
Arpan Mukherjee
A. Tajer
339
6
0
22 Jul 2022
Instance-optimal PAC Algorithms for Contextual Bandits
Neural Information Processing Systems (NeurIPS), 2022
Zhao Li
Lillian J. Ratliff
Houssam Nassif
Kevin Jamieson
Lalit P. Jain
289
19
0
05 Jul 2022
Active Learning with Safety Constraints
Neural Information Processing Systems (NeurIPS), 2022
Romain Camilleri
Andrew Wagenmaker
Jamie Morgenstern
Lalit P. Jain
Kevin Jamieson
288
16
0
22 Jun 2022
Choosing Answers in $\varepsilon$-Best-Answer Identification for Linear Bandits
Marc Jourdan
Rémy Degenne
204
1
0
09 Jun 2022
On Elimination Strategies for Bandit Fixed-Confidence Identification
Neural Information Processing Systems (NeurIPS), 2022
Andrea Tirinzoni
Rémy Degenne
275
9
0
22 May 2022
Instance-Dependent Regret Analysis of Kernelized Bandits
International Conference on Machine Learning (ICML), 2022
S. Shekhar
T. Javidi
313
4
0
12 Mar 2022
Nearly Optimal Algorithms for Level Set Estimation
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Blake Mason
Romain Camilleri
Subhojyoti Mukherjee
Kevin Jamieson
Robert D. Nowak
Lalit P. Jain
285
26
0
02 Nov 2021
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Neural Information Processing Systems (NeurIPS), 2021
Clémence Réda
Andrea Tirinzoni
Rémy Degenne
247
11
0
02 Nov 2021
Vector Optimization with Stochastic Bandit Feedback
International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
Shiao Liu
Jian Huang
296
12
0
23 Oct 2021
Near Instance Optimal Model Selection for Pure Exploration Linear Bandits
Yinglun Zhu
Julian Katz-Samuels
Robert D. Nowak
263
8
0
10 Sep 2021
Design of Experiments for Stochastic Contextual Linear Bandits
Neural Information Processing Systems (NeurIPS), 2021
Andrea Zanette
Kefan Dong
Jonathan Lee
Emma Brunskill
OffRL
239
21
0
21 Jul 2021
Pure Exploration in Kernel and Neural Bandits
Neural Information Processing Systems (NeurIPS), 2021
Yinglun Zhu
Dongruo Zhou
Ruoxi Jiang
Quanquan Gu
Rebecca Willett
Robert D. Nowak
204
16
0
22 Jun 2021
Fixed-Budget Best-Arm Identification in Structured Bandits
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Javad Azizi
Branislav Kveton
Mohammad Ghavamzadeh
817
28
0
09 Jun 2021
Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits
Neural Information Processing Systems (NeurIPS), 2021
Junwen Yang
Vincent Y. F. Tan
231
35
0
27 May 2021
High-Dimensional Experimental Design and Kernel Bandits
International Conference on Machine Learning (ICML), 2021
Romain Camilleri
Julian Katz-Samuels
Kevin Jamieson
291
63
0
12 May 2021