ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.00445
  4. Cited By
On Kernelized Multi-armed Bandits

On Kernelized Multi-armed Bandits

3 April 2017
Sayak Ray Chowdhury
Aditya Gopalan
ArXivPDFHTML

Papers citing "On Kernelized Multi-armed Bandits"

50 / 114 papers shown
Title
Prompt Optimization with Logged Bandit Data
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
64
0
0
03 Apr 2025
Bandit Optimal Transport
Bandit Optimal Transport
Lorenzo Croissant
82
0
0
11 Feb 2025
Cost-aware Bayesian Optimization via the Pandora's Box Gittins Index
Cost-aware Bayesian Optimization via the Pandora's Box Gittins Index
Qian Xie
Raul Astudillo
P. Frazier
Ziv Scully
Alexander Terenin
92
2
0
17 Jan 2025
Constrained Multi-objective Bayesian Optimization through Optimistic Constraints Estimation
Constrained Multi-objective Bayesian Optimization through Optimistic Constraints Estimation
Diantong Li
Fengxue Zhang
Chong Liu
Yuxin Chen
206
0
0
06 Nov 2024
Towards safe Bayesian optimization with Wiener kernel regression
Towards safe Bayesian optimization with Wiener kernel regression
O. Molodchyk
Johannes Teutsch
T. Faulwasser
31
0
0
04 Nov 2024
Optimizing Posterior Samples for Bayesian Optimization via Rootfinding
Optimizing Posterior Samples for Bayesian Optimization via Rootfinding
Taiwo A. Adebiyi
Bach Do
Ruda Zhang
114
2
0
29 Oct 2024
Lower Bounds for Time-Varying Kernelized Bandits
Lower Bounds for Time-Varying Kernelized Bandits
Xu Cai
Jonathan Scarlett
36
0
0
22 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
42
0
0
07 Oct 2024
Towards safe and tractable Gaussian process-based MPC: Efficient
  sampling within a sequential quadratic programming framework
Towards safe and tractable Gaussian process-based MPC: Efficient sampling within a sequential quadratic programming framework
Manish Prajapat
Amon Lahr
Johannes Köhler
Andreas Krause
Melanie Zeilinger
29
2
0
13 Sep 2024
Variational Search Distributions
Variational Search Distributions
Daniel M. Steinberg
Rafael Oliveira
Cheng Soon Ong
Edwin V. Bonilla
35
0
0
10 Sep 2024
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and
  Rotting
Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting
Gianmarco Genalti
Marco Mussi
Nicola Gatti
Marcello Restelli
Matteo Castiglioni
Alberto Maria Metelli
38
0
0
09 Sep 2024
PACSBO: Probably approximately correct safe Bayesian optimization
PACSBO: Probably approximately correct safe Bayesian optimization
Abdullah Tokmak
Thomas B. Schon
Dominik Baumann
37
2
0
02 Sep 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
81
1
0
22 Aug 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma
Zhongxiang Dai
Xiaoqiang Lin
Patrick Jaillet
K. H. Low
37
5
0
24 Jul 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
Junghyun Lee
Se-Young Yun
Kwang-Sung Jun
33
4
0
19 Jul 2024
Meta-Analysis with Untrusted Data
Meta-Analysis with Untrusted Data
Shiva Kaul
Geoffrey J. Gordon
CML
32
1
0
12 Jul 2024
Simultaneous System Identification and Model Predictive Control with No Dynamic Regret
Simultaneous System Identification and Model Predictive Control with No Dynamic Regret
Hongyu Zhou
Vasileios Tzoumas
93
4
0
04 Jul 2024
Graph Neural Thompson Sampling
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
51
0
0
15 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
84
2
0
07 Jun 2024
NeoRL: Efficient Exploration for Nonepisodic RL
NeoRL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
Lenart Treven
Florian Dorfler
Stelian Coros
Andreas Krause
OffRL
41
0
0
03 Jun 2024
OptEx: Expediting First-Order Optimization with Approximately
  Parallelized Iterations
OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations
Yao Shu
Jiongfeng Fang
Y. He
Fei Richard Yu
35
0
0
18 Feb 2024
Active Few-Shot Fine-Tuning
Active Few-Shot Fine-Tuning
Jonas Hübotter
Bhavya Sukhija
Lenart Treven
Yarden As
Andreas Krause
45
1
0
13 Feb 2024
Improving sample efficiency of high dimensional Bayesian optimization
  with MCMC
Improving sample efficiency of high dimensional Bayesian optimization with MCMC
Zeji Yi
Yunyue Wei
Chu Xin Cheng
Kaibo He
Yanan Sui
30
5
0
05 Jan 2024
Towards Safe Multi-Task Bayesian Optimization
Towards Safe Multi-Task Bayesian Optimization
Jannis O. Lübsen
Christian Hespe
Annika Eichler
29
3
0
12 Dec 2023
Constraint-Guided Online Data Selection for Scalable Data-Driven Safety
  Filters in Uncertain Robotic Systems
Constraint-Guided Online Data Selection for Scalable Data-Driven Safety Filters in Uncertain Robotic Systems
Jason J. Choi
F. Castañeda
Wonsuhk Jung
Bike Zhang
Claire J. Tomlin
Koushil Sreenath
37
3
0
23 Nov 2023
Adaptive Interventions with User-Defined Goals for Health Behavior
  Change
Adaptive Interventions with User-Defined Goals for Health Behavior Change
Aishwarya Mandyam
Matthew Joerke
William Denton
Barbara E. Engelhardt
Emma Brunskill
32
1
0
16 Nov 2023
Exact Inference for Continuous-Time Gaussian Process Dynamics
Exact Inference for Continuous-Time Gaussian Process Dynamics
K. Ensinger
Nicholas Tagliapietra
Sebastian Ziesche
Sebastian Trimpe
29
1
0
05 Sep 2023
Online Network Source Optimization with Graph-Kernel MAB
Online Network Source Optimization with Graph-Kernel MAB
Laura Toni
P. Frossard
26
1
0
07 Jul 2023
Training-Free Neural Active Learning with Initialization-Robustness
  Guarantees
Training-Free Neural Active Learning with Initialization-Robustness Guarantees
Apivich Hemachandra
Zhongxiang Dai
Jasraj Singh
See-Kiong Ng
K. H. Low
AAML
36
6
0
07 Jun 2023
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via
  Pessimism
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism
Zihao Li
Zhuoran Yang
Mengdi Wang
OffRL
37
55
0
29 May 2023
Can Learning Deteriorate Control? Analyzing Computational Delays in
  Gaussian Process-Based Event-Triggered Online Learning
Can Learning Deteriorate Control? Analyzing Computational Delays in Gaussian Process-Based Event-Triggered Online Learning
X. Dai
Armin Lederer
Zewen Yang
Sandra Hirche
38
13
0
14 May 2023
BO-Muse: A human expert and AI teaming framework for accelerated
  experimental design
BO-Muse: A human expert and AI teaming framework for accelerated experimental design
Sunil R. Gupta
A. Shilton
V. ArunKumarA.
S. Ryan
Majid Abdolshah
Hung Le
Santu Rana
Julian Berk
Mahad Rashid
Svetha Venkatesh
42
7
0
03 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy
  Evaluation
Hallucinated Adversarial Control for Conservative Offline Policy Evaluation
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
21
10
0
02 Mar 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function
  Approximation
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Thanh Nguyen-Tang
R. Arora
OffRL
46
5
0
24 Feb 2023
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and
  Scaling Laws
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws
Kush S. Bhatia
Wenshuo Guo
Jacob Steinhardt
27
0
0
23 Feb 2023
Sharp Calibrated Gaussian Processes
Sharp Calibrated Gaussian Processes
A. Capone
Geoff Pleiss
Sandra Hirche
UQCV
42
4
0
23 Feb 2023
Randomized Gaussian Process Upper Confidence Bound with Tighter Bayesian
  Regret Bounds
Randomized Gaussian Process Upper Confidence Bound with Tighter Bayesian Regret Bounds
Shion Takeno
Yu Inatsu
Masayuki Karasuyama
30
13
0
03 Feb 2023
Sample Complexity of Kernel-Based Q-Learning
Sample Complexity of Kernel-Based Q-Learning
Sing-Yuan Yeh
Fu-Chieh Chang
Chang-Wei Yueh
Pei-Yuan Wu
A. Bernacchia
Sattar Vakili
OffRL
30
4
0
01 Feb 2023
Contextual Causal Bayesian Optimisation
Contextual Causal Bayesian Optimisation
Vahan Arsenyan
Antoine Grosnit
Haitham Bou-Ammar
CML
38
2
0
29 Jan 2023
(Private) Kernelized Bandits with Distributed Biased Feedback
(Private) Kernelized Bandits with Distributed Biased Feedback
Fengjiao Li
Xingyu Zhou
Bo Ji
33
5
0
28 Jan 2023
Learning Disturbances Online for Risk-Aware Control: Risk-Aware Flight
  with Less Than One Minute of Data
Learning Disturbances Online for Risk-Aware Control: Risk-Aware Flight with Less Than One Minute of Data
Prithvi Akella
Skylar X. Wei
J. W. Burdick
Aaron D. Ames
32
4
0
12 Dec 2022
Information-Theoretic Safe Exploration with Gaussian Processes
Information-Theoretic Safe Exploration with Gaussian Processes
A. Bottero
Carlos E. Luis
Julia Vinogradska
Felix Berkenkamp
Jan Peters
31
13
0
09 Dec 2022
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed
  Bandit with Constraints
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints
Heng Guo
Qi Zhu
Xin Liu
29
11
0
27 Nov 2022
CONFIG: Constrained Efficient Global Optimization for Closed-Loop
  Control System Optimization with Unmodeled Constraints
CONFIG: Constrained Efficient Global Optimization for Closed-Loop Control System Optimization with Unmodeled Constraints
Wenjie Xu
Yuning Jiang
B. Svetozarevic
Colin N. Jones
25
7
0
21 Nov 2022
Safe Optimization of an Industrial Refrigeration Process Using an
  Adaptive and Explorative Framework
Safe Optimization of an Industrial Refrigeration Process Using an Adaptive and Explorative Framework
Buse Sibel Korkmaz
Marta Zagórowska
Mehmet Mercangöz
AI4CE
22
3
0
21 Nov 2022
Safe and Adaptive Decision-Making for Optimization of Safety-Critical
  Systems: The ARTEO Algorithm
Safe and Adaptive Decision-Making for Optimization of Safety-Critical Systems: The ARTEO Algorithm
Buse Sibel Korkmaz
Marta Zagórowska
Mehmet Mercangöz
30
2
0
10 Nov 2022
Movement Penalized Bayesian Optimization with Application to Wind Energy
  Systems
Movement Penalized Bayesian Optimization with Application to Wind Energy Systems
Shyam Sundhar Ramesh
Pier Giuseppe Sessa
Andreas Krause
Ilija Bogunovic
16
12
0
14 Oct 2022
Near-Optimal Multi-Agent Learning for Safe Coverage Control
Near-Optimal Multi-Agent Learning for Safe Coverage Control
Manish Prajapat
M. Turchetta
Melanie Zeilinger
Andreas Krause
35
14
0
12 Oct 2022
Game-theoretic statistics and safe anytime-valid inference
Game-theoretic statistics and safe anytime-valid inference
Aaditya Ramdas
Peter Grünwald
V. Vovk
Glenn Shafer
40
119
0
04 Oct 2022
Batch Bayesian optimisation via density-ratio estimation with guarantees
Batch Bayesian optimisation via density-ratio estimation with guarantees
Rafael Oliveira
Louis C. Tiao
Fabio Ramos
44
7
0
22 Sep 2022
123
Next