Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09127
Cited By
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
26 February 2018
C. Riquelme
George Tucker
Jasper Snoek
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling"
50 / 227 papers shown
Title
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
124
0
0
04 May 2025
Exploring Pseudo-Token Approaches in Transformer Neural Processes
Jose Lara-Rangel
Nanze Chen
Fengzhe Zhang
27
0
0
19 Apr 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
54
0
0
23 Mar 2025
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction
Anni Zhou
Raheem Beyah
Rishikesan Kamaleswaran
36
0
0
20 Mar 2025
Exploring the Potential of Bilevel Optimization for Calibrating Neural Networks
Gabriele Sanguin
Arjun Pakrashi
Marco Viola
Francesco Rinaldi
54
0
0
17 Mar 2025
Exploiting Concavity Information in Gaussian Process Contextual Bandit Optimization
Kevin Li
Eric Laber
43
0
0
13 Mar 2025
Active Learning for Direct Preference Optimization
B. Kveton
Xintong Li
Julian McAuley
Ryan Rossi
Jingbo Shang
Junda Wu
Tong Yu
56
1
0
03 Mar 2025
LNUCB-TA: Linear-nonlinear Hybrid Bandit Learning with Temporal Attention
H. Khosravi
Mohammad Reza Shafie
Ahmed Shoyeb Raihan
Srinjoy Das
I. Imtiaz Ahmed
29
0
0
01 Mar 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
97
0
0
08 Nov 2024
PageRank Bandits for Link Prediction
Yikun Ban
Jiaru Zou
Zihao Li
Yunzhe Qi
Dongqi Fu
Jian Kang
Hanghang Tong
Jingrui He
34
2
0
03 Nov 2024
Online Posterior Sampling with a Diffusion Prior
B. Kveton
Boris Oreshkin
Youngsuk Park
Aniket Deshmukh
Rui Song
DiffM
35
0
0
04 Oct 2024
The Digital Transformation in Health: How AI Can Improve the Performance of Health Systems
África Periánez
Ana Fernández del Río
Ivan Nazarov
Enric Jané
Moiz Hassan
Aditya Rastogi
Dexian Tang
44
11
0
24 Sep 2024
Adaptive User Journeys in Pharma E-Commerce with Reinforcement Learning: Insights from SwipeRx
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRL
OnRL
34
2
0
15 Aug 2024
Adaptive Behavioral AI: Reinforcement Learning to Enhance Pharmacy Services
Ana Fernández del Río
Michael Brennan Leong
Paulo Saraiva
Ivan Nazarov
Aditya Rastogi
Moiz Hassan
Dexian Tang
África Periánez
OffRL
21
3
0
14 Aug 2024
Optimizing HIV Patient Engagement with Reinforcement Learning in Resource-Limited Settings
África Periánez
Kathrin Schmitz
Lazola Makhupula
Moiz Hassan
Moeti Moleko
Ana Fernández del Río
Ivan Nazarov
Aditya Rastogi
Dexian Tang
OffRL
30
0
0
14 Aug 2024
Meta Clustering of Neural Bandits
Yikun Ban
Yunzhe Qi
Tianxin Wei
Lihui Liu
Jingrui He
40
2
0
10 Aug 2024
AExGym: Benchmarks and Environments for Adaptive Experimentation
Jimmy Wang
Ethan Che
Daniel R. Jiang
Hongseok Namkoong
32
0
0
08 Aug 2024
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits
Ziyi Huang
Henry Lam
Haofeng Zhang
33
0
0
20 Jun 2024
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
43
0
0
15 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
67
2
0
07 Jun 2024
A Bayesian Approach to Online Planning
Nir Greshler
David Ben-Eli
Carmel Rabinovitz
Gabi Guetta
Liran Gispan
Guy Zohar
Aviv Tamar
18
0
0
04 Jun 2024
Position: Why We Must Rethink Empirical Research in Machine Learning
Moritz Herrmann
F. J. D. Lange
Katharina Eggensperger
Giuseppe Casalicchio
Marcel Wever
Matthias Feurer
David Rügamer
Eyke Hüllermeier
A. Boulesteix
Bernd Bischl
47
6
0
03 May 2024
Online Personalizing White-box LLMs Generation with Neural Bandits
Zekai Chen
Weeden Daniel
Po-yu Chen
Francois Buet-Golfouse
36
2
0
24 Apr 2024
Uncertainty in Language Models: Assessment through Rank-Calibration
Xinmeng Huang
Shuo Li
Mengxin Yu
Matteo Sesia
Hamed Hassani
Insup Lee
Osbert Bastani
Edgar Dobriban
35
16
0
04 Apr 2024
On the Importance of Uncertainty in Decision-Making with Large Language Models
Nicolò Felicioni
Lucas Maystre
Sina Ghiassian
K. Ciosek
LLMAG
29
2
0
03 Apr 2024
Better than classical? The subtle art of benchmarking quantum machine learning models
Joseph Bowles
Shahnawaz Ahmed
Maria Schuld
34
63
0
11 Mar 2024
ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment
Hao-Lun Hsu
Qitong Gao
Miroslav Pajic
53
0
0
11 Mar 2024
Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Xiaoying Zhang
Jean-François Ton
Wei Shen
Hongning Wang
Yang Liu
37
13
0
08 Mar 2024
Watch Your Head: Assembling Projection Heads to Save the Reliability of Federated Models
Jinqian Chen
Jihua Zhu
Qinghai Zheng
Zhongyu Li
Zhiqiang Tian
FedML
14
3
0
26 Feb 2024
Diffusion Models Meet Contextual Bandits with Large Action Spaces
Imad Aouali
DiffM
27
4
0
15 Feb 2024
Predictive Churn with the Set of Good Models
J. Watson-Daniels
Flavio du Pin Calmon
Alexander DÁmour
Carol Xuan Long
David C. Parkes
Berk Ustun
79
7
0
12 Feb 2024
LiRank: Industrial Large Scale Ranking Models at LinkedIn
Fedor Borisyuk
Mingzhou Zhou
Qingquan Song
Siyu Zhu
B. Tiwana
...
Chen-Chen Jiang
Haichao Wei
Maneesh Varshney
Amol Ghoting
Souvik Ghosh
24
1
0
10 Feb 2024
Efficient Exploration for LLMs
Vikranth Dwaracherla
S. Asghari
Botao Hao
Benjamin Van Roy
LLMAG
15
20
0
01 Feb 2024
Improving sample efficiency of high dimensional Bayesian optimization with MCMC
Zeji Yi
Yunyue Wei
Chu Xin Cheng
Kaibo He
Yanan Sui
17
5
0
05 Jan 2024
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration
Fahri Wisnu Murti
Samad Ali
Matti Latva-aho
18
0
0
26 Dec 2023
Risk-Aware Continuous Control with Neural Contextual Bandits
J. Ayala-Romero
A. Garcia-Saavedra
Xavier Pérez Costa
13
3
0
15 Dec 2023
RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions
Easton K. Huch
Jieru Shi
Madeline R Abbott
J. Golbus
Alexander Moreno
Walter Dempsey
OffRL
19
0
0
11 Dec 2023
Bootstrap Your Own Variance
Polina Turishcheva
Jason Ramapuram
Sinead Williamson
Dan Busbridge
Eeshan Gunesh Dhekane
Russ Webb
UQCV
18
0
0
06 Dec 2023
Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling
Bairu Hou
Yujian Liu
Kaizhi Qian
Jacob Andreas
Shiyu Chang
Yang Zhang
UD
UQCV
PER
21
48
0
15 Nov 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Nikki Lijing Kuang
Ming Yin
Mengdi Wang
Yu-Xiang Wang
Yian Ma
24
6
0
29 Oct 2023
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization
Yunfan Zhao
Nikhil Behari
Edward Hughes
Edwin Zhang
Dheeraj M. Nagaraj
K. Tuyls
Aparna Taneja
Milind Tambe
21
7
0
23 Oct 2023
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Zheqing Zhu
Yueyang Liu
Xu Kuang
Benjamin Van Roy
AI4TS
27
0
0
11 Oct 2023
Epsilon non-Greedy: A Bandit Approach for Unbiased Recommendation via Uniform Data
S.M.F. Sani
Seyed Abbas Hosseini
Hamid R. Rabiee
OffRL
12
1
0
07 Oct 2023
Multi-fidelity climate model parameterization for better generalization and extrapolation
Mohamed Aziz Bhouri
Liran Peng
Michael S. Pritchard
Pierre Gentine
AI4CE
26
4
0
19 Sep 2023
Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness
Jiuhai Chen
Jonas W. Mueller
42
55
0
30 Aug 2023
Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem
Elena Gal
Shaun Singh
Aldo Pacchiano
Benjamin Walker
Terry Lyons
Jakob N. Foerster
FaML
20
0
0
15 Aug 2023
VITS : Variational Inference Thompson Sampling for contextual bandits
Pierre Clavier
Tom Huix
Alain Durmus
25
3
0
19 Jul 2023
Density Uncertainty Layers for Reliable Uncertainty Estimation
Yookoon Park
David M. Blei
UQCV
BDL
19
2
0
21 Jun 2023
Collapsed Inference for Bayesian Deep Learning
Zhe Zeng
Guy Van den Broeck
FedML
BDL
UQCV
17
8
0
16 Jun 2023
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning
Amin Karbasi
Nikki Lijing Kuang
Yi-An Ma
Siddharth Mitra
OffRL
27
5
0
15 Jun 2023
1
2
3
4
5
Next