Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1807.09809
Cited By
Deep Contextual Multi-armed Bandits
25 July 2018
Mark Collier
H. Llorens
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Contextual Multi-armed Bandits"
19 / 19 papers shown
Clutch Control: An Attention-based Combinatorial Bandit for Efficient Mutation in JavaScript Engine Fuzzing
Myles Foley
Sergio Maffeis
Muhammad Fakhrur Rozi
Takeshi Takahashi
169
0
0
14 Oct 2025
Consistency of Selection Strategies for Fraud Detection
Christos Revelas
O. Boldea
B. Werker
AAML
88
0
0
23 Sep 2025
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaqiang Tang
Jian Li
Nan Du
Sihong Xie
321
5
0
10 Dec 2024
Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization
Kshitij Bhatta
Geigh Zollicoffer
Manish Bhattarai
Phil Romero
C. Negre
Anders M. N. Niklasson
A. Adedoyin
177
0
0
23 Jun 2024
Insurance pricing on price comparison websites via reinforcement learning
Tanut Treetanthiploet
Yufei Zhang
Lukasz Szpruch
Isaac Bowers-Barnard
Henrietta Ridley
James Hickey
C. Pearce
OffRL
192
2
0
14 Aug 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Do June Min
A. Stolcke
A. Raju
Colin Vaz
Di He
Venkatesh Ravichandran
V. Trinh
OffRL
133
1
0
23 Mar 2023
A Reinforcement Learning Framework for Online Speaker Diarization
Baihan Lin
Xinxin Zhang
OffRL
272
2
0
21 Feb 2023
Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings
Diego Martinez-Taboada
Dino Sejdinovic
BDL
132
1
0
25 Oct 2022
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Expert systems with applications (ESWA), 2022
Baihan Lin
OffRL
AI4TS
417
28
0
24 Oct 2022
Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Mengyan Zhang
Thanh Nguyen-Tang
Fangzhao Wu
Zhenyu He
Xing Xie
Cheng Soon Ong
250
4
0
26 Jun 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
Proceedings of the IEEE (Proc. IEEE), 2022
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
192
112
0
08 May 2022
X2T: Training an X-to-Text Typing Interface with Online Learning from User Feedback
International Conference on Learning Representations (ICLR), 2022
Jensen Gao
S. Reddy
Glen Berseth
Nicholas Hardy
N. Natraj
K. Ganguly
Anca Dragan
Sergey Levine
239
10
0
04 Mar 2022
Top-K Ranking Deep Contextual Bandits for Information Selection Systems
IEEE International Conference on Systems, Man and Cybernetics (SMC), 2021
Jade Freeman
Michael Rawson
215
2
0
28 Jan 2022
Empirical analysis of representation learning and exploration in neural kernel bandits
Michal Lisicki
Arash Afkanpour
Graham W. Taylor
213
0
0
05 Nov 2021
An Efficient Algorithm for Deep Stochastic Contextual Bandits
AAAI Conference on Artificial Intelligence (AAAI), 2021
Tan Zhu
Guannan Liang
Chunjiang Zhu
HaiNing Li
J. Bi
222
1
0
12 Apr 2021
Neural Contextual Bandits with Deep Representation and Shallow Exploration
International Conference on Learning Representations (ICLR), 2020
Pan Xu
Zheng Wen
Handong Zhao
Quanquan Gu
OffRL
183
85
0
03 Dec 2020
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning
Intelligent Medicine (IM), 2020
R. Awasthi
K. K. Guliani
Saif Ahmad Khan
Aniket Vashishtha
M. S. Gill
Arshita Bhatt
A. Nagori
Aniket Gupta
Ponnurangam Kumaraguru
Tavpritesh Sethi
364
29
0
14 Sep 2020
Hedging using reinforcement learning: Contextual
k
k
k
-Armed Bandit versus
Q
Q
Q
-learning
Loris Cannelli
Giuseppe Nuti
M. Sala
O. Szehr
OffRL
218
18
0
03 Jul 2020
Deep Reinforcement Learning with Weighted Q-Learning
Andrea Cini
Carlo DÉramo
Jan Peters
Cesare Alippi
OffRL
172
10
0
20 Mar 2020
1