Deep Contextual Multi-armed Bandits

25 July 2018

Mark Collier

H. Llorens

Papers citing "Deep Contextual Multi-armed Bandits"

19 / 19 papers shown

Clutch Control: An Attention-based Combinatorial Bandit for Efficient Mutation in JavaScript Engine Fuzzing

Myles Foley

Sergio Maffeis

Muhammad Fakhrur Rozi

Takeshi Takahashi

169

14 Oct 2025

Consistency of Selection Strategies for Fraud Detection

23 Sep 2025

Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs

AAAI Conference on Artificial Intelligence (AAAI), 2024

321

10 Dec 2024

Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization

Manish Bhattarai

Anders M. N. Niklasson

A. Adedoyin

177

23 Jun 2024

Insurance pricing on price comparison websites via reinforcement learning

Tanut Treetanthiploet

192

14 Aug 2023

Adaptive Endpointing with Deep Contextual Multi-armed Bandits

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Venkatesh Ravichandran

V. Trinh

OffRL

133

23 Mar 2023

A Reinforcement Learning Framework for Online Speaker Diarization

Baihan Lin

Xinxin Zhang

OffRL

272

21 Feb 2023

Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings

Diego Martinez-Taboada

Dino Sejdinovic

BDL

132

25 Oct 2022

Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook

Expert Systems with Applications (ESWA), 2022

Baihan Lin

OffRL AI4TS

417

24 Oct 2022

Two-Stage Neural Contextual Bandits for Personalised News Recommendation

Thanh Nguyen-Tang

Xing Xie

250

26 Jun 2022

Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces

Proceedings of the IEEE (Proc. IEEE), 2022

G. C. Alexandropoulos

Kyriakos Stylianopoulos

Chongwen Huang

Chau Yuen

M. Bennis

Mérouane Debbah

192

112

08 May 2022

X2T: Training an X-to-Text Typing Interface with Online Learning from User Feedback

International Conference on Learning Representations (ICLR), 2022

239

04 Mar 2022

Top-K Ranking Deep Contextual Bandits for Information Selection Systems

IEEE International Conference on Systems, Man and Cybernetics (SMC), 2021

Jade Freeman

Michael Rawson

215

28 Jan 2022

Empirical analysis of representation learning and exploration in neural kernel bandits

Michal Lisicki

Arash Afkanpour

Graham W. Taylor

213

05 Nov 2021

An Efficient Algorithm for Deep Stochastic Contextual Bandits

AAAI Conference on Artificial Intelligence (AAAI), 2021

222

12 Apr 2021

Neural Contextual Bandits with Deep Representation and Shallow Exploration

International Conference on Learning Representations (ICLR), 2020

Quanquan Gu

183

03 Dec 2020

VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning

Intelligent Medicine (IM), 2020

Ponnurangam Kumaraguru

Tavpritesh Sethi

364

14 Sep 2020

Hedging using reinforcement learning: Contextual k-Armed Bandit versus Q

218

03 Jul 2020

Deep Reinforcement Learning with Weighted Q-Learning

Jan Peters

172

20 Mar 2020