Deep Contextual Multi-armed Bandits

25 July 2018
Mark Collier
H. Llorens
arXiv: 1807.09809

Papers citing "Deep Contextual Multi-armed Bandits"

19 / 19 papers shown
Clutch Control: An Attention-based Combinatorial Bandit for Efficient Mutation in JavaScript Engine Fuzzing
Myles Foley
Sergio Maffeis
Muhammad Fakhrur Rozi
Takeshi Takahashi
169
0
0
14 Oct 2025
Consistency of Selection Strategies for Fraud Detection
Christos Revelas
O. Boldea
B. Werker
AAML
88
0
0
23 Sep 2025
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaqiang Tang
Jian Li
Nan Du
Sihong Xie
321
5
0
10 Dec 2024
Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization
Kshitij Bhatta
Geigh Zollicoffer
Manish Bhattarai
Phil Romero
C. Negre
Anders M. N. Niklasson
A. Adedoyin
177
0
0
23 Jun 2024
Insurance pricing on price comparison websites via reinforcement learning
Tanut Treetanthiploet
Yufei Zhang
Lukasz Szpruch
Isaac Bowers-Barnard
Henrietta Ridley
James Hickey
C. Pearce
OffRL
188
2
0
14 Aug 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Do June Min
A. Stolcke
A. Raju
Colin Vaz
Di He
Venkatesh Ravichandran
V. Trinh
OffRL
133
1
0
23 Mar 2023
A Reinforcement Learning Framework for Online Speaker Diarization
Baihan Lin
Xinxin Zhang
OffRL
271
2
0
21 Feb 2023
Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings
Diego Martinez-Taboada
Dino Sejdinovic
BDL
132
1
0
25 Oct 2022
Reinforcement Learning and Bandits for Speech and Language Processing: Tutorial, Review and Outlook
Expert Systems with Applications (ESWA), 2022
Baihan Lin
OffRL
AI4TS
417
28
0
24 Oct 2022
Two-Stage Neural Contextual Bandits for Personalised News Recommendation
Mengyan Zhang
Thanh Nguyen-Tang
Fangzhao Wu
Zhenyu He
Xing Xie
Cheng Soon Ong
250
4
0
26 Jun 2022
Pervasive Machine Learning for Smart Radio Environments Enabled by Reconfigurable Intelligent Surfaces
Proceedings of the IEEE (Proc. IEEE), 2022
G. C. Alexandropoulos
Kyriakos Stylianopoulos
Chongwen Huang
Chau Yuen
M. Bennis
Mérouane Debbah
192
111
0
08 May 2022
X2T: Training an X-to-Text Typing Interface with Online Learning from User Feedback
International Conference on Learning Representations (ICLR), 2022
Jensen Gao
S. Reddy
Glen Berseth
Nicholas Hardy
N. Natraj
K. Ganguly
Anca Dragan
Sergey Levine
239
10
0
04 Mar 2022
Top-K Ranking Deep Contextual Bandits for Information Selection Systems
IEEE International Conference on Systems, Man and Cybernetics (SMC), 2021
Jade Freeman
Michael Rawson
215
2
0
28 Jan 2022
Empirical analysis of representation learning and exploration in neural kernel bandits
Michal Lisicki
Arash Afkanpour
Graham W. Taylor
213
0
0
05 Nov 2021
An Efficient Algorithm for Deep Stochastic Contextual Bandits
AAAI Conference on Artificial Intelligence (AAAI), 2021
Tan Zhu
Guannan Liang
Chunjiang Zhu
HaiNing Li
J. Bi
221
1
0
12 Apr 2021
Neural Contextual Bandits with Deep Representation and Shallow Exploration
International Conference on Learning Representations (ICLR), 2020
Pan Xu
Zheng Wen
Handong Zhao
Quanquan Gu
OffRL
183
85
0
03 Dec 2020
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement Learning
Intelligent Medicine (IM), 2020
R. Awasthi
K. K. Guliani
Saif Ahmad Khan
Aniket Vashishtha
M. S. Gill
Arshita Bhatt
A. Nagori
Aniket Gupta
Ponnurangam Kumaraguru
Tavpritesh Sethi
364
29
0
14 Sep 2020
Hedging using reinforcement learning: Contextual $k$-Armed Bandit versus $Q$-learning
Loris Cannelli
Giuseppe Nuti
M. Sala
O. Szehr
OffRL
218
18
0
03 Jul 2020
Deep Reinforcement Learning with Weighted Q-Learning
Andrea Cini
Carlo D'Eramo
Jan Peters
Cesare Alippi
OffRL
171
10
0
20 Mar 2020