Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback

AAAI Conference on Artificial Intelligence (AAAI), 2020

13 December 2020

Papers citing "Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback"

Adversarial Bandits with Multi-User Delayed Feedback: Theory and Application

IEEE Transactions on Mobile Computing (IEEE TMC), 2023

Weijia Jia

425

17 Oct 2023

Reinforcement Learning with Delayed, Composite, and Partially Anonymous Reward

Washim Uddin Mondal

Vaneet Aggarwal

257

04 May 2023

Stochastic Submodular Bandits with Delayed Composite Anonymous Bandit Feedback

IEEE Transactions on Artificial Intelligence (IEEE TAI), 2023

M. Pedramfar

Vaneet Aggarwal

248

23 Mar 2023

Multi-Armed Bandits with Generalized Temporally-Partitioned Rewards

International Symposium on Intelligent Data Analysis (IDA), 2023

Ronald C. van den Broek

193

01 Mar 2023

Dynamical Linear Bandits

International Conference on Machine Learning (ICML), 2022

Marco Mussi

Alberto Maria Metelli

Marcello Restelli

243

16 Nov 2022

Generalizing distribution of partial rewards for multi-armed bandits with temporally-partitioned rewards

Ronald C. van den Broek

13 Nov 2022

Bounded Memory Adversarial Bandits with Composite Anonymous Delayed Feedback

International Joint Conference on Artificial Intelligence (IJCAI), 2022

Zongqi Wan

Xiaoming Sun

Jialin Zhang

179

27 Apr 2022