ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.07048
  4. Cited By
Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous
  Feedback
v1v2 (latest)

Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback

AAAI Conference on Artificial Intelligence (AAAI), 2020
13 December 2020
Siwei Wang
Haoyun Wang
Longbo Huang
ArXiv (abs)PDFHTML

Papers citing "Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback"

7 / 7 papers shown
Adversarial Bandits with Multi-User Delayed Feedback: Theory and
  Application
Adversarial Bandits with Multi-User Delayed Feedback: Theory and ApplicationIEEE Transactions on Mobile Computing (IEEE TMC), 2023
Yandi Li
Jianxiong Guo
Yupeng Li
Tian-sheng Wang
Weijia Jia
425
2
0
17 Oct 2023
Reinforcement Learning with Delayed, Composite, and Partially Anonymous
  Reward
Reinforcement Learning with Delayed, Composite, and Partially Anonymous Reward
Washim Uddin Mondal
Vaneet Aggarwal
257
3
0
04 May 2023
Stochastic Submodular Bandits with Delayed Composite Anonymous Bandit Feedback
Stochastic Submodular Bandits with Delayed Composite Anonymous Bandit FeedbackIEEE Transactions on Artificial Intelligence (IEEE TAI), 2023
M. Pedramfar
Vaneet Aggarwal
248
2
0
23 Mar 2023
Multi-Armed Bandits with Generalized Temporally-Partitioned Rewards
Multi-Armed Bandits with Generalized Temporally-Partitioned RewardsInternational Symposium on Intelligent Data Analysis (IDA), 2023
Ronald C. van den Broek
Rik Litjens
Tobias Sagis
Luc Siecker
Nina Verbeeke
Pratik Gajane
193
0
0
01 Mar 2023
Dynamical Linear Bandits
Dynamical Linear BanditsInternational Conference on Machine Learning (ICML), 2022
Marco Mussi
Alberto Maria Metelli
Marcello Restelli
243
3
0
16 Nov 2022
Generalizing distribution of partial rewards for multi-armed bandits
  with temporally-partitioned rewards
Generalizing distribution of partial rewards for multi-armed bandits with temporally-partitioned rewards
Ronald C. van den Broek
Rik Litjens
Tobias Sagis
Luc Siecker
Nina Verbeeke
Pratik Gajane
91
0
0
13 Nov 2022
Bounded Memory Adversarial Bandits with Composite Anonymous Delayed
  Feedback
Bounded Memory Adversarial Bandits with Composite Anonymous Delayed FeedbackInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Zongqi Wan
Xiaoming Sun
Jialin Zhang
179
1
0
27 Apr 2022
1
Page 1 of 1