ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.06248
  4. Cited By
Improved Sleeping Bandits with Stochastic Actions Sets and Adversarial
  Rewards
v1v2 (latest)

Improved Sleeping Bandits with Stochastic Actions Sets and Adversarial Rewards

International Conference on Machine Learning (ICML), 2020
14 April 2020
Aadirupa Saha
Pierre Gaillard
Michal Valko
ArXiv (abs)PDFHTML

Papers citing "Improved Sleeping Bandits with Stochastic Actions Sets and Adversarial Rewards"

9 / 9 papers shown
DP-Dueling: Learning from Preference Feedback without Compromising User
  Privacy
DP-Dueling: Learning from Preference Feedback without Compromising User Privacy
Aadirupa Saha
Hilal Asi
322
1
0
22 Mar 2024
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual
  Bandits
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual BanditsNeural Information Processing Systems (NeurIPS), 2023
Haolin Liu
Chen-Yu Wei
Julian Zimmert
288
14
0
02 Sep 2023
Adversarial Sleeping Bandit Problems with Multiple Plays: Algorithm and
  Ranking Application
Adversarial Sleeping Bandit Problems with Multiple Plays: Algorithm and Ranking ApplicationACM Conference on Recommender Systems (RecSys), 2023
Jianjun Yuan
W. Woon
Ludovik Çoba
119
1
0
27 Jul 2023
One Arrow, Two Kills: An Unified Framework for Achieving Optimal Regret
  Guarantees in Sleeping Bandits
One Arrow, Two Kills: An Unified Framework for Achieving Optimal Regret Guarantees in Sleeping BanditsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Pierre Gaillard
Aadirupa Saha
Soham Dan
235
4
0
26 Oct 2022
Walk for Learning: A Random Walk Approach for Federated Learning from
  Heterogeneous Data
Walk for Learning: A Random Walk Approach for Federated Learning from Heterogeneous DataIEEE Journal on Selected Areas in Communications (JSAC), 2022
Ghadir Ayache
Venkat Dassari
S. E. Rouayheb
FedML
185
30
0
01 Jun 2022
Non-Stationary Bandits under Recharging Payoffs: Improved Planning with
  Sublinear Regret
Non-Stationary Bandits under Recharging Payoffs: Improved Planning with Sublinear RegretNeural Information Processing Systems (NeurIPS), 2022
Orestis Papadigenopoulos
Constantine Caramanis
Sanjay Shakkottai
194
5
0
29 May 2022
Improving Sequential Query Recommendation with Immediate User Feedback
Improving Sequential Query Recommendation with Immediate User Feedback
Shameem Puthiya Parambath
Christos Anagnostopoulos
Roderick Murray-Smith
171
1
0
12 May 2022
Low-Regret Active learning
Low-Regret Active learning
Cenk Baykal
Lucas Liebenwein
Dan Feldman
Daniela Rus
UQCV
362
4
0
06 Apr 2021
Adversarial Dueling Bandits
Adversarial Dueling BanditsInternational Conference on Machine Learning (ICML), 2020
Aadirupa Saha
Tomer Koren
Yishay Mansour
371
34
0
27 Oct 2020
1
Page 1 of 1