Recovering Bandits

31 October 2019

Papers citing "Recovering Bandits"


Title
Deep Index Policy for Multi-Resource Restless Matching Bandit and Its Application in Multi-Channel Scheduling Nida Zamir I-Hong Hou 66 0 0 13 Aug 2024
Artificial Intelligence-based Decision Support Systems for Precision and Digital Health Nina Deliu Bibhas Chakraborty 52 5 0 22 Jul 2024
Accounting for AI and Users Shaping One Another: The Role of Mathematical Models Sarah Dean Evan Dong Meena Jagadeesan Liu Leqi 82 8 0 18 Apr 2024
State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards Yuto Tanimoto Kenji Fukumizu 54 0 0 18 Mar 2024
Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms Khashayar Khosravi R. Leme Chara Podimata Apostolis Tsorvantzis 90 1 0 21 Jul 2023
Last Switch Dependent Bandits with Monotone Payoff Functions Ayoub Foussoul Vineet Goyal Orestis Papadigenopoulos A. Zeevi 59 5 0 01 Jun 2023
Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality Dhruv Malik Conor Igoe Yuanzhi Li Aarti Singh OffRL 71 1 0 04 May 2023
Linear Bandits with Memory: from Rotting to Rising Giulia Clerici Pierre Laforgue Nicolò Cesa-Bianchi 52 3 0 16 Feb 2023
Learning with Exposure Constraints in Recommendation Systems Omer Ben-Porat Rotem Torkan 71 12 0 02 Feb 2023
Congested Bandits: Optimal Routing via Short-term Resets Pranjal Awasthi Kush S. Bhatia Sreenivas Gollapudi Kostas Kollias 45 5 0 23 Jan 2023
Stochastic Rising Bandits Alberto Maria Metelli F. Trovò Matteo Pirola Marcello Restelli 51 18 0 07 Dec 2022
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs Khaled Nakhleh I-Hong Hou 148 6 0 18 Sep 2022
Non-Stationary Bandits under Recharging Payoffs: Improved Planning with Sublinear Regret Orestis Papadigenopoulos Constantine Caramanis Sanjay Shakkottai 48 4 0 29 May 2022
Complete Policy Regret Bounds for Tallying Bandits Dhruv Malik Yuanzhi Li Aarti Singh OffRL 55 2 0 24 Apr 2022
Modeling Attrition in Recommender Systems with Departing Bandits Omer Ben-Porat Lee Cohen Liu Leqi Zachary Chase Lipton Yishay Mansour 75 12 0 25 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions Nina Deliu Joseph Jay Williams B. Chakraborty OffRL 65 5 0 04 Mar 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning Mengbing Li C. Shi Zhanghua Wu Piotr Fryzlewicz OffRL 110 9 0 03 Mar 2022
Bandit problems with fidelity rewards Gábor Lugosi Ciara Pike-Burke Pierre-André Savalle 47 0 0 25 Nov 2021
A Last Switch Dependent Analysis of Satiation and Seasonality in Bandits Pierre Laforgue Giulia Clerici Nicolò Cesa-Bianchi Ran Gilad-Bachrach 71 9 0 22 Oct 2021
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL Khaled Nakhleh Santosh Ganji Ping-Chun Hsieh I-Hong Hou S. Shakkottai 137 40 0 05 Oct 2021
Batched Bandits with Crowd Externalities Romain Laroche Othmane Safsafi Raphael Feraud N. Broutin 48 0 0 29 Sep 2021
Continuous Time Bandits With Sampling Costs R. Vaze M. Hanawal 49 0 0 12 Jul 2021
Offline Planning and Online Learning under Recovering Rewards D. Simchi-Levi Zeyu Zheng Feng Zhu OffRL 58 1 0 28 Jun 2021
Combinatorial Blocking Bandits with Stochastic Delays Alexia Atsidakou Orestis Papadigenopoulos Soumya Basu Constantine Caramanis Sanjay Shakkottai 61 8 0 22 May 2021
Recurrent Submodular Welfare and Matroid Blocking Bandits Orestis Papadigenopoulos Constantine Caramanis 69 2 0 30 Jan 2021
Online Model Selection: a Rested Bandit Formulation Leonardo Cella Claudio Gentile Massimiliano Pontil 48 0 0 07 Dec 2020
Rebounding Bandits for Modeling Satiation Effects Liu Leqi Fatma Kılınç Karzan Zachary Chase Lipton A. Montgomery 58 26 0 13 Nov 2020
Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect Priyank Agrawal Theja Tulabandhula 35 1 0 18 Jun 2020
Contextual Blocking Bandits Soumya Basu Orestis Papadigenopoulos Constantine Caramanis Sanjay Shakkottai 83 21 0 06 Mar 2020
Bandit Learning with Delayed Impact of Actions Wei Tang Chien-Ju Ho Yang Liu 95 12 0 24 Feb 2020
Stochastic Bandits with Delay-Dependent Payoffs Leonardo Cella Nicolò Cesa-Bianchi 87 39 0 07 Oct 2019