Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.14354
Cited By
Recovering Bandits
31 October 2019
Ciara Pike-Burke
Steffen Grunewalder
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Recovering Bandits"
31 / 31 papers shown
Title
Deep Index Policy for Multi-Resource Restless Matching Bandit and Its Application in Multi-Channel Scheduling
Nida Zamir
I-Hong Hou
66
0
0
13 Aug 2024
Artificial Intelligence-based Decision Support Systems for Precision and Digital Health
Nina Deliu
Bibhas Chakraborty
52
5
0
22 Jul 2024
Accounting for AI and Users Shaping One Another: The Role of Mathematical Models
Sarah Dean
Evan Dong
Meena Jagadeesan
Liu Leqi
82
8
0
18 Apr 2024
State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards
Yuto Tanimoto
Kenji Fukumizu
54
0
0
18 Mar 2024
Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms
Khashayar Khosravi
R. Leme
Chara Podimata
Apostolis Tsorvantzis
90
1
0
21 Jul 2023
Last Switch Dependent Bandits with Monotone Payoff Functions
Ayoub Foussoul
Vineet Goyal
Orestis Papadigenopoulos
A. Zeevi
59
5
0
01 Jun 2023
Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality
Dhruv Malik
Conor Igoe
Yuanzhi Li
Aarti Singh
OffRL
71
1
0
04 May 2023
Linear Bandits with Memory: from Rotting to Rising
Giulia Clerici
Pierre Laforgue
Nicolò Cesa-Bianchi
52
3
0
16 Feb 2023
Learning with Exposure Constraints in Recommendation Systems
Omer Ben-Porat
Rotem Torkan
71
12
0
02 Feb 2023
Congested Bandits: Optimal Routing via Short-term Resets
Pranjal Awasthi
Kush S. Bhatia
Sreenivas Gollapudi
Kostas Kollias
45
5
0
23 Jan 2023
Stochastic Rising Bandits
Alberto Maria Metelli
F. Trovò
Matteo Pirola
Marcello Restelli
51
18
0
07 Dec 2022
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Khaled Nakhleh
I.-Hong Hou
148
6
0
18 Sep 2022
Non-Stationary Bandits under Recharging Payoffs: Improved Planning with Sublinear Regret
Orestis Papadigenopoulos
Constantine Caramanis
Sanjay Shakkottai
48
4
0
29 May 2022
Complete Policy Regret Bounds for Tallying Bandits
Dhruv Malik
Yuanzhi Li
Aarti Singh
OffRL
55
2
0
24 Apr 2022
Modeling Attrition in Recommender Systems with Departing Bandits
Omer Ben-Porat
Lee Cohen
Liu Leqi
Zachary Chase Lipton
Yishay Mansour
75
12
0
25 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
65
5
0
04 Mar 2022
Testing Stationarity and Change Point Detection in Reinforcement Learning
Mengbing Li
C. Shi
Zhanghua Wu
Piotr Fryzlewicz
OffRL
110
9
0
03 Mar 2022
Bandit problems with fidelity rewards
Gábor Lugosi
Ciara Pike-Burke
Pierre-André Savalle
47
0
0
25 Nov 2021
A Last Switch Dependent Analysis of Satiation and Seasonality in Bandits
Pierre Laforgue
Giulia Clerici
Nicolò Cesa-Bianchi
Ran Gilad-Bachrach
71
9
0
22 Oct 2021
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Khaled Nakhleh
Santosh Ganji
Ping-Chun Hsieh
I.-Hong Hou
S. Shakkottai
137
40
0
05 Oct 2021
Batched Bandits with Crowd Externalities
Romain Laroche
Othmane Safsafi
Raphael Feraud
N. Broutin
48
0
0
29 Sep 2021
Continuous Time Bandits With Sampling Costs
R. Vaze
M. Hanawal
49
0
0
12 Jul 2021
Offline Planning and Online Learning under Recovering Rewards
D. Simchi-Levi
Zeyu Zheng
Feng Zhu
OffRL
58
1
0
28 Jun 2021
Combinatorial Blocking Bandits with Stochastic Delays
Alexia Atsidakou
Orestis Papadigenopoulos
Soumya Basu
Constantine Caramanis
Sanjay Shakkottai
61
8
0
22 May 2021
Recurrent Submodular Welfare and Matroid Blocking Bandits
Orestis Papadigenopoulos
Constantine Caramanis
69
2
0
30 Jan 2021
Online Model Selection: a Rested Bandit Formulation
Leonardo Cella
Claudio Gentile
Massimiliano Pontil
48
0
0
07 Dec 2020
Rebounding Bandits for Modeling Satiation Effects
Liu Leqi
Fatma Kılınç Karzan
Zachary Chase Lipton
A. Montgomery
58
26
0
13 Nov 2020
Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect
Priyank Agrawal
Theja Tulabandhula
35
1
0
18 Jun 2020
Contextual Blocking Bandits
Soumya Basu
Orestis Papadigenopoulos
Constantine Caramanis
Sanjay Shakkottai
83
21
0
06 Mar 2020
Bandit Learning with Delayed Impact of Actions
Wei Tang
Chien-Ju Ho
Yang Liu
95
12
0
24 Feb 2020
Stochastic Bandits with Delay-Dependent Payoffs
Leonardo Cella
Nicolò Cesa-Bianchi
87
39
0
07 Oct 2019
1