Thompson Sampling in Non-Episodic Restless Bandits


12 October 2019
Young Hun Jung
Marc Abeille
Ambuj Tewari

Papers citing "Thompson Sampling in Non-Episodic Restless Bandits"

13 / 13 papers shown
Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem
Nima Akbarzadeh
Erick Delage
Yossiri Adulyasak
180
0
0
30 Oct 2024
A resource-constrained stochastic scheduling algorithm for homeless street outreach and gleaning edible food
Conor M. Artman
Aditya Mate
Ezinne Nwankwo
A. Heching
Tsuyoshi Idé
...
Kush R. Varshney
Lauri Goldkind
Gidi Kroch
Jaclyn Sawyer
Ian Watson
72
0
0
15 Mar 2024
Fairness of Exposure in Online Restless Multi-armed Bandits
Archit Sood
Shweta Jain
Sujit Gujar
61
2
0
09 Feb 2024
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints
Shu-Fan Wang
Guojun Xiong
Jian Li
108
7
0
16 Dec 2023
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning
Amin Karbasi
Nikki Lijing Kuang
Yi-An Ma
Siddharth Mitra
OffRL
67
5
0
15 Jun 2023
Networked Restless Bandits with Positive Externalities
Christine Herlihy
John P. Dickerson
84
3
0
09 Dec 2022
Optimistic Whittle Index Policy: Online Learning for Restless Bandits
Kai Wang
Lily Xu
Aparna Taneja
Milind Tambe
84
17
0
30 May 2022
On learning Whittle index policy for restless bandits with scalable regret
N. Akbarzadeh
Aditya Mahajan
97
13
0
07 Feb 2022
Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits
Guojun Xiong
Jian Li
Rahul Singh
45
4
0
20 Sep 2021
Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning
J. Killian
Lily Xu
Arpita Biswas
Milind Tambe
61
6
0
04 Jul 2021
Planning to Fairly Allocate: Probabilistic Fairness in the Restless Bandit Setting
Christine Herlihy
Aviva Prins
A. Srinivasan
John P. Dickerson
68
15
0
14 Jun 2021
Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits
Siwei Wang
Longbo Huang
John C. S. Lui
OffRL
91
39
0
05 Nov 2020
Screening for an Infectious Disease as a Problem in Stochastic Control
Jakub Mareček
31
3
0
01 Nov 2020