Cooperative and Stochastic Multi-Player Multi-Armed Bandit: Optimal Regret With Neither Communication Nor Collisions

8 November 2020

Papers citing "Cooperative and Stochastic Multi-Player Multi-Armed Bandit: Optimal Regret With Neither Communication Nor Collisions"

14 / 14 papers shown

Counterfactual Multi-player Bandits for Explainable Recommendation Diversification

639

27 May 2025

Improved Bandits in Many-to-one Matching Markets with Incentive CompatibilityAAAI Conference on Artificial Intelligence (AAAI), 2024

Fang-yuan Kong

Shuai Li

331

03 Jan 2024

Decentralized Online Bandit Optimization on Directed Graphs with Regret Bounds

Johan Ostman

Ather Gattami

D. Gillblad

253

27 Jan 2023

A survey on multi-player banditsJournal of machine learning research (JMLR), 2022

Etienne Boursier

Vianney Perchet

339

29 Nov 2022

Decentralized, Communication- and Coordination-free Learning in Structured Matching MarketsNeural Information Processing Systems (NeurIPS), 2022

C. Maheshwari

Eric Mazumdar

S. Shankar Sastry

173

06 Jun 2022

The Pareto Frontier of Instance-Dependent Guarantees in Multi-Player Multi-Armed Bandits with no CommunicationAnnual Conference Computational Learning Theory (COLT), 2022

Allen Liu

Mark Sellke

269

19 Feb 2022

Cooperative Online Learning in Stochastic and Adversarial MDPsInternational Conference on Machine Learning (ICML), 2022

Tal Lancewicki

Aviv A. Rosenberg

Yishay Mansour

382

31 Jan 2022

An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit

Aldo Pacchiano

Peter L. Bartlett

Sai Li

326

08 Nov 2021

Decentralized Cooperative Reinforcement Learning with Hierarchical Information StructureInternational Conference on Algorithmic Learning Theory (ALT), 2021

Hsu Kao

Chen-Yu Wei

V. Subramanian

399

01 Nov 2021

Collaborative Pure Exploration in Kernel BanditInternational Conference on Learning Representations (ICLR), 2021

514

29 Oct 2021

Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization

Chengshuai Shi

Wei Xiong

Cong Shen

Jing Yang

346

27 Oct 2021

Multi-player Multi-armed Bandits with Collision-Dependent Reward DistributionsIEEE Transactions on Signal Processing (IEEE TSP), 2021

Chengshuai Shi

Cong Shen

141

25 Jun 2021

Decentralized Learning in Online Queuing SystemsNeural Information Processing Systems (NeurIPS), 2021

Flore Sentenac

Etienne Boursier

Vianney Perchet

251

08 Jun 2021

Towards Optimal Algorithms for Multi-Player Bandits without Collision Sensing InformationAnnual Conference Computational Learning Theory (COLT), 2021

Wei Huang

Richard Combes

Cindy Trinh

223

24 Mar 2021