Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games

AAAI Conference on Artificial Intelligence (AAAI), 2021

8 March 2021

Papers citing "Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games"

12 / 12 papers shown

Efficient Near-Optimal Algorithm for Online Shortest Paths in Directed Acyclic Graphs with Bandit Feedback Against Adaptive AdversariesAnnual Conference Computational Learning Theory (COLT), 2025

770

01 Apr 2025

Best of Both Worlds: Regret Minimization versus Minimax Play

260

17 Feb 2025

A Policy-Gradient Approach to Solving Imperfect-Information Games with Best-Iterate ConvergenceInternational Conference on Learning Representations (ICLR), 2024

Mingyang Liu

Gabriele Farina

Asuman Ozdaglar

439

01 Aug 2024

Local and adaptive mirror descents in extensive-form gamesNeural Information Processing Systems (NeurIPS), 2023

Pierre Ménard

273

01 Sep 2023

Adapting to game trees in zero-sum imperfect information gamesInternational Conference on Machine Learning (ICML), 2022

Pierre Ménard

513

23 Dec 2022

Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient AlgorithmsInternational Conference on Learning Representations (ICLR), 2022

Fan Chen

Yu Bai

Song Mei

342

29 Sep 2022

Sequential Information Design: Learning to Persuade in the DarkNeural Information Processing Systems (NeurIPS), 2022

222

08 Sep 2022

Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror DescentNeural Information Processing Systems (NeurIPS), 2022

273

30 May 2022

Generalized Bandit Regret Minimizer Framework in Imperfect Information Extensive-Form Game

Lin Meng

Yang Gao

395

11 Mar 2022

Near-Optimal Learning of Extensive-Form Games with Imperfect InformationInternational Conference on Machine Learning (ICML), 2022

388

03 Feb 2022

Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall

Tadashi Kozuno

Pierre Ménard

Rémi Munos

Michal Valko

285

11 Jun 2021

Model-Free Online Learning in Unknown Sequential Decision Making Problems and GamesAAAI Conference on Artificial Intelligence (AAAI), 2021

Gabriele Farina

Tuomas Sandholm

OffRL

235

08 Mar 2021