Title
Empirical Validation of the Independent Chip Model Juho Kim 20 0 0 30 May 2025
Generalization in Monitored Markov Decision Processes (Mon-MDPs) Montaser Mohammedalamen Michael Bowling 97 0 0 13 May 2025
Meta-Learning in Self-Play Regret Minimization David Sychrovský Martin Schmid Michal Sustr Michael Bowling 71 0 0 26 Apr 2025
Approximating Nash Equilibria in General-Sum Games via Meta-Learning David Sychrovský Christopher Solinas Revan MacQueen Kevin Wang James Wright Nathan R Sturtevant Michael Bowling 54 0 0 26 Apr 2025
Rethinking the Foundations for Continual Reinforcement Learning Michael Bowling Esraa Elelimy CLL OffRL LRM 83 4 0 10 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers Jake Grigsby Yuqi Xie Justin Sasek Steven Zheng Yuke Zhu OffRL 87 1 0 06 Apr 2025
Faster Rates for No-Regret Learning in General Games via Cautious Optimism Ashkan Soleymani Georgios Piliouras Gabriele Farina 103 1 0 31 Mar 2025
Asynchronous Predictive Counterfactual Regret Minimization$^+$ Algorithm in Solving Extensive-Form Games Linjian Meng Youzhi Zhang Zhenxing Ge Tianpei Yang Yang Gao 100 0 0 17 Mar 2025
Multi-Agent Q-Learning Dynamics in Random Networks: Convergence due to Exploration and Sparsity A. Hussain D. Leonte Francesco Belardinelli Raphael Huser Dario Paccagnan 78 0 0 13 Mar 2025
Q-MARL: A quantum-inspired algorithm using neural message passing for large-scale multi-agent reinforcement learning Kha Vo Chin-Teng Lin GNN 100 0 0 10 Mar 2025
On Separation Between Best-Iterate, Random-Iterate, and Last-Iterate Convergence of Learning in Games Yang Cai Gabriele Farina Julien Grand-Clément Christian Kroer Chung-Wei Lee Haipeng Luo Weiqiang Zheng 77 1 0 04 Mar 2025
Two-Player Zero-Sum Differential Games with One-Sided Information Mukesh Ghimire Z. Xu Yi Ren SyDa 198 0 0 17 Feb 2025
A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios Xiachong Feng Longxu Dou Ella Li Qinghao Wang Haoran Wang Yu Guo Chang Ma Lingpeng Kong LM&Ro LM&MA ELM LLMAG AI4CE 150 7 0 05 Dec 2024
Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms Thanh Nguyen-Tang Raman Arora 113 1 0 01 Nov 2024
System 2 Reasoning via Generality and Adaptation Sejin Kim Sundong Kim LRM AI4CE 120 0 0 10 Oct 2024
Learning in Games with Progressive Hiding Benjamin Heymann Marc Lanctot 62 0 0 05 Sep 2024
GPU-Accelerated Counterfactual Regret Minimization Juho Kim 78 0 0 27 Aug 2024
In-Context Exploiter for Extensive-Form Games Shuxin Li Chang Yang Youzhi Zhang Pengdeng Li Xinrun Wang Xiao Huang Hau Chan Bo An 76 0 0 10 Aug 2024
Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information Yauwai Yim Chunkit Chan Tianyu Shi Zheye Deng Wei Fan Tianshi Zheng Yangqiu Song LLMAG 98 13 0 05 Aug 2024
Perfect Information Monte Carlo with Postponing Reasoning Jérôme Arjonilla Abdallah Saffidine Tristan Cazenave 66 0 0 05 Aug 2024
A Survey on Self-play Methods in Reinforcement Learning Chao Yu Zelai Xu Chengdong Ma Chao Yu Weijuan Tu ... Deheng Ye Wenbo Ding Yaodong Yang Yu Wang Yu Wang SyDa SSL OnRL 168 9 0 02 Aug 2024
Neural Network-based Information Set Weighting for Playing Reconnaissance Blind Chess Timo Bertram Johannes Fürnkranz Martin Müller 108 1 0 08 Jul 2024
XQSV: A Structurally Variable Network to Imitate Human Play in Xiangqi Chenliang Zhou GNN 70 0 0 05 Jul 2024
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI Haruka Kita Sotetsu Koyamada Yotaro Yamaguchi Shin Ishii 73 0 0 14 Jun 2024
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning Wenzhe Li Zihan Ding Seth Karten Chi Jin 103 2 0 04 Jun 2024
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment Chen Zhang Qiang He Zhou Yuan Elvis S. Liu Hong Wang Jian Zhao Yang-Feng Wang 116 2 0 03 Jun 2024
Mixture of Public and Private Distributions in Imperfect Information Games Jérôme Arjonilla Abdallah Saffidine Tristan Cazenave 146 1 0 23 May 2024
Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents Radovan Haluška Martin Schmid LLMAG 81 0 0 25 Apr 2024
Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent Hang Xu Kai Li Bingyun Liu Haobo Fu Qiang Fu Junliang Xing Jian Cheng 70 3 0 22 Apr 2024
Transformer Based Planning in the Observation Space with Applications to Trick Taking Card Games Douglas Rebstock Christopher Solinas Nathan R Sturtevant M. Buro 49 0 0 19 Apr 2024
HSVI-based Online Minimax Strategies for Partially Observable Stochastic Games with Neural Perception Mechanisms R. Yan G. Santos G. Norman David Parker Marta Z. Kwiatkowska 69 2 0 16 Apr 2024
LookALike: Human Mimicry based collaborative decision making Rabimba Karanjai Weidong Shi 45 0 0 16 Mar 2024
Trust in AI: Progress, Challenges, and Future Directions S. Afroogh Ali Akbari Evan Malone Mohammadali Kargar Hananeh Alambeigi AI4TS 105 40 0 12 Mar 2024
Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating Yifan YangGong Haojun Pan Lei Wang 72 1 0 21 Feb 2024
Enabling Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation Ayesha Siddika Nipu Siming Liu Anthony Harris 43 2 0 13 Feb 2024
A Reinforcement Learning Approach for Dynamic Rebalancing in Bike-Sharing System Jiaqi Liang Sanjay Dominik Jena Defeng Liu Andrea Lodi 103 1 0 05 Feb 2024
PokerGPT: An End-to-End Lightweight Solver for Multi-Player Texas Hold'em via Large Language Model Chenghao Huang Yanbo Cao Yinlong Wen Tao Zhou Yanru Zhang OffRL LLMAG 81 7 0 04 Jan 2024
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property Ioannis Anagnostides Ioannis Panageas Gabriele Farina Tuomas Sandholm 88 3 0 19 Dec 2023
Recording and Describing Poker Hands Juho Kim LMTD 57 0 0 18 Dec 2023
An Invitation to Deep Reinforcement Learning Bernhard Jaeger Andreas Geiger OffRL OOD 154 5 0 13 Dec 2023
Computing Perfect Bayesian Equilibria in Sequential Auctions with Verification Vinzenz Thoma Vitor Bosshard Sven Seuken 155 1 0 07 Dec 2023
DanZero+: Dominating the GuanDan Game through Reinforcement Learning Youpeng Zhao Yudong Lu Jian Zhao Wen-gang Zhou Houqiang Li 82 6 0 05 Dec 2023
History Filtering in Imperfect Information Games: Algorithms and Complexity Christopher Solinas Douglas Rebstock Nathan R Sturtevant M. Buro 72 0 0 24 Nov 2023
PcLast: Discovering Plannable Continuous Latent States Anurag Koul Shivakanth Sujit Shaoru Chen Ben Evans Lili Wu ... Yonathan Efroni Lekan Molu Miro Dudik John Langford Alex Lamb OffRL BDL 102 1 0 06 Nov 2023
Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games Yang Cai Gabriele Farina Julien Grand-Clément Christian Kroer Chung-Wei Lee Haipeng Luo Weiqiang Zheng 80 2 0 01 Nov 2023
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game Zelai Xu Chao Yu Fei Fang Yu Wang Yi Wu LLMAG 125 95 0 29 Oct 2023
Partially Observable Stochastic Games with Neural Perception Mechanisms R. Yan G. Santos G. Norman David Parker Marta Z. Kwiatkowska 80 4 0 17 Oct 2023
Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability Revan MacQueen James R. Wright 72 2 0 17 Oct 2023
BridgeHand2Vec Bridge Hand Representation Anna Sztyber-Betley Filip Kolodziej Jan Betley Piotr Duszak GAN 38 0 0 10 Oct 2023
$\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis Zishun Yu Yunzhe Tao Liyu Chen Tao Sun Hongxia Yang 83 13 0 04 Oct 2023