491

On the Statistical Efficiency of Mean Field Reinforcement Learning with General Function Approximation

International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Abstract

In this paper, we study the statistical efficiency of Reinforcement Learning in Mean-Field Control (MFC) and Mean-Field Game (MFG) with general function approximation. We introduce a new concept called Mean-Field Model-Based Eluder Dimension (MBED), which subsumes a rich family of Mean-Field RL problems. Additionally, we propose algorithms based on Optimistic Maximal Likelihood Estimation, which can return an ϵ\epsilon-optimal policy for MFC or an ϵ\epsilon-Nash Equilibrium policy for MFG, with sample complexity polynomial w.r.t. relevant parameters and independent of the number of states, actions and the number of agents. Notably, our results only require a mild assumption of Lipschitz continuity on transition dynamics comparing with previous works.

View on arXiv
Comments on this paper