226

Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning

Abstract

In this paper, I propose actor-critic approaches by introducing an actor policy on QMIX ([1]), which can remove the monotonicity constraint of QMIX and implement a non-monotonic value function factorization for joint action-value.

View on arXiv
Comments on this paper