Non-monotonic Value Function Factorization for Deep Multi-Agent
Reinforcement Learning
- OffRL
Abstract
In this paper, I propose actor-critic approaches by introducing an actor policy on QMIX ([1]), which can remove the monotonicity constraint of QMIX and implement a non-monotonic value function factorization for joint action-value.
View on arXivComments on this paper
