Dynamic social learning under graph constraints

IEEE Transactions on Control of Network Systems (TCNS), 2020

8 July 2020

Konstantin Avrachenkov

Abstract

We argue that graph-constrained dynamic choice with reinforcement can be viewed as a scaled version of a special instance of replicator dynamics. The latter also arises as the limiting differential equation for the empirical measures of a vertex reinforced random walk on a directed graph. We use this equivalence to show that for a class of positively $\alpha$ -homogeneous rewards, $\alpha > 0$ , the asymptotic outcome concentrates around the optimum in a certain limiting sense when `annealed' by letting $\alpha\uparrow\infty$ slowly. We also discuss connections with classical simulated annealing.

View on arXiv

Comments on this paper