159

Convergence of Contrastive Divergence Algorithm in Exponential Family

Abstract

This paper studies the convergence properties of contrastive divergence algorithm for parameter inference in exponential family, by relating it to Markov chain theory and stochastic stability literature. We prove that, under mild conditions and given a finite data sample X1,,XnpθX_1,\dots,X_n \sim p_{\theta^*} i.i.d. in an event with probability approaching to 1, the sequence {θt}t0\{\theta_t\}_{t \ge 0} generated by CD algorithm is a positive Harris recurrent chain, and thus processes an unique invariant distribution πn\pi_n. The invariant distribution concentrates around the Maximum Likelihood Estimate at a speed arbitrarily slower than n\sqrt{n}, and the number of steps in Markov Chain Monte Carlo only affects the coefficient factor of the concentration rate. Finally we conclude that as nn \to \infty, lim supt1ts=1tθsθp0.\limsup_{t \to \infty} \left\Vert \frac{1}{t} \sum_{s=1}^t \theta_s - \theta^*\right\Vert \overset{p}{\to} 0.

View on arXiv
Comments on this paper