22

Full-Span Log-Linear Model and Fast Learning Algorithm

Neural Computation (Neural Comput.), 2022
Abstract

The full-span log-linear(FSLL) model introduced in this paper is considered an nn-th order Boltzmann machine, where nn is the number of all variables in the target system. Let X=(X0,...,Xn1)X=(X_0,...,X_{n-1}) be finite discrete random variables that can take X=X0...Xn1|X|=|X_0|...|X_{n-1}| different values. The FSLL model has X1|X|-1 parameters and can represent arbitrary positive distributions of XX. The FSLL model is a "highest-order" Boltzmann machine; nevertheless, we can compute the dual parameters of the model distribution, which plays important roles in exponential families, in O(XlogX)O(|X|\log|X|) time. Furthermore, using properties of the dual parameters of the FSLL model, we can construct an efficient learning algorithm. The FSLL model is limited to small probabilistic models up to X225|X|\approx2^{25}; however, in this problem domain, the FSLL model flexibly fits various true distributions underlying the training data without any hyperparameter tuning. The experiments presented that the FSLL successfully learned six training datasets such that X=220|X|=2^{20} within one minute with a laptop PC.

View on arXiv
Comments on this paper