Bayesian Hypernetworks
- UQCVBDL
We propose Bayesian hypernetworks: a framework for approximate Bayesian inference in neural networks. A Bayesian hypernetwork, , is a neural network which learns to transform a simple noise distribution, , to a distribution over the parameters of another neural network (the "primary network"). We train with variational inference, using an invertible to enable efficient estimation of the variational lower bound on the posterior via sampling. In contrast to most methods for Bayesian deep learning, Bayesian hypernets can represent a complex multimodal approximate posterior with correlations between parameters, while enabling cheap i.i.d. sampling of . We demonstrate these qualitative advantages of Bayesian hypernets, which also achieve competitive performance on a suite of tasks that demonstrate the advantage of estimating model uncertainty, including active learning and anomaly detection.
View on arXiv