Disentangling in Variational Autoencoders with Natural Clustering

27 January 2019

Javier Antorán

Abstract

Learning robust representations that disentangle the underlying factors of variability in data is an intuitive way to achieve generalisation in deep models. Consequently, it has been the object of much effort from the machine learning community. This work takes a step further in this direction by addressing the scenario where generative factors present a multimodal distribution due to the existence of class distinction in the data. We formulate a lower bound on the joint log-likelihood of inputs and class labels and present N-VAE, a model which is capable of separating factors of variation which are exclusive to certain classes from factors that are shared among classes. This model implements an explicitly compositional latent variable structure by defining a class-conditioned latent space and a shared latent space. We show its usefulness for detecting and disentangling class-dependent generative factors as well as its capacity to generate artificial samples which contain characteristics unseen in the training data.

View on arXiv

Comments on this paper