Inferring Multidimensional Rates of Aging from Cross-Sectional Data

12 July 2018

Emma Pierson

Pang Wei Koh

Tatsunori Hashimoto

Abstract

Modeling how individuals evolve over time is a fundamental problem in the natural and social sciences. However, existing datasets are often cross-sectional with each individual only observed once, making it impossible to apply traditional time-series methods. Motivated by the study of human aging, we present an interpretable latent variable model that learns temporal dynamics from cross-sectional data. Our model represents each individual as a nonlinear function of a latent, linear, and low dimensional aging time-series. We prove that when this nonlinear function is constrained to be order-isomorphic, the model family is identifiable solely from cross-sectional data if the distribution of time-independent variation is known. Applied to the UK Biobank human health dataset, our model reconstructs the observed data while learning interpretable rates of aging that are associated with diseases, mortality, and aging risk factors.

View on arXiv

Comments on this paper