FEMA-Long: Modeling unstructured covariances for discovery of time-dependent effects in large-scale longitudinal datasets.

Journal: BioRxiv : The Preprint Server For Biology
Published:
Abstract

Linear mixed-effects (LME) models are commonly used for analyzing longitudinal data. However, most applications of LME models rely on random intercepts or simple, e.g., stationary, covariance. Here, we extend the Fast and Efficient Mixed-Effects Algorithm (FEMA) and present FEMA-Long, a computationally tractable approach to flexibly modeling longitudinal covariance suitable for high-dimensional data. FEMA-Long can i) model unstructured covariance, ii) model non-linear fixed effects using splines, iii) discover time-dependent fixed effects with spline interactions, and iv) perform genome-wide association studies (GWAS) supporting discovery of time-dependent genetic effects. We applied FEMA-Long to perform a longitudinal GWAS with non-linear SNP-by-time interaction on length, weight, and body mass index of 68,273 infants with up to six measurements in the first year of life. We found dynamic patterns of random effects including time-varying heritability and correlations, as well as several genetic variants showing time-dependent effects, highlighting the applicability of FEMA-Long to enable novel discoveries.

Authors
Pravesh Parekh, Nadine Parker, Diliana Pecheva, Evgeniia Frei, Marc Vaudel, Diana Smith, Alison Rigby, Piotr Jahołkowski, Ida Sønderby, Viktoria Birkenæs, Nora Bakken, Chun Fan, Carolina Makowski, Jakub Kopal, Robert Loughnan, Donald Hagler, Dennis Van Der Meer, Stefan Johansson, Pål Njølstad, Terry Jernigan, Wesley Thompson, Oleksandr Frei, Alexey Shadrin, Thomas Nichols, Ole Andreassen, Anders Dale