Author:
Higgins-Chen Albert T.,Thrush Kyra L.,Wang Yunzhang,Kuo Pei-Lun,Wang Meng,Minteer Christopher J.,Moore Ann Zenobia,Bandinelli Stefania,Vinkers Christiaan H.,Vermetten Eric,Rutten Bart P.F.,Geuze Elbert,Okhuijsen-Pfeifer Cynthia,van der Horst Marte Z.,Schreiter Stefanie,Gutwinski Stefan,Luykx Jurjen J.,Ferrucci Luigi,Crimmins Eileen M.,Boks Marco P.,Hägg Sara,Hu-Seliger Tina T.,Levine Morgan E.
Abstract
AbstractEpigenetic clocks are widely used aging biomarkers calculated from DNA methylation data. Unfortunately, measurements for individual CpGs can be surprisingly unreliable due to technical noise, and this may limit the utility of epigenetic clocks. We report that noise produces deviations up to 3 to 9 years between technical replicates for six major epigenetic clocks. The elimination of low-reliability CpGs does not ameliorate this issue. Here, we present a novel computational multi-step solution to address this noise, involving performing principal component analysis on the CpG-level data followed by biological age prediction using principal components as input. This method extracts shared systematic variation in DNAm while minimizing random noise from individual CpGs. Our novel principal-component versions of six clocks show agreement between most technical replicates within 0 to 1.5 years, equivalent or improved prediction of outcomes, and more stable trajectories in longitudinal studies and cell culture. This method entails only one additional step compared to traditional clocks, does not require prior knowledge of CpG reliabilities, and can improve the reliability of any existing or future epigenetic biomarker. The high reliability of principal component-based epigenetic clocks will make them particularly useful for applications in personalized medicine and clinical trials evaluating novel aging interventions.
Publisher
Cold Spring Harbor Laboratory