Individual CpG sites that are associated with age and life expectancy become hypomethylated upon aging
© The Author(s). 2017
Received: 1 December 2016
Accepted: 19 January 2017
Published: 2 February 2017
There is a growing interest in simple molecular biomarkers for biological aging. Age-associated DNA methylation (DNAm) changes at specific CG dinucleotides can be combined into epigenetic age predictors to estimate chronological age—and the deviation of chronological and predicted age (∆age) seems to be associated with all-cause mortality. In this study, we have further validated this association and analyzed whether or not individual age-associated CG-dinucleotides (CpGs) are related to life expectancy.
In the German ESTHER cohort, we used 864 DNAm profiles of blood samples as the discovery set and 1000 DNAm profiles as the validation set to predict chronological age with three previously reported age predictors—based on 99, 71, or 353 age-associated CpGs. Several of these individual CpGs were significantly associated with life expectancy, and for some of these CpGs, this was even reproducible in the independent datasets. Notably, those CpGs that revealed significant association with life expectancy were overall rather hypomethylated upon aging.
Individual age-associated CpGs may provide biomarkers for all-cause mortality—but confounding factors need to be critically taken into consideration, and alternative methods, which facilitate more quantitative measurements at individual CpGs, might be advantageous. Our data suggest that particularly specific CpGs that become hypomethylated upon aging are indicative of biological aging.
Biomarkers for aging may allow for testing of interventions to extend lifespan or to increase the odds of staying healthy. Ideally, such biomarkers should rather reflect “biological age” than “chronological age,” and they should not be skewed by predisposition to specific diseases . Advances in molecular biology, genetics, and epigenetics have fueled the hope for simple and reliable biomarkers for biological age [2, 3].
Within the last five years, a multitude of studies demonstrated that aging is associated with highly reproducible DNA methylation (DNAm) changes at specific sites in the genome [4–8]. About 60% of these age-associated CG dinucleotides—so called “CpG sites”—become hypomethylated upon aging, whereas about 40% become hypermethylated . Age-associated hypermethylation is rather enriched close to CG islands (CGIs), whereas hypomethylation rather occurs outside of CGIs [9–12]. Furthermore, particularly DNAm at CpGs with age-associated hypermethylation seem to be coherently modified in cancer , indicating that de novo DNAm and demethylation may be regulated by different mechanisms. It is yet unclear how these DNAm patterns are regulated, and if they are functionally relevant or rather reflect other means of chromatin conformation—either way, they provide powerful biomarkers.
Several age-associated DNAm changes are acquired linearly over time and hence facilitate estimation of chronological age—either based on individual CpGs  or by integration of multiple CpGs into age predictors [5, 6, 12]. Particularly, the epigenetic clock described by Horvath , consisting of 353 age-associated CpGs, has been shown to facilitate precise age estimations across multiple tissues. Other frequently used age predictors for blood samples have been introduced by Hannum and coworkers (71 CpGs)  and Weidner et al. (99 CpGs) [17, 18]. Notably, the difference between chronological age and predicted age—referred to as ∆age—seems to be related to the parameters of biological aging: Marioni and coworkers have demonstrated that ∆age (per 5 years) was associated with a 21% higher mortality risk in the “Hannum predictor” (95% CI 1.14–1.29) and with a 11% higher mortality risk with the “Horvath predictor” (95% CI 1.05–1.18), if adjusted for chronological age and gender . Similar findings were reproduced by other study groups on other datasets [18, 20, 21]. Furthermore, epigenetic age predictions are lower in women and in semi-supercentenarians , whereas accelerated epigenetic age was associated with obesity  and with lower abilities in physical and mental fitness —suggesting that age-associated DNAm patterns may be indicative of biological aging.
In this study, we aimed for a better understanding of how epigenetic age predictions are associated with life expectancy in the ESTHER study cohort, a large population-based epidemiological study conducted in the German State of Saarland. To estimate reproducibility of results, we separated the DNAm profiles (analyzed by HumanMethylation 450 BeadChips) into a discovery set of 864 samples and a validation set of 1000 samples (further information is provided in the Additional file 1). We were particularly interested whether there are individual CpGs that reveal higher association with life expectancy than others.
Comparison of different multi-CpG age predictors
Correlation of age predictions with chronological age
Weidner99 CpGs (61 hypo- and 38 hypermethylated)
Hannum71 CpGs (31 hypo- and 40 hypermethylated)
Horvath 353 CpGs (186 hypo- and 167 hypermethylated)
Discovery set (n = 864)
Correlation with age (Spearman)
Mean average deviation (years)
Validation set (n = 1000)
Correlation with age (Spearman)
Mean average deviation (years)
Overall (n = 1864)
Correlation with age (Spearman)
Mean average deviation (years)
Previous studies have demonstrated that ∆age of the Hannum and Horvath predictors are associated with life expectancy in DNAm profiles of the ESTHER study . Here, we have analyzed if ∆age of the Weidner model would also be associated with all-cause mortality. When the results were adjusted for age, sex, batch, and leucocyte distribution, there was a clear tendency in the discovery and validation sets, but the results did not reach statistical significance (P = 0.058 and P = 0.095, respectively). When we combined the discovery and validation sets to increase statistical power, the results reached the significance (P = 0.041) and the hazard ratios were slightly lower than in the other two predictors (HR = 1.087; 95% CI 1.003–1.178; Additional file 1: Table S1). In our previous work, we analyzed the data of the Lothian Birth Cohort 1921 (LBC1921), a study from the Lothian region (Edinburgh and its surrounding areas of Scotland) with participants born in 1921 and analyzed at about the age of 79 [18, 25]: in this dataset a 5-year higher age prediction by the Weidner model was associated with 11% greater mortality risk (P = 0.0003; 95% CI 1.04, 1.19; after adjustment for age and gender). These results support the notion that the association of ∆age with all-cause mortality may vary between different aging models and cohorts—but it is overall consistent if using age predictors that comprise multiple CpGs.
Individual CpGs are associated with life expectancy
We have previously analyzed if individual age-associated CpGs are associated with life expectancy in the Lothian Birth Cohorts 1921 and 1936 . The only one CpG site that reached statistical significance in both datasets after multiple correction and adjustment for age and gender was cg05228408, which is associated with the gene for the chloride transport protein 6 (CLCN6; LBC1921 [HR = 1.16; 95% CI 1.06–1.26; P = 0.00072]; LBC1936 [HR = 1.26; 95% CI 1.12–1.42; P = 0.00013]). This genomic region is of specific interest because single-nucleotide polymorphisms identified in its vicinity were found to be associated with blood pressure and hypertension [26–28]. Therefore, we have now trained a model for the ESTHER discovery group based on the beta values of cg05228408. Upon the adjustment for chronological age, gender, batch, and leucocyte distribution, this model revealed significant association with all-cause mortality in the discovery (P = 0.0011) and in the overall population (P = 0.0148; Additional file 1: Table S2).
To our surprise, almost all of the CpGs that are associated with life expectancy in either of the two datasets were hypomethylated upon aging (Fig. 2b, c). In the discovery set there was a significant enrichment of hypomethylated CpG sites (hypergeometric distribution) for the Weidner (P = 3.3 × 10−6) and the Hannum (P = 0.0007) predictor. Furthermore, all significant CpGs in the overlap of the discovery and the validation set were hypomethylated (Additional file 1: Table S6).
We revisited the previously published data on association of these CpGs in the Lothian Birth Cohort 1921 . A big advantage in this cohort is that it comprises donors of a defined age range (about 79 years)—and hence, a different slope in the comparison of predicted and chronological ages would hardly affect the association with life expectancy. Only four CpGs of the Weidner predictor reached statistical significance in LBC1921 (adjusted P value <0.05), and all of them were also significant in the ESTHER discovery set: cg05228408 (CLCN6), cg12554573 (PARP3), cg25268718 (PSME1), and cg03224418 (SAMD10)—furthermore, all of them become hypomethylated upon aging (Additional file 1: Figure S2A). However, for the CpGs of the Hannum predictor, the reproducibility between the LBC1921 and the ESTHER cohorts was low. In general, CpGs that revealed significant association with life expectancy in LBC1921 and LBC1936 were rather hypomethylated, but these results did not reach statistical significance (Additional file 1: Figure S2B, C).
Our explorative study further supports the notion that specific age-associated CpGs can be indicative of life expectancy, but the reproducibility in independent cohorts is overall not very high. Furthermore, we demonstrate that significant association with all-cause mortality is particularly observed in CpGs that become hypomethylated upon aging. It is therefore conceivable that a combination of such specific age-associated CpGs gives rise to alternative epigenetic age predictors that better reflect the association of ∆age with all-cause mortality—and may hence be a better biomarker for biological aging.
There are however limitations that need to be critically taken into consideration: (1) only blood samples have been considered for this analysis, and it remains to be demonstrated if the findings hold also true for cells from other tissues; (2) the association of life expectancy with CpGs that become hypomethylated upon aging was only addressed on elderly people, whereas biomarkers for biological aging may rather be desired for young humans who had not yet developed age-related diseases ; (3) ∆age of epigenetic age predictions may have systematic offsets, and hence, it remains a challenge to entirely rule out that the results are impacted by chronological age; (4) the beta values of Illumina BeadChip correlate with the absolute level of DNAm, but the precision is not always high . Particularly, for age predictors based on individual CpGs, it therefore appears to be advantageous to train model on data that was generated by more quantitative methods—such as pyrosequencing, MassARRAY, bisulfite deep sequencing, or digital PCR ; and (5) last but not least, the association with all-cause mortality is only one aspect of biological aging, and it will be important to better understand the association with other molecular parameters, such as telomere length, or functional measures, such as physical strength, cognitive decline, and other signs of aging .
The ESTHER study was supported by the Baden-Württemberg State Ministry of Science, Research, and Arts (Stuttgart, Germany), the Federal Ministry of Education and Research (Berlin, Germany), and the Federal Ministry of Family Affairs, Senior Citizens, Women, and Youth (Berlin, Germany). The sponsors had no role in the study design, in the collection, analysis and interpretation of data, and preparation, review, or approval of the manuscript. WW was supported by the Else Kröner-Fresenius Stiftung (2014 A193), the German Research Foundation (WA/1706/8-1), and the Interdisciplinary Center for Clinical Research (IZKF) within the Faculty of Medicine at the RWTH Aachen University (O1-1).
Availability of data and materials
Data protection standards, which were part of the informed consent procedure of the ESTHER study, preclude that data can be deposited in publically available repositories. Individual data access may be granted within a framework of scientific cooperation.
YZ, HB, and WW conceived the study. YZ performed bioinformatics analysis. JH and WW performed cross validations. WW wrote the first draft of the manuscript. All authors read and approved the final manuscript.
WW is involved in the company Cygenia GmbH that may provide service for epigenetic age predictions to other scientists (www.cygenia.com). The authors’ declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
The ESTHER study was approved by the ethics committees of the University of Heidelberg and of the state medical board of Saarland, Germany. All participants provided written informed consent.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Baker GT, Sprott RL. Biomarkers of aging. Exp Gerontol. 1988;23:223–39.View ArticlePubMedGoogle Scholar
- Burkle A, Moreno-Villanueva M, Bernhard J, Blasco M, Zondag G, et al. MARK-AGE biomarkers of ageing. Mech Ageing Dev. 2015;151:2–12.View ArticlePubMedGoogle Scholar
- Belsky DW, Moffitt TE, Cohen AA, Corcoran D, Horvath S, et al. Telomere, epigenetic clock, and biomarker-composite quantifications of biological aging: do they measure the same thing? bioRxiv 2016;doi: http://dx.doi.org/10.1101/071373.Google Scholar
- Bork S, Pfister S, Witt H, Horn P, Korn B, Ho AD, Wagner W. DNA methylation pattern changes upon long-term culture and aging of human mesenchymal stromal cells. Aging Cell. 2010;9:54–63.View ArticlePubMedPubMed CentralGoogle Scholar
- Koch CM, Wagner W. Epigenetic-aging-signature to determine age in different tissues. Aging (Albany NY). 2011;3:1018–27.View ArticleGoogle Scholar
- Bocklandt S, Lin W, Sehl ME, Sanchez FJ, Sinsheimer JS, Horvath S, Vilain E. Epigenetic predictor of age. PLoS One. 2011;6:e14821.View ArticlePubMedPubMed CentralGoogle Scholar
- Teschendorff AE, Menon U, Gentry-Maharaj A, Ramus SJ, Weisenberger DJ, et al. Age-dependent DNA methylation of genes that are suppressed in stem cells is a hallmark of cancer. Genome Res. 2010;20:440–6.View ArticlePubMedPubMed CentralGoogle Scholar
- Rakyan VK, Down TA, Maslau S, Andrew T, Yang TP, et al. Human aging-associated DNA hypermethylation occurs preferentially at bivalent chromatin domains. Genome Res. 2010;20:434–9.View ArticlePubMedPubMed CentralGoogle Scholar
- Johansson A, Enroth S, Gyllensten U. Continuous aging of the human DNA methylome throughout the human lifespan. PLoS One. 2013;8:e67378.View ArticlePubMedPubMed CentralGoogle Scholar
- McClay JL, Aberg KA, Clark SL, Nerella S, Kumar G, et al. A methylome-wide study of aging using massively parallel sequencing of the methyl-CpG-enriched genomic fraction from blood in over 700 subjects. Hum Mol Genet. 2014;23:1175–85.View ArticlePubMedGoogle Scholar
- Christensen BC, Houseman EA, Marsit CJ, Zheng S, Wrensch MR, et al. Aging and environmental exposures alter tissue-specific DNA methylation dependent upon CpG island context. PLoS Genet. 2009;5:e1000602.View ArticlePubMedPubMed CentralGoogle Scholar
- Florath I, Butterbach K, Muller H, Bewerunge-Hudler M, Brenner H. Cross-sectional and longitudinal changes in DNA methylation with age: an epigenome-wide analysis revealing over 60 novel age-associated CpG sites. Hum Mol Genet. 2014;23:1186–201.View ArticlePubMedGoogle Scholar
- Lin Q, Wagner W. Epigenetic aging signatures are coherently modified in cancer. PLoS Genet. 2015;11:e1005334.View ArticlePubMedPubMed CentralGoogle Scholar
- Garagnani P, Bacalini MG, Pirazzini C, Gori D, Giuliani C, et al. Methylation of ELOVL2 gene as a new epigenetic marker of age. Aging Cell. 2012;11:1132–4.View ArticlePubMedGoogle Scholar
- Horvath S. DNA methylation age of human tissues and cell types. Genome Biol. 2013;14:R115.View ArticlePubMedPubMed CentralGoogle Scholar
- Hannum G, Guinney J, Zhao L, Zhang L, Hughes G, et al. Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol Cell. 2013;49:459–367.View ArticleGoogle Scholar
- Weidner CI, Lin Q, Koch CM, Eisele L, Beier F, et al. Aging of blood can be tracked by DNA methylation changes at just three CpG sites. Genome Biol. 2014;15:R24.View ArticlePubMedPubMed CentralGoogle Scholar
- Lin Q, Weidner CI, Costa IG, Marioni RE, Ferreira MR, Deary IJ, Wagner W. DNA methylation levels at individual age-associated CpG sites can be indicative for life expectancy. Aging (Albany NY). 2016;8:394–401.Google Scholar
- Marioni RE, Shah S, McRae AF, Chen BH, Colicino E, et al. DNA methylation age of blood predicts all-cause mortality in later life. Genome Biol. 2015;16:25.View ArticlePubMedPubMed CentralGoogle Scholar
- Perna L, Zhang Y, Mons U, Holleczek B, Saum KU, Brenner H. Epigenetic age acceleration predicts cancer, cardiovascular, and all-cause mortality in a German case cohort. Clin Epigenetics. 2016;8:64.View ArticlePubMedPubMed CentralGoogle Scholar
- Christiansen L, Lenart A, Tan Q, Vaupel JW, Aviv A, McGue M, Christensen K. DNA methylation age is associated with mortality in a longitudinal Danish twin study. Aging Cell. 2016;15:5.View ArticleGoogle Scholar
- Horvath S, Pirazzini C, Bacalini MG, Gentilini D, Di Blasio AM, et al. Decreased epigenetic age of PBMCs from Italian semi-supercentenarians and their offspring. Aging (Albany NY). 2015;7:1159–70.View ArticleGoogle Scholar
- Horvath S, Erhart W, Brosch M, Ammerpohl O, Von SW, et al. Obesity accelerates epigenetic aging of human liver. Proc Natl Acad Sci U S A. 2014;111:15538–43.View ArticlePubMedPubMed CentralGoogle Scholar
- Marioni RE, Shah S, McRae AF, Ritchie SJ, Muniz-Terrera G, et al. The epigenetic clock is correlated with physical and cognitive fitness in the Lothian Birth Cohort 1936. Int J Epidemiol. 2015;44:1388–96.View ArticlePubMedPubMed CentralGoogle Scholar
- Deary IJ, Gow AJ, Pattie A, Starr JM. Cohort profile: the Lothian Birth Cohorts of 1921 and 1936. Int J Epidemiol. 2012;41:1576–84.View ArticlePubMedGoogle Scholar
- Tomaszewski M, Debiec R, Braund PS, Nelson CP, Hardwick R, et al. Genetic architecture of ambulatory blood pressure in the general population: insights from cardiovascular gene-centric array. Hypertension. 2010;56:1069–76.View ArticlePubMedPubMed CentralGoogle Scholar
- Levy D, Ehret GB, Rice K, Verwoert GC, Launer LJ, et al. Genome-wide association study of blood pressure and hypertension. Nat Genet. 2009;41:677–87.View ArticlePubMedPubMed CentralGoogle Scholar
- Newton-Cheh C, Johnson T, Gateva V, Tobin MD, Bochud M, et al. Genome-wide association study identifies eight loci associated with blood pressure. Nat Genet. 2009;41:666–76.View ArticlePubMedPubMed CentralGoogle Scholar
- Belsky DW, Caspi A, Houts R, Cohen HJ, Corcoran DL, et al. Quantification of biological aging in young adults. Proc Natl Acad Sci U S A. 2015;112:E4104–4110.View ArticlePubMedPubMed CentralGoogle Scholar
- BLUEPRINT consortium. Quantitative comparison of DNA methylation assays for biomarker development and clinical applications. Nat Biotechnol. 2016;34:726–37.View ArticleGoogle Scholar