Skip to main content

Whole blood DNA methylation aging markers predict colorectal cancer survival: a prospective cohort study



Blood DNA methylation-based aging algorithms predict mortality in the general population. We investigated the prognostic value of five established DNA methylation aging algorithms for patients with colorectal cancer (CRC).


AgeAccelHorvath, AgeAccelHannum, DNAmMRscore, AgeAccelPheno and AgeAccelGrim were constructed using whole blood epi-genomic data from 2206 CRC patients. After a median follow-up of 6.2 years, 1079 deaths were documented, including 596 from CRC. Associations of the aging algorithms with survival outcomes were evaluated using the Cox regression and survival curves. Harrell’s C-statistics were computed to investigate predictive performance.


Adjusted hazard ratios (95% confidence intervals) of all-cause mortality for patients in the third compared to the first tertile were 1.66 (1.32, 2.09) for the DNAmMRscore, 1.35 (1.14, 1.59) for AgeAccelPheno and 1.65 (1.37, 2.00) for AgeAccelGrim, even after adjustment for age, sex and stage. AgeAccelHorvath and AgeAccelHannum were not associated with all-cause or CRC-specific mortality. In stage-specific analyses, associations were much stronger for patients with early or intermediate stage cancers (stages I, II and III) than for patients with metastatic (stage IV) cancers. Associations were weaker and less often statistically significant for CRC-specific mortality. Adding DNAmMRscore, AgeAccelPheno or AgeAccelGrim to models including age, sex and tumor stage improved predictive performance moderately.


DNAmMRscore, AgeAccelPheno and AgeAccelGrim could serve as non-invasive CRC prognostic biomarkers independent of other commonly used markers. Further research should aim for tailoring and refining such algorithms for CRC patients and to explore their value for enhanced prediction of treatment success and treatment decisions.


Colorectal cancer (CRC) is one of the leading causes of cancer death, accounting for approximately 9% of the total cancer deaths globally [1]. While declines in CRC mortality rates occurred in Western countries in recent years, CRC mortality rates continue to increase in many middle- and low-income countries [2]. Besides enhanced early detection, enhanced prediction of patients’ prognosis might open new avenues of more effective, personalized treatment strategies to further reduce mortality rates [3]. The tumor-node-metastasis (TNM) staging system is widely utilized to predict CRC prognosis and to guide adjuvant therapy after potential curative surgery. However, the TNM system is not satisfactory in predicting clinical outcomes for patients with intermediate stages [4], and markers that have prognostic value beyond the TNM system are highly desirable.

Research on prognostic markers for CRC patients has largely focused on characteristics of the tumor tissue, whereas less research has been devoted to other indicators of CRC patient prognosis. Recently, a number of studies have disclosed major prognostic value of aging-related changes in methylation of whole blood DNA with respect to mortality in general population cohorts [5,6,7,8,9]. If and to what extent they may also be useful for predicting chances of survival of CRC patients has, to our knowledge, not previously been addressed in large-scale studies. We aimed to evaluate the prognostic value of five recently proposed aging-related algorithms of DNA methylation (DNAm) derived from whole blood DNA with respect to total and CRC-specific mortality in a large cohort of CRC patients from Germany.


Study design and population

Our analysis is based on prospective follow-up of CRC patients recruited in the context of the German DACHS (Darmkrebs: Chancen der Verhütung durch Screening) Study, an ongoing population-based case–control study on CRC. Details of the DACHS study design have been described elsewhere [10,11,12,13]. In brief, patients with a first diagnosis of CRC (ICD 10 codes C18-C20) aged at least 30 years (without an upper age limit) are recruited in all of the 22 clinics providing first-line treatment for CRC in the Rhine-Neckar region in Southern Germany. The current analysis includes patients diagnosed in 2003–2010 for whom comprehensive follow-up with respect to survival outcomes was completed and for whom DNA methylation microarray data from blood samples taken at baseline were available.

Data collection

The patients were recruited by their treating physician during first hospital stay due to CRC and notified to the study center at the German Cancer Research Center after receipt of informed consent. Personal interviews by trained interviewers were scheduled at the earliest possible convenience, either during hospital stay or shortly thereafter at patients’ homes, in which sociodemographic information, medical and lifestyle information was collected using a standardized questionnaire. Comprehensive medical data, including data on patient and tumor characteristics and treatment, were extracted from medical records. Peripheral blood samples were collected after the interview and stored at − 80 °C. The time of blood drawing could be prior (within 2 weeks) to surgery and after surgery including before, during and after adjuvant therapy. Standardized follow-up information on newly diagnosed diseases and recurrences was provided by patients’ physicians 3 and 5 years after diagnosis. Data on vital status, date and cause of death were obtained from local population registers and public health authorities. All patients provided written informed consent. The study was approved by the ethical committees of the Medical Faculty of the University of Heidelberg and the Medical Chambers of Baden-Württemberg and Rhineland-Palatinate.

DNAm assessment

DNA was extracted from whole blood samples using standard procedures. Whole blood DNA methylation profiles were obtained using the Infinium MethylationEPIC BeadChip Kit that covers over 850,000 CpG sites (Illumina, Inc, San Diego, CA, USA) according to the manufacturer's protocol. We excluded probes with detection P value > 0.01 or missing value > 10% from the analysis. Pre-processing and normalization of DNA methylation data were conducted following the pipeline of Lehne et al. [14]. The methylation proportions at each CpG site (beta values) were calculated using normalized intensity values. Leukocyte composition was estimated using Houseman’s algorithms [15].

DNAm aging algorithms calculation

Table 1 shows basic information on five DNAm aging algorithms, including Horvath’s algorithm [5], Hannum’s algorithm [6], DNAm mortality risk score (DNAmMRscore) [7], DNAmPhenoAge [8] and DNAmGrimAge [9]. Typically, DNAm aging algorithms are constructed by regressing the chronological age or a surrogate measure of biological age on a set of CpG sites from specific tissues using penalized regression analyses, such as LASSO or elastic net regression [16]. Horvath’s algorithm was developed based on 353 CpGs that were related to a transformed version of chronological age [5]. Hannum’s algorithm was built based on 71 age-related CpGs [6]. Unlike Horvath’s algorithm and Hannum’s algorithm, DNAmMRscore, DNAmPhenoAge and DNAmGrimAge were developed by replacing prediction of chronological age with prediction of lifespan and/or surrogate of health span [7,8,9]. To develop DNAmMRscore, 10 of 58 mortality-related CpG sites were selected by LASSO Cox regression model [7]. DNAmPhenoAge is based on 513 CpGs, which were associated with a phenotypic age, a combination of chronological age and nine biomarkers that reflect the function of liver, kidney, metabolism and immune system [8]. Similarly, AgeAccelGrim was constructed with age, sex as well as 1030 CpGs, which were related to smoking pack-year and seven mortality-related plasma proteins [9].

Table 1 Overview of DNA methylation aging algorithms

Age acceleration (AgeAccel) is defined as the residual resulting from regressing DNAm algorithms on chronological age [5]. Thus, a positive value of AgeAccel indicates accelerated aging and premature mortality. In this analysis, the age acceleration of Horvath’s algorithm, Hannum’s algorithm, DNAm PhenoAge and DNAm GrimAge were used and denoted by AgeAccelHorvath, AgeAccelHannum, AgeAccelPheno and AgeAccelGrim, respectively. They were computed using an online DNAm aging algorithm calculator ( [5]. DNAmMRscore was not transformed to the AgeAccel version since it originally was designed as a predictor of mortality [7]. In addition, two CpGs of the original DNAmMRscore, which had been derived using a 450 K CpG DNA methylation microarray, were not included in the EPIC microarray data. We thus developed an equation (as follows) of constructing DNAmMRscore based on eight CpGs by regressing the original DNAmMRscore of ten CpGs on the remaining eight CpGs in the 450 K microarray data of the German ESTHER Study [17], which had been used to develop the DNAmMRscore [7].

$$\begin{aligned} {\text{DNAmMRscore}}&= - 0.36909 - 1.09957 \times cg01612140\\ & \quad - 1.65446 \times cg05575921 + 3.12883 \times cg08362785\\ & \quad - 0.22268 \times cg10321156 - 0.30369 \times cg14975410 \\ & \quad - 0.31940 \times cg19572487 - 3.39726 \times cg24704287\\ & \quad - 1.93238 \times cg25983901 \\ \end{aligned}$$

Statistical methods

The correlations among AgeAccelHorvath, AgeAccelHannum, DNAmMRscore, AgeAccelPheno and AgeAccelGrim were assessed with Pearson correlation coefficients and scatter plots. The distribution of the DNAm aging algorithms was described by median and interquartile range (IQR) and compared across categorical baseline characteristics of the study population by Kruskal–Wallis test.

Cox proportional hazards regression accounting for delayed entry was used to assess the associations of DNAm aging algorithms [per standard deviation (SD) increase and classified in tertiles] with all-cause mortality (or overall survival) and CRC-specific mortality (or CRC-specific survival). In addition, competing risk was considered in the analysis for CRC-specific mortality. The Schoenfeld Residuals method was applied to test if the algorithms violate the assumption of Cox regression. A “clinical model” was performed as the main model adjusting for the factors that are easily obtained in clinical settings, including chronological age, sex, stage, measurement batch and leukocyte composition (Houseman’s algorithms). Furthermore, stage-specific HRs and survival curves with adjustment for age, sex, batch and leukocyte composition were used to assess whether the association between DNAm markers and CRC prognosis differs depending on tumor stages. Tests for interaction were carried out by setting variable cross-product terms of DNAm aging algorithm with stage in the model. The difference between survival curves was evaluated using the G-rho family of tests.

Sensitivity analyses were performed to investigate the association between DNAm aging algorithms and CRC prognosis with a more comprehensive adjustment for the variables that are shown in Table 2, including age, sex, batch, leukocyte composition, tumor stage, body mass index (BMI, kg/m2), smoking status (never, former and current smokers), alcohol consumption (gram of ethanol per day), tumor subsite and Charlson comorbidity index (CCI) score that was calculated from comorbidities at the time of CRC diagnosis [18]. Additionally, to exclude the influence of chemotherapy and/or radiotherapy on DNAm markers, we assessed the association between the DNAm and CRC prognosis among patients who had not received any chemotherapy or radiotherapy during the follow-up.

Table 2 Clinical characteristics at baseline in the DACHS study

Predictive accuracy and discriminating ability of DNAm aging algorithms were evaluated using Harrell's concordance statistics (C-statistics) and were compared with age, sex and stage. A C-statistic value of 0.5 suggests no discrimination, and 1.0 indicates perfect discrimination.

Hazard ratios and Harrell’s C-statistics were derived using the PROC PHREG in SAS version 9.4 (SAS Institute, Cary, NC). Correlation matrix and adjusted survival curves were produced using the R 3.6.0 with the packages corrplot and survminer, respectively [19]. Statistical significance was defined by P < 0.05 in two-sided testing.


Clinical characteristics of study population

We included 2206 eligible patients diagnosed with CRC, of whom 18.1%, 34.4%, 32.9% and 14.0% were diagnosed in stage I, II, III and IV, respectively. Over a median of 6.2 years (IQR 3.7–10.1) of follow-up, a total of 1079 deaths occurred, including 596 deaths due to CRC.

Table 2 describes baseline characteristics of the study population, which included more men (58.8%) than women and had a median age of 69 years. Most patients were diagnosed in either stage II (34.6%) or stage III (33.1%), and more than 40% had relevant comorbidity (CCI > 0). The distribution of AgeAccelHorvath, AgeAccelHannum, DNAmMRscore, AgeAccelPheno and AgeAccelGrim according to categorical baseline characteristics is presented in Additional file 1: Table S1. The levels of all DNAm aging algorithms were higher in females, smokers, those consuming more alcohol and patients with higher CCI and advanced stage CRC. Additional file 1: Fig. S1 presents correlations of AgeAccelHannum, DNAmMRscore, AgeAccelPheno and AgeAccelGrim with leukocyte composition. Levels of DNAm aging algorithms did not vary by the year of blood sampling (Additional file 1: Fig. S2).

Correlation among DNAm aging algorithms

All DNAm aging algorithms were statistically significantly correlated with each other, as shown in Additional file 1: Fig. S3. DNAmMRscore showed a moderate positive correlation with AgeAccelHannum (ρ = 0.46), AgeAccelPheno (ρ = 0.46) and AgeAccelGrim (ρ = 0.63), but a weaker correlation with AgeAccelHorvath (ρ = 0.14).

Association of DNAm aging algorithms with CRC prognosis

Table 3 shows the association between DNAm aging algorithms and all-cause mortality of CRC patients. In the analyses including patients with any stage, we observed marginal non-significant associations for AgeAccelHorvath and AgeAccelHannum and statistically significant associations for DNAmMRscore, AgeAccelPheno and AgeAccelGrim. HRs (95%CIs) were 1.17 (0.99, 1.38), 1.12 (0.94, 1.33), 1.66 (1.32, 2.09), 1.35 (1.14, 1.59) and 1.65 (1.37, 2.00) for the association of all-cause mortality with upper (vs. lower) tertiles of AgeAccelHorvath, AgeAccelHannum, DNAmMRscore, AgeAccelPheno and AgeAccelGrim, respectively. In stage-specific analysis, as shown in Table 3 and Figs. 1, 2 and 3, associations of DNAm aging algorithms and overall survival attenuated with increased severity of CRC. Survival was worst among the patients with highest levels of DNAm aging algorithms for early and intermediate stage. Among stage IV patients, medium levels of AgeAccelPheno were associated with highest risk of mortality (Fig. 2). However, the interaction between the algorithms and stages was not statistically significant.

Table 3 Associations of DNA methylation aging markers with all-cause mortality
Fig. 1

Stage-specific survival curves for overall and cancer-specific survival of CRC patients by tertiles of DNAmMRscore. a Overall and b CRC-specific survival curve among stage I and II patients; c overall and d CRC-specific survival curve among stage III; e overall and f CRC-specific survival curve among stage IV. Stage-specific survival curves were adjusted for age, sex, batch and leukocyte composition (Houseman’s algorithm)

Fig. 2

Stage-specific survival curves for overall and cancer-specific survival of CRC patients by tertiles of AgeAccelPheno. a Overall and b CRC-specific survival curve among stage I and II patients; c overall and d CRC-specific survival curve among stage III; e overall and f CRC-specific survival curve among stage IV. Stage-specific survival curves were adjusted for age, sex, batch and leukocyte composition (Houseman’s algorithm)

Fig. 3

Stage-specific survival curves for overall and cancer-specific survival of CRC patients by tertiles of AgeAccelGrim. a Overall and b CRC-specific survival curve among stage I and II patients; c overall and d CRC-specific survival curve among stage III; e overall and f CRC-specific survival curve among stage IV. Stage-specific survival curves were adjusted for age, sex, batch and leukocyte composition (Houseman’s algorithm)

As shown in Table 4, associations of higher DNAm aging algorithms with poorer survival were weaker for CRC-specific survival than for overall survival. HRs (95% CIs) for the comparison of the upper tertile with the lower tertile of AgeAccelHorvath, AgeAccelHannum, DNAmMRscore, AgeAccelPheno and AgeAccelGrim were 1.17 (0.94, 1.46), 1.06 (0.83, 1.36), 1.54 (1.11, 2.15), 1.25 (0.98, 1.59) and 1.28 (0.84, 1.93), respectively. Table 4 and Figs. 1, 2 and 3 show that only AgeAccelHorvath was statistically significantly associated with CRC-specific mortality among stage I and II patients. Among stage III patients, the associations were statistically significant for DNAmMRscore and AgeAccelPheno. Among stage IV patients, AgeAccelGrim showed a marginally significant association with CRC-specific mortality.

Table 4 Associations of DNA methylation aging markers with CRC-specific mortality

God Additional file 1: Tables S2 and S3 show that additional adjustments for BMI, smoking status, alcohol consumption, tumor subsite and CCI changed the association of DNAm aging algorithms with all-cause mortality and CRC-specific mortality only slightly. Additional file 1: Table S4 shows that the associations of DNAmMRscore, AgeAccelPheno and AgeAccelGrim with both outcomes were stronger among patients who received surgery only. AgeAccelHorvath was statistically significantly associated with all-cause mortality, but not with CRC-specific mortality.

Predictive utility of DNAmMRscore, AgeAccelPheno and AgeAccelGrim

Table 5 presents the discrimination ability of various combinations of CRC prognostic factors, including age, sex, stage, DNAmMRscore, AgeAccelPheno and AgeAccelGrim. The performance of prediction was moderately improved after adding DNAm aging algorithms in models including age, sex and stage. For all-cause mortality, models including AgeAccelGrim showed tentatively stronger predictive ability than the others among patients of all stages and in patients with stages I and II or III. For CRC-specific mortality, similar improvements in predictive ability were achieved by adding either one of the three algorithms to the models. Moreover, a model of combining DNAmMRscore, AgeAccelPheno and AgeAccelGrim did not significantly improve the predictive performance compared with the single algorithm model (data not shown).

Table 5 Harrell’s C-statistics (95% CI) for all-cause mortality and CRC-specific mortality prediction


To our knowledge, this study is the first to investigate longitudinal association of five frequently used DNAm aging algorithms with CRC prognosis. Of the five algorithms, DNAmMRscore, AgeAccelPheno and AgeAccelGrim were positively associated with all-cause and CRC-specific mortality. Associations were strongest for DNAmMRscore and were generally stronger for all-cause mortality than for CRC-specific mortality. Adding either of DNAmMRscore, AgeAccelPheno or AgeAccelGrim to models including age, sex and stage moderately increased prognostic performance with respect to either all-cause mortality or CRC-specific mortality within all stages, including stage IV.

Previous studies have shown that Horvath’s and Hannum’s algorithms are statistically significantly associated with all-cause mortality in older general populations [20,21,22,23]. Consistent with our findings, DNAmMRsocre, PhenoAge and GrimAge outperformed the first generation of DNAm aging algorithms regarding mortality prediction [8, 9, 24,25,26]. Few studies have focused on the prognostic values of DNAm aging algorithms among cancer patients. Dugué and colleagues compared different variations of Horvath’s and Hannum’s algorithms and concluded that the increased age acceleration was associated with higher cancer mortality [27]. Moreover, Zheng et al. observed a significantly positive association of Horvath’s algorithm with overall survival of CRC for the comparison of age acceleration group and age deceleration group, which is not supported by our study [28]. In Zheng’s analysis, the Cox model was adjusted for only tumor stage and molecular subtype, which may not be sufficient to exclude confounding due to age, sex and leukocyte composition alteration.

DNAmMRscore, AgeAccelPheno and AgeAccelGrim were modestly correlated with each other in our study. Unlike other algorithms, DNAmMRscore is explicitly trained to predict Mortality. It is developed based on much fewer CpG sites that were related to all-mortality, severe conditions and smoking [7]. More clinical and/or lifestyle characteristics were considered in the development of AgeAccelPheno and AgeAccelGrim. As for AgeAccelPheno, chronological age and nine mortality-related clinical markers such as C-reactive protein were integrated and were regressed on DNAm data [8]. Finally, AgeAccelGrim was computed using the methylation pattern of CpG sites, which were associated with seven plasma proteins, smoking pack-year and all-cause mortality [9]. The significant correlation of DNAmMRscore, AgeAccelPheno and AgeAccelGrim with comorbidity suggests that the predictive power for CRC prognosis can be improved by regressing clinical outcomes and biomarkers on DNAm data in the process of CpG sites selection. In other words, DNAmMRscore, AgeAccelPheno and AgeAccelGrim were likely to capture pathophysiological information in the prediction of mortality risk among CRC patients. Although DNAmMRscore, AgeAccelPheno and AgeAccelGrim showed similar predictive performance regarding CRC prognosis, DNAmMRscore achieved such prognostic performance with much fewer CpG sites.

Even though there has been substantial improvement in the prognosis of patients with CRC over the last decades, it remains challenging to stratify patients with specific CRC stages and to make decisions on treatments from which they can benefit most [29]. Although there has been intensive search for blood-based biomarkers with prognostic or predictive ability, few of them retained their prognostic relevance after adjustment for or stratification by CRC stage. Our study showed that DNAm aging algorithms, especially DNAmMRscore, AgeAccelPheno and AgeAccelGrim, were associated with overall survival and disease-specific survival among patients with CRC, independent of age, sex and stage. Therefore, a combination of those DNAm aging algorithms with other clinical factors, such as age, sex and stage, may have the potential to enhance judgment of patients’ prognosis and to improve patient management in clinical practice. However, the associations between DNAm markers and CRC prognosis were weak and mostly statistically non-significant among patients with advanced (stage IV) CRC, among whom prognosis is generally extremely poor. Also, case numbers were smallest in this group which limited statistical power to detect possible associations. Sample size limitations also prohibited in-depth analyses on potential use of the DNAm aging algorithms for predicting success of specific therapies within CRC stages which should be addressed in further, even much larger studies.

Besides potential use for prognostic classification or prediction of treatment success, DNAm aging algorithms can be utilized to explore potential mechanisms and/or synergies underlying the relationship between aging and tumor progression in CRC patients. While our study was the first to demonstrate associations of composite DNAm aging algorithms with CRC prognosis further work should address in more detail which components of the algorithms or which other DNAm markers might be most predictive for CRC prognosis and treatment success, and elucidate in more detail the underlying biological mechanisms. Further studies are also needed to develop novel prognostic DNAm markers and algorithms that are more specific to CRC.

The strengths of this study include the prospective design, large case numbers, long-term follow-up, the well-recorded causes of death, detailed information on pathological data and treatment data. The large sample size allowed detecting moderate size associations which might not be observed in smaller studies. There are also potential limitations that are worth noting. First, surgery, chemo- and radiotherapy administration could affect leukocyte distribution and subsequently have an impact on DNAm levels. Therefore, the leukocyte composition was adjusted for in all Cox regression models to minimize the bias. Sensitivity analyses were performed to investigate the potential bias caused by the timing of blood sampling relative to treatment. Similar results were observed among the patients who received surgery only (Additional file 1: Table S4). Moreover, results barely changed after additionally adjusting the Cox regression model for the timing of blood sampling relative to treatment (data not shown). Second, even though we thoroughly adjusted for potential confounders, residual confounding cannot be completely excluded because of the observational nature of our study. Third, the relatively smaller number of CRC-specific deaths limited the statistical power; therefore, further studies with larger sample size are needed. Last, we investigated a Caucasian population. Caution is therefore required when generalizing the findings to non-Caucasian populations.

In conclusion, DNAmMRscore, AgeAccelPheno and AgeAccelGrim, which incorporate clinical biomarkers and/or features, showed a strong positive association with all-cause mortality among patients with CRC, even within specific CRC stages. They have slight prognostic value beyond age, sex and stage. Further research should address the potential of refinement of DNAm algorithms for predicting prognosis and explore the value of such refined algorithms for predicting success of specific treatments, which may contribute to paving the way for guiding therapeutic decision as well as drug development.

Availability of data and materials

The datasets generated and/or analyzed during the current study are not made publicly available due to ethical and data security requirements but can be made available for researchers on the basis of a research proposal (to be submitted to the corresponding author).


  1. 1.

    Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394–424.

    Google Scholar 

  2. 2.

    Arnold M, Sierra MS, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global patterns and trends in colorectal cancer incidence and mortality. Gut. 2017;66:683–91.

    Article  Google Scholar 

  3. 3.

    Brenner H, Chen C. The colorectal cancer epidemic: challenges and opportunities for primary, secondary and tertiary prevention. Br J Cancer. 2018;119:785–92.

    Article  Google Scholar 

  4. 4.

    Schneider NI, Langner C. Prognostic stratification of colorectal cancer patients: current perspectives. Cancer Manag Res. 2014;6:291–300.

    PubMed  PubMed Central  Google Scholar 

  5. 5.

    Horvath S. DNA methylation age of human tissues and cell types. Genome Biol. 2013;14:R115.

    Article  Google Scholar 

  6. 6.

    Hannum G, Guinney J, Zhao L, Zhang L, Hughes G, Sadda S, et al. Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol Cell. 2013;49:359–67.

    CAS  Article  Google Scholar 

  7. 7.

    Zhang Y, Wilson R, Heiss J, Breitling LP, Saum KU, Schottker B, et al. DNA methylation signatures in peripheral blood strongly predict all-cause mortality. Nat Commun. 2017;8:14617.

    CAS  Article  Google Scholar 

  8. 8.

    Levine ME, Lu AT, Quach A, Chen BH, Assimes TL, Bandinelli S, et al. An epigenetic biomarker of aging for lifespan and healthspan. Aging-US. 2018;10:573–91.

    Article  Google Scholar 

  9. 9.

    Lu AT, Quach A, Wilson JG, Reiner AP, Aviv A, Raj K, et al. DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging-US. 2019;11:303–27.

    CAS  Article  Google Scholar 

  10. 10.

    Brenner H, Chang-Claude J, Seiler CM, Rickert A, Hoffmeister M. Protection from colorectal cancer after colonoscopy: a population-based, case-control study. Ann Intern Med. 2011;154:22–30.

    Article  Google Scholar 

  11. 11.

    Brenner H, Chang-Claude J, Jansen L, Knebel P, Stock C, Hoffmeister M. Reduced risk of colorectal cancer up to 10 years after screening, surveillance, or diagnostic colonoscopy. Gastroenterology. 2014;146:709–17.

    Article  Google Scholar 

  12. 12.

    Hoffmeister M, Jansen L, Rudolph A, Toth C, Kloor M, Roth W, et al. Statin use and survival after colorectal cancer: the importance of comprehensive confounder adjustment. J Natl Cancer Inst. 2015;107:djv045.

    Article  Google Scholar 

  13. 13.

    Carr PR, Weigl K, Edelmann D, Jansen L, Chang-Claude J, Brenner H, Hoffmeister M. Estimation of absolute risk of colorectal cancer based on healthy lifestyle, genetic risk, and colonoscopy status in a population-based study. Gastroenterology. 2020;159(1):129-138.e9.

    CAS  Article  Google Scholar 

  14. 14.

    Lehne B, Drong AW, Loh M, Zhang W, Scott WR, Tan ST, et al. A coherent approach for analysis of the Illumina HumanMethylation450 BeadChip improves data quality and performance in epigenome-wide association studies. Genome Biol. 2015;16:37.

    Article  Google Scholar 

  15. 15.

    Houseman EA, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinform. 2012;13:86.

    Article  Google Scholar 

  16. 16.

    Horvath S, Raj K. DNA methylation-based biomarkers and the epigenetic clock theory of ageing. Nat Rev Genet. 2018;19:371–84.

    CAS  Article  Google Scholar 

  17. 17.

    Gào X, Wilsgaard T, Jansen EH, Holleczek B, Zhang Y, Xuan Y, et al. Pre-diagnostic derivatives of reactive oxygen metabolites and the occurrence of lung, colorectal, breast and prostate cancer: An individual participant data meta-analysis of two large population-based studies. Int J Cancer. 2019;145:49–57.

    Article  Google Scholar 

  18. 18.

    Boakye D, Jansen L, Schneider M, Chang-Claude J, Hoffmeister M, Brenner H. Personalizing the prediction of colorectal cancer prognosis by incorporating comorbidities and functional status into prognostic nomograms. Cancers. 2019;11:1435.

    CAS  Article  Google Scholar 

  19. 19.

    R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna; 2019.

  20. 20.

    Marioni RE, Shah S, McRae AF, Chen BH, Colicino E, Harris SE, et al. DNA methylation age of blood predicts all-cause mortality in later life. Genome Biol. 2015;16:25.

    Article  Google Scholar 

  21. 21.

    Chen BH, Marioni RE, Colicino E, Peters MJ, Ward-Caviness CK, Tsai PC, et al. DNA methylation-based measures of biological age: meta-analysis predicting time to death. Aging-US. 2016;8:1844–65.

    CAS  Article  Google Scholar 

  22. 22.

    Christiansen L, Lenart A, Tan Q, Vaupel JW, Aviv A, McGue M, et al. DNA methylation age is associated with mortality in a longitudinal Danish twin study. Aging Cell. 2016;15:149–54.

    CAS  Article  Google Scholar 

  23. 23.

    Perna L, Zhang Y, Mons U, Holleczek B, Saum KU, Brenner H. Epigenetic age acceleration predicts cancer, cardiovascular, and all-cause mortality in a German case cohort. Clin Epigenet. 2016;8:64.

    Article  Google Scholar 

  24. 24.

    Zhang Y, Saum KU, Schottker B, Holleczek B, Brenner H. Methylomic survival predictors, frailty, and mortality. Aging-US. 2018;10:339–57.

    CAS  Article  Google Scholar 

  25. 25.

    Li X, Ploner A, Wang Y, Magnusson PK, Reynolds C, Finkel D, et al. Longitudinal trajectories, correlations and mortality associations of nine biological ages across 20-years follow-up. eLife. 2020;9:e54870.

    Article  Google Scholar 

  26. 26.

    Hillary RF, Stevenson AJ, McCartney DL, Campbell A, Walker RM, Howard DM, et al. Epigenetic measures of ageing predict the prevalence and incidence of leading causes of death and disease burden. Clin Epigenet. 2020;12:115.

    Article  Google Scholar 

  27. 27.

    Dugué PA, Bassett JK, Joo JE, Baglietto L, Jung CH, Wong EM, et al. Association of DNA methylation-based biological age with health risk factors and overall and cause-specific mortality. Am J Epidemiol. 2018;187:529–38.

    Article  Google Scholar 

  28. 28.

    Zheng C, Li L, Xu R. Association of epigenetic clock with consensus molecular subtypes and overall survival of colorectal cancer. Cancer Epidemiol Biomarkers Prev. 2019;28:1720–4.

    CAS  Article  Google Scholar 

  29. 29.

    Brenner H, Kloor M, Pox CP. Colorectal cancer. Lancet. 2014;383:1490–502.

    Article  Google Scholar 

Download references


We thank the study participants and the interviewers who collected the data. We also thank the following hospitals and cooperating institutions that recruited patients for this study: Chirurgische Universita¨tsklinik Heidelberg, Klinik am Gesundbrunnen Heilbronn, St Vincentiuskrankenhaus Speyer, St Josefskrankenhaus Heidelberg, Chirurgische Universita¨tsklinik Mannheim, Diakonissenkrankenhaus Speyer, Krankenhaus Salem Heidelberg, Kreiskrankenhaus Schwetzingen, St Marienkrankenhaus Ludwigshafen, Klinikum Ludwigshafen, Stadtklinik Frankenthal, Diakoniekrankenhaus Mannheim, Kreiskrankenhaus Sinsheim, Klinikum am Plattenwald Bad Friedrichshall, Kreiskrankenhaus Weinheim, Kreiskrankenhaus Eberbach, Kreiskrankenhaus Buchen, Kreiskrankenhaus Mosbach, Enddarmzentrum Mannheim, Kreiskrankenhaus Brackenheim and Cancer Registry of Rhineland-Palatinate, Mainz.


Open Access funding enabled and organized by Projekt DEAL. This work was funded by the German Research Council (BR 1704/6-1, BR 1704/6-3, BR 1704/6-4, CH 117/1-1, HO 5117/ 2-1, HE 5998/2-1, KL 2354/3-1, RO 2270/8-1 and BR 1704/17-1), the Interdisciplinary Research Program of the National Center for Tumor Diseases (NCT), Germany, and the German Federal Ministry of Education and Research (01KH0404, 01ER0814, 01ER0815, 01ER1505A and 01ER1505B). The sponsors had no role in the study design, in the collection, analysis and interpretation of data and preparation, review or approval of the manuscript.

Author information




Conception and design: XG, HB. Development of methodology: XG, YZ, HB. Acquisition of data: JC-C, MH, HB. Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): XG, YZ, DB, XL. Writing, review and/or revision of the manuscript: XG, YZ, DB, XL, JC-C, MH, HB. Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): XG, YZ. Study supervision: JC-C, MH, HB. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Xīn Gào.

Ethics declarations

Ethics approval and consent to participate

The DACHS Study has been approved by the ethics committees of the Medical Faculty of the University of Heidelberg and the Medical Chambers of Baden-Württemberg and Rhineland-Palatinate. The study is ongoing and is conducted in accordance with the declaration of Helsinki. Written informed consent was obtained from all study participants.

Consent for publication

Not applicable.

Competing interests

No potential conflicts of interest were disclosed.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1:

Supplementary tables and figures.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Gào, X., Zhang, Y., Boakye, D. et al. Whole blood DNA methylation aging markers predict colorectal cancer survival: a prospective cohort study. Clin Epigenet 12, 184 (2020).

Download citation


  • DNA methylation
  • Aging
  • Whole blood
  • Colorectal cancer
  • Prognosis
  • Mortality