
DNA methylation biomarkers of myocardial infarction and cardiovascular disease



The epigenetic landscape underlying cardiovascular disease (CVD) is not completely understood and the clinical value of the identified biomarkers is still limited. We aimed to identify differentially methylated loci associated with acute myocardial infarction (AMI) and assess their validity as predictive and causal biomarkers.


We designed a case–control, two-stage, epigenome-wide association study on AMI (discovery sample, n = 391; validation sample, n = 204). DNA methylation was assessed using the Infinium MethylationEPIC BeadChip. We performed a fixed-effects meta-analysis of the two samples. Thirty-four CpGs were associated with AMI. Only 12 of them were available in two independent cohort studies (n ~ 1800 and n ~ 2500) with incident coronary heart disease (CHD) and cardiovascular disease (CVD), in which the Infinium HumanMethylation450 BeadChip was used. Four of the 12 CpGs were validated in association with incident CHD: AHRR-mapping cg05575921, PTCD2-mapping cg25769469, intergenic cg21566642 and MPO-mapping cg04988978. We then assessed whether methylation risk scores based on those CpGs improved the predictive capacity of the Framingham risk function, but they did not. Finally, we aimed to study the causality of those associations using a Mendelian randomization approach, but only one of the CpGs had a genetic influence and therefore the results were not conclusive.


We have identified 34 CpGs related to AMI. These loci highlight the relevance of smoking, lipid metabolism, and inflammation in the biological mechanisms related to AMI. Four were additionally associated with incident CHD (three of them also with incident CVD) but did not provide additional predictive information.


Cardiovascular disease (CVD) and more specifically coronary heart disease (CHD) remains the number one cause of death and disease burden worldwide [1, 2]. At the individual level, prevention is based on the estimation of cardiovascular risk [3]. However, the sensitivity of cardiovascular risk estimation is low and a significant proportion of CHD events occurs in individuals classified as having moderate or low risk [4]. Additionally, the use of currently available drugs to control classical cardiovascular risk factors (CVRFs) does not prevent all CHD events, underlining the need to identify new strategies for reducing this residual cardiovascular risk [5]. Thus, information encoded in biological mechanisms should be unravelled to find new predictive biomarkers and potential therapeutic targets. Among these biomarkers, DNA methylation marks arise as emerging candidates.

DNA methylation is an epigenetic mechanism consisting of chemical modifications of cytosines, mostly those followed by guanines (CpGs) [6]. Epigenome-wide association studies (EWASs) make it possible to find DNA methylation biomarkers of different traits and outcomes. In fact, DNA methylation patterns are associated with multiple chronic diseases [7], including CVD and CHD [8,9,10,11,12,13]. However, the clinical value of the identified biomarkers is still limited, and the epigenetic landscape underlying CVD is not completely understood.

The most common technology to assess DNA methylation is based on commercial arrays, which do not cover the whole methylome. Moreover, most current knowledge on the relation between DNA methylation and cardiovascular risk comes from studies based on the Infinium HumanMethylation450 BeadChip (Illumina, CA, USA; hereafter, 450 k) [14], which has been replaced by the Infinium MethylationEPIC BeadChip (Illumina, CA, USA; hereafter, EPIC). Compared to the 450 k, EPIC interrogates 413,745 more methylation sites (but excludes 42,859), increasing the genomic coverage. Moreover, EPIC is enriched in functional sites such as enhancers, DNase hypersensitive sites, and miRNA promoter regions [15]. Thus, the new chip has the potential to identify novel DNA methylation-based biomarkers of cardiovascular events.

We hypothesized that DNA methylation is associated with MI risk, and that some of these epigenetic marks could be predictive of future risk, and have causal effects on cardiovascular outcomes. Thus, this study had three aims: 1) to unravel genomic methylation loci associated with myocardial infarction (MI), 2) to assess their predictive capacity of cardiovascular risk, and 3) to decipher the causality of those associations.


Quality control of DNA methylation data, cardiovascular outcomes and covariates

We finally included 391 individuals (196 cases and 195 controls) in the REGICOR-1 sample (Girona Heart Registry; REgistre GIroní del COR), 204 individuals (101 cases and 103 controls) in the REGICOR-2 sample, 1,863 women in the WHI (Women’s Health Initiative) sample, and 2,540 participants in the FOS (Framingham Offspring Study) sample. The main sociodemographic and clinical characteristics of the three populations are shown in Tables 1 and 2. Regarding the number of CpGs, we analysed 811,610 CpGs in the REGICOR-1 sample, 820,183 CpGs in the REGICOR-2 sample, 478,369 CpGs in the WHI sample, and 483,656 CpGs in the FOS sample. Figure 1 illustrates the steps included in this study.

Table 1 Descriptive characteristics of the populations used in the two-stage EWAS on acute myocardial infarction (AMI): REGICOR-1 and REGICOR-2
Table 2 Descriptive characteristics of the populations used in the follow-up association studies on incident cardiovascular (CVD) and coronary heart disease (CHD) events: Women’s Health Initiative (WHI) and Framingham Offspring Study (FOS)
Fig. 1

Flow chart of the steps included in this study

Association between DNA methylation and cardiovascular outcomes

Two-stage EWAS on acute myocardial infarction

Discovery stage

The associations from the discovery stage (REGICOR-1) that were taken forward to validation (p-value < 10⁻⁵), together with their Manhattan and Q-Q plots, are shown in Additional file 2: Table S1 and Additional file 1: Figs. S1 and S2. In total, we identified 68 CpGs suggestively related to MI (Additional file 1: Fig. S3). Model 1 provided 56 CpGs, of which three were also found in both models 2 and 3, and 13 in model 2. One additional CpG was found in both models 2 and 3, two in model 2 and nine in model 3.

Validation and meta-analysis

The association studies performed in the validation stage included the 68 CpGs suggestively related to MI. We meta-analysed the results of those 68 associations from both stages. We identified 34 differentially methylated CpGs related to MI, with similar effect sizes in all three models for most of the CpGs (except cg21566642, cg05575921, cg03636183). The 34 CpGs were located in 25 different loci (26 genes, with one CpG mapping to two genes) and nine intergenic regions (Table 3, and Additional file 2: Table S2).

Table 3 CpGs differentially methylated in association with prevalent myocardial infarction in the fixed-effects meta-analyses of the REGICOR case–control samples

Follow-up association studies on incident CHD and CVD events

Out of the 34 identified CpGs associated with MI, only 12 were available in the samples with incident cases (whose DNA methylation was profiled with the 450 k array). In total, we validated four CpGs after the meta-analysis of the separate association studies in the WHI and the FOS samples (p-value < 0.05/12 = 4.17 × 10⁻³): AHRR-mapping cg05575921, PTCD2-mapping cg25769469, intergenic cg21566642 and MPO-mapping cg04988978. The four CpGs were associated with CHD, but cg25769469 was not related to CVD (Table 4, Additional file 2: Table S3).

Table 4 CpGs differentially methylated in association with incident coronary/cardiovascular disease in the fixed-effects meta-analyses of the samples from the Women’s Health Initiative (WHI) and the Framingham Offspring Study (FOS)

Association between the identified CpGs and CVRFs

Table 5 shows the associations observed between the identified CpGs and classical CVRFs. The four validated CpGs were related to some CVRF [p-value < 0.05/(4 CpGs × 8 CVRFs) = 1.56 × 10⁻³].

Table 5 Associations between the identified CpGs and classical cardiovascular risk factors (CVRFs) in the fixed-effects meta-analyses of the four samples

Association between MRSs and incidence of CHD and CVD

The associations between the methylation risk scores (MRSs) and the incidence of coronary (n = 94) and cardiovascular (n = 222) events in the FOS population are shown in Additional file 2: Table S4. The median follow-up periods for CVD and CHD incidence were 7.67 and 7.87 years, respectively. The MRSs were not associated with higher cardiovascular risk independently of the classical CVRFs. Consistently, the addition of any of the MRSs to the Framingham risk function did not improve its predictive capacity in the FOS cohort (Additional file 2: Table S4).

Causality of the associations between DNA methylation and cardiovascular outcomes

Of the four identified CpGs, only cg21566642 showed a genetic influence; its methylation levels in adolescence were associated with rs72617176 and in childhood with rs139595493. We did not have individual data to test the first and second Mendelian randomization assumptions, but the meQTLs were associated with the CpG's methylation levels at genome-wide significance independently of age, sex or ancestry principal components [16]. Only the Wald ratio method could be conducted, since it uses a single instrumental variable. The results did not support a causal effect of methylation at cg21566642 on either MI or CHD (Additional file 2: Table S5). We could not perform sensitivity tests for pleiotropic effects or instrument strength. The other three CpGs could not be instrumented.


We have identified 34 methylation loci associated with acute MI in a two-stage EWAS, analysing ~ 850,000 CpGs. All but two of these MI-associated sites (cg05575921 located in AHRR and the intergenic cg21566642) are newly reported. Of those, 12 CpGs could be studied in association with incident cases of CHD and CVD, and we identified four of them associated with incident CHD (three of them also with incident CVD). All four were also related to traditional CVRFs, supporting their role in the development of these diseases. However, their clinical utility as predictive biomarkers or drug targets was not proven.

Recently, two EWASs on incident CHD were published, providing different findings from ours. Ward-Caviness et al. found nine CpGs associated with incident acute MI [9]. Agha et al. reported 52 CpGs related to incident CHD [8]. None of them was replicated in our study. This lack of concordance could be related to methodological differences (incident vs prevalent cases; myocardial infarction vs CHD; confounders considered; characteristics of the populations), and highlights the complexity of the study of these diseases.

CpG sites associated with acute MI events

The 34 identified CpGs showed similar effect sizes in the two REGICOR samples and we considered them potentially relevant. Similarly, all but three CpGs (AHRR-mapping cg05575921, F2RL3-mapping cg03636183, and the intergenic cg21566642) showed consistent effect sizes in the three models. The effect size of those three was reduced by half when adjusted for smoking, which highlights the important role of this risk factor in the MI context. In fact, all three sites are widely described to be related to smoking [17,18,19].

Differentially methylated genes were enriched in diverse molecular and physiological pathways, including lipid metabolism and metabolic and inflammatory diseases, underlining their relevance on the pathogenesis of CHD. Interestingly, the SERPINA1 locus also anchors genetic variants related to CHD [20], and other identified loci present with genetic variants associated with body mass index (DNMT3A, ABTB2, ZBTB16, NISCH, AHRR, DLEU1), inflammatory biomarkers or blood cell counts (AIM2, ITPKB, DNMT3A, LZTFL1, PSMB7, ZBTB16, ACTN1, SERPINA1, MPO, DNAJC5B, CPM, DLEU1, ZFPM1), blood pressure (PTCD2, PSMB7, SERPINA1, AHRR) and lipids (SERPINA1, NISCH, DLEU1, ZFPM1) [21].

Nonetheless, the case–control design of our initial discovery sample limits the inference of the biological sequence of the epigenetic marks, the related biological mechanisms, and the clinical event. One possible scenario could be that the identified DNA methylation marks occurred before the acute event, as potential biological mechanisms involved in MI pathogenesis. This may be the case of the three CpGs that were related to smoking. Conversely, as blood samples of MI cases were collected within the initial 24 h after hospitalization, the other possibility could be that methylation at the identified CpGs had changed as a consequence of the acute event or the therapeutic procedures. If the first scenario can be proven in further studies, these DNA methylation marks could be potential predictive biomarkers of MI or new therapeutic targets. If they are found to be post-MI marks, further studies could evaluate their potential as biomarkers of prognosis.

CpG sites consistently related to prevalent and incident CVD events

Twelve of the 34 identified CpGs could be evaluated in prospective samples and four of them were also related to incident cases of CHD. cg21566642 maps to an intergenic region, and cg05575921, cg04988978 and cg25769469 annotate to AHRR, MPO and PTCD2, respectively. To our knowledge, these CpGs were not associated with cardiovascular events in previous EWAS reports.

cg21566642 and cg05575921 were highly and inversely associated with smoking, which is supported by previous EWAS [18, 19]. We have also previously reported both CpGs as related to age-independent cardiovascular risk [13], and they have been related to all-cause mortality in an EWAS [22]. cg05575921 was further associated directly with cholesterol in high-density lipoproteins (HDL-C) and inversely with cholesterol in low-density lipoproteins (LDL-C) and triglyceride levels in our study. This CpG has been related to both CHD prevalence and incidence in a candidate gene study [23].

cg04988978 and cg25769469 annotate to MPO and PTCD2, respectively. Both CpGs were associated directly with HDL-C and inversely with triglyceride and glucose levels. MPO encodes myeloperoxidase, which promotes atherosclerotic lesions by enhancing APOB oxidation within low-density lipoproteins [24] and was causally associated with incident cardiovascular outcomes [25]. One CpG located within PTCD2 was previously found to be associated with hypertension in obstructive sleep apnea patients [26], and genetic variants in this gene have been related to blood pressure [21].

MRSs as predictive CVD biomarkers

To assess the value of the four identified CpGs as predictive biomarkers, we followed the AHA recommendations [27]. However, we observed neither an independent association between the MRSs and the incidence of CVD events in the FOS, nor an improvement in the predictive capacity of the Framingham risk function when including these scores. This highlights how difficult it is for novel biomarkers to improve cardiovascular risk prediction.

Causality of the associations between methylation loci and cardiovascular outcomes

The association of the four CpGs not only with acute MI but also with incident CHD may suggest that DNA methylation changes at those loci occur prior to the event. However, this association does not establish whether differential DNA methylation at those loci has a causal effect on CHD. Mendelian randomization can be used to assess this causal relationship, but this approach could only be undertaken for cg21566642. Although a non-causal relationship was suggested, this must be interpreted with caution, as there was a single genetic instrumental variable, and with this framework we cannot rule out reverse causation, horizontal pleiotropy, or that the meQTL is in linkage disequilibrium with the causal variant for CHD [28, 29]. Moreover, cg21566642 showed a genetic influence in childhood and adolescence, while CHD events typically occur during adulthood.

Strengths and limitations

The main strength of our study is that it is the first two-stage EWAS on MI to be based on more than 800,000 CpGs across the genome. Moreover, we aimed to validate our findings in prospective samples of CHD and CVD as a proxy of MI. Also, we aimed to prove the clinical relevance of our findings. However, some limitations should be acknowledged. First, two thirds of the CpGs identified in the initial case–control study could not be assessed in the incident studies, as the methylation arrays differed in the number of CpGs (EPIC vs 450 k, respectively). Second, we used self-reported information about cardiovascular risk factors in the case–control study, as an event such as MI modifies risk factor levels during the acute phase. Third, we cannot infer causality, since changes in methylation could have occurred as a consequence of the acute phase and disease management of the MI event. We aimed to perform MR studies of the association between the identified CpGs and cardiovascular events, but available methylation quantitative trait loci (meQTL) datasets are still limited. Last, our study is based on populations of European origin and the results cannot be extrapolated to other populations.


Our study provides 34 novel DNA methylation loci related to MI. The results shed some light on the molecular landscape of MI, highlighting the importance of traditional CVRFs and inflammation in the development of CHD. Our results question the relevance of DNA methylation as a predictive biomarker.


Study design and populations

We designed an EWAS using three populations: the Girona Heart Registry (REGICOR, REgistre GIroní del COR), the Women’s Health Initiative (WHI), and the Framingham Offspring Study (FOS). We first performed a two-stage EWAS on acute MI using two independent age- and sex-matched case–control studies designed in REGICOR. Then, we validated the results in the other two populations with incident cases of CHD and CVD.

Case–control studies of acute MI in REGICOR

The sample used in the discovery stage (REGICOR-1) involved 416 individuals (208 MI cases and 208 controls). The sample in the validation stage (REGICOR-2) comprised 208 individuals (104 cases and 104 controls). Cases were selected from patients who were consecutively attended for a first acute MI in the reference hospital of the monitored area, in the province of Girona, in the northeast of Spain. Women were overrepresented to achieve their inclusion as 50% of our sample. Controls were participants in a population-based survey performed in the same monitored area. They were randomly selected from those attending the 2009–2013 follow-up visit (n = 4980), and matched by age and sex with the MI cases. All participants were of European descent and provided informed written consent. The study was approved by the local ethics committee (2015/6199/I; 2018/7855/I) and meets the principles expressed in the Declaration of Helsinki and the relevant Spanish legislation.

Samples with incident cases of CHD and CVD

The WHI sample is a case–control study nested in a cohort. The FOS sample is a prospective cohort study. Both samples were available in the database of Genotypes and Phenotypes (Project Number #9047). The graphical abstract shows the design and flow-chart of this study.

Assessment of cardiovascular outcomes

The outcomes assessed were acute MI in REGICOR, and incident CHD and CVD in the WHI and FOS samples. Additional details are provided in the Additional file 1: Methods.

Assessment of DNA methylation

DNA methylation was assessed genome-wide from peripheral blood with commercial arrays from Illumina (CA, USA). The Infinium MethylationEPIC BeadChip, covering over 850,000 CpGs, was used in the REGICOR samples. The Infinium HumanMethylation450 BeadChip, covering over 480,000 CpGs, was used in the WHI and FOS samples. A detailed quality control pipeline for the methylation data is available in the Additional file 1: Methods. Methylation status at each CpG was reported by β-values [30].
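As an illustration of what a β-value is, the sketch below follows the definition given by Du et al. [30]: the methylated intensity divided by the total intensity plus a stabilising offset. It is not the quality control pipeline described in Additional file 1. The function names, the example intensities, and the choice of an offset of 100 (the default conventionally recommended for Illumina arrays) are assumptions made for this example.

```python
import math


def beta_value(meth: float, unmeth: float, offset: float = 100.0) -> float:
    """Beta = M / (M + U + offset), with negative intensities clipped to zero."""
    m = max(meth, 0.0)
    u = max(unmeth, 0.0)
    return m / (m + u + offset)


def m_value_from_beta(beta: float) -> float:
    """Base-2 logit transform that relates a Beta-value to an M-value."""
    return math.log2(beta / (1.0 - beta))


# Hypothetical probe intensities: 3000 / (3000 + 900 + 100) = 0.75
example_beta = beta_value(meth=3000.0, unmeth=900.0)
# log2(0.75 / 0.25) = log2(3)
example_m = m_value_from_beta(example_beta)
```

β-values are bounded between 0 and 1 and read directly as a methylation proportion, which is why they are convenient for reporting.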


In the REGICOR case–control studies the following covariates were considered: self-reported smoking, diabetes, hypercholesterolemia and hypertension (Additional file 1: Methods). In the WHI and FOS studies self-reported smoking and glycaemia, total and HDL cholesterol, and blood pressure measurements were considered. Moreover, we inferred the peripheral blood cell counts with the FlowSorted.Blood.450 k R package [31]. We also estimated two surrogate variables for unknown sources of potential technical or biological confounding using the sva R package [32].

Statistical analysis

All statistical analyses were performed using R version 3.4.0. The code for the Singularity images used to run the EWASs on the high-performance computing system of the Hospital del Mar Medical Research Institute is available in online repositories. A detailed description of the statistical methods is provided in the Additional file 1: Methods.

Association between DNA methylation and cardiovascular outcomes

Logistic regression was used in the analyses in the REGICOR and WHI samples, while Cox regression was used in the FOS sample. We considered the cardiovascular event (acute MI, CHD or CVD) as the outcome and DNA methylation as the exposure.

We defined three models. Model 1 was adjusted for estimated cell counts and two surrogate variables (plus age and ethnicity in the WHI sample, plus age and sex in the FOS samples). Model 2 was additionally adjusted for smoking. Model 3 was further adjusted for diabetes, hypercholesterolemia and hypertension.
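The nested adjustment sets can be written down explicitly, which makes the differences between samples easier to see. The following is only a sketch: the variable names (`sv1`, `sv2`, `smoking`, and so on) and the helper `model_covariates` are hypothetical labels, and the cell-count column names are supplied by the caller because the text does not list them.

```python
from typing import List


def model_covariates(model: int, sample: str, cell_counts: List[str]) -> List[str]:
    """Return the adjustment set for models 1-3 in a given sample."""
    if model not in (1, 2, 3):
        raise ValueError("model must be 1, 2 or 3")

    # Model 1: estimated cell counts plus two surrogate variables.
    covariates = list(cell_counts) + ["sv1", "sv2"]

    # Sample-specific additions; REGICOR cases and controls are age- and sex-matched.
    if sample == "WHI":
        covariates += ["age", "ethnicity"]
    elif sample == "FOS":
        covariates += ["age", "sex"]

    # Model 2 adds smoking; model 3 adds the remaining risk factors.
    if model >= 2:
        covariates.append("smoking")
    if model == 3:
        covariates += ["diabetes", "hypercholesterolemia", "hypertension"]
    return covariates


# Hypothetical cell-type columns, for illustration only.
fos_model3 = model_covariates(3, "FOS", ["cd4t", "nk"])
```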

In order to reduce epigenomic inflation, we corrected the coefficients, the standard errors and the p values using the bacon R package if necessary [33]. The bacon R package controls for bias and inflation using a Bayesian method based on the estimation of the empirical null distribution and was used in previous EWAS [33,34,35]. We used coefficients and standard errors from the regression models as the input data and we set a random seed at 123.
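To give a sense of what inflation means here, the sketch below computes the classical genomic inflation factor (lambda) from a set of z-scores. This is a simpler diagnostic than the Bayesian empirical-null method implemented in bacon and is not a substitute for it. The function name and the input format are assumptions made for this example.

```python
from statistics import NormalDist, median
from typing import Iterable

# Median of a chi-square distribution with 1 degree of freedom,
# obtained as the squared 75th percentile of the standard normal (about 0.455).
CHI2_1_MEDIAN = NormalDist().inv_cdf(0.75) ** 2


def inflation_factor(z_scores: Iterable[float]) -> float:
    """Median observed chi-square statistic divided by its expected median under the null."""
    return median(z * z for z in z_scores) / CHI2_1_MEDIAN
```

A value close to 1 suggests the test statistics behave as expected under the null, while values well above 1 suggest inflation that a method such as bacon is designed to correct.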

We selected those associations from the discovery stage (REGICOR-1) with a corrected p-value < 10⁻⁵ for assessment in the validation stage (REGICOR-2). Moreover, we performed a fixed-effects meta-analysis of the corrected effect sizes observed in both stages, weighted by the inverse of the variance. Thereafter, we studied the association of the identified CpGs with incident CHD and with CVD events in the WHI and the FOS samples, separately. The results from both samples were meta-analysed (for CHD and CVD, separately). We used the Bonferroni criterion to correct for multiple comparisons (0.05 divided by the number of probes analysed in each specific analysis).
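A fixed-effects meta-analysis weighted by the inverse of the variance can be sketched in a few lines. The code below is a generic illustration of the technique, not the authors' implementation. The function name and the example effect sizes are hypothetical, and the two-sided p-value assumes a normal approximation.

```python
import math
from typing import Sequence, Tuple


def fixed_effects_meta(betas: Sequence[float], ses: Sequence[float]) -> Tuple[float, float, float, float]:
    """Pool per-study effects with inverse-variance weights.

    Returns (pooled effect, pooled standard error, z statistic, two-sided p-value).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("betas and ses must be non-empty and of equal length")

    weights = [1.0 / se ** 2 for se in ses]           # w_i = 1 / SE_i^2
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled / pooled_se
    p = math.erfc(abs(z) / math.sqrt(2.0))            # two-sided normal p-value
    return pooled, pooled_se, z, p


# Hypothetical discovery and validation estimates for a single CpG.
pooled, pooled_se, z, p = fixed_effects_meta(betas=[0.2, 0.4], ses=[0.1, 0.2])
```

Because the weights are the inverse variances, the more precise study (here the first one) dominates the pooled estimate.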

Association between the identified CpGs and CVRFs

We analysed whether the methylation levels of the identified CpGs were associated with individual CVRFs in the four samples using multiple linear regression. We defined DNA methylation as the outcome and adjusted for age and sex in the case of the REGICOR and the Framingham populations, and for age and ethnicity in the WHI sample. In the case of the REGICOR samples, the continuous variables were only available for the control individuals. We meta-analysed the results from the four populations using a fixed-effects meta-analysis weighted by the inverse of the variance. The p-value threshold was set at 0.05 divided by the product of the number of CVRFs and the number of CpGs assessed.
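The Bonferroni thresholds used in the follow-up analyses follow directly from the number of tests. The sketch below reproduces the two thresholds reported in the Results (12 CpGs tested against incident events, and 4 CpGs tested against 8 CVRFs); the function name is an assumption made for this example.

```python
def bonferroni_threshold(n_tests: int, alpha: float = 0.05) -> float:
    """Per-test significance threshold: alpha divided by the number of tests."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


# 12 CpGs were available on the 450 k array for the incident-event analyses.
incident_threshold = bonferroni_threshold(12)

# 4 validated CpGs x 8 cardiovascular risk factors.
cvrf_threshold = bonferroni_threshold(4 * 8)
```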

Methylation risk scores (MRSs) and predictive capacity

We developed two weighted MRSs based on the CpGs identified, each of them using the results from the meta-analyses of incident CHD or CVD, respectively. We evaluated the association between these scores and CHD and CVD incidence, respectively, in the FOS sample, using Cox regression. All analyses were adjusted for age, sex, diabetes, smoking, systolic blood pressure, hypertensive treatment, and levels of total cholesterol and HDL-C [36]. We also assessed the potential added predictive value of including the MRSs in the Framingham risk function. We evaluated the increase in the discrimination and the reclassification.
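A weighted methylation risk score is typically a weighted sum of methylation values. The sketch below assumes that the weights are the per-CpG effect estimates from the corresponding meta-analysis, which the text implies but does not spell out. The function name, the weights, and the methylation values shown are hypothetical.

```python
from typing import Dict


def methylation_risk_score(beta_values: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted sum of an individual's methylation values over the scored CpGs."""
    missing = set(weights) - set(beta_values)
    if missing:
        raise KeyError(f"missing methylation values for: {sorted(missing)}")
    return sum(weights[cpg] * beta_values[cpg] for cpg in weights)


# Hypothetical weights (for example, log hazard ratios per unit of methylation).
example_weights = {"cg05575921": -2.0, "cg21566642": -1.0}
# Hypothetical methylation values for one participant.
example_betas = {"cg05575921": 0.5, "cg21566642": 0.25, "cg04988978": 0.9}

example_score = methylation_risk_score(example_betas, example_weights)
```

CpGs measured in the individual but absent from the weights are ignored, whereas a CpG required by the score but missing from the data raises an error rather than silently lowering the score.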

Causality of associations between DNA methylation and cardiovascular outcomes

We performed two-sample Mendelian randomization analyses using the MR-Base platform [37]. We used the MRInstruments R package to select the instrumental variables and the TwoSampleMR R package to run the analyses. First, we considered the methylation quantitative trait loci (meQTL) from the Accessible Resource for Integrated Epigenomic Studies (ARIES) project [16] included in the MR-Base database [37]. Then, we interrogated their association with MI and with CHD using summary statistics from a meta-analysis of GWAS on CHD [38]. A more detailed description of the analysis is included in the Additional file 1: Methods.
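When only one instrument is available, the causal estimate reduces to the Wald ratio: the variant-outcome association divided by the variant-exposure association. The sketch below uses the common first-order approximation for the standard error, which ignores uncertainty in the variant-exposure estimate. It is a generic illustration rather than the TwoSampleMR code, and the function name and example inputs are hypothetical.

```python
import math
from typing import Tuple


def wald_ratio(beta_exposure: float, beta_outcome: float, se_outcome: float) -> Tuple[float, float, float]:
    """Single-instrument causal estimate.

    Returns (estimate, standard error, two-sided p-value).
    """
    if beta_exposure == 0:
        raise ValueError("the instrument must be associated with the exposure")

    estimate = beta_outcome / beta_exposure
    # First-order approximation: uncertainty in beta_exposure is ignored.
    se = se_outcome / abs(beta_exposure)
    z = estimate / se
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return estimate, se, p


# Hypothetical meQTL effect on methylation and on the outcome (log odds).
estimate, se, p = wald_ratio(beta_exposure=0.5, beta_outcome=0.1, se_outcome=0.05)
```

With a single instrument there is no way to compare estimates across variants, which is why pleiotropy-robust sensitivity methods cannot be applied in this setting.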

Availability of data and materials

All data generated during this study are included in this published article and its Additional files. The REGICOR datasets analysed during the current study are available from the corresponding author on reasonable request.


450 k:

Infinium HumanMethylation450 BeadChip


Myocardial infarction


Coronary heart disease


Cardiovascular disease


Cardiovascular risk


Cardiovascular risk factor


Infinium MethylationEPIC BeadChip


Epigenome-wide association study


Framingham offspring study


Methylation risk score


REgistre GIroní del COR


Women’s Health Initiative


  1. Roth GA, Abate D, Abate KH, et al. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018;392(10159):1736–88.

  2. James SL, Abate D, Abate KH, et al. Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018;392(10159):1789–858.

  3. Piepoli MF, Hoes AW, Agewall S, et al. 2016 European Guidelines on cardiovascular disease prevention in clinical practice. Eur Heart J. 2016;37(29):2315–81.

  4. Marrugat J, Vila J, Baena-Díez JM, Grau M, Sala J, Ramos R, Subirana I, Fitó M, Elosua R. Relative validity of the 10-year cardiovascular risk estimate in a population cohort of the REGICOR study. Revista Española de Cardiología (English Edition). 2011;64(5):385–94.

  5. Patrono C. Fighting residual cardiovascular risk in stable patients with atherosclerotic vascular disease: COMPASS in context. Cardiovasc Res. 2017;113(14):e61–3.

  6. Petronis A. Epigenetics as a unifying principle in the aetiology of complex traits and diseases. Nature. 2010;465(7299):721–7.

  7. Jin Z, Liu Y. DNA methylation in human diseases. Genes Diseases. 2018;5(1):1–8.

  8. Agha G, Mendelson MM, Ward-Caviness CK, et al. Blood leukocyte DNA methylation predicts risk of future myocardial infarction and coronary heart disease. Circulation. 2019;140(8):645–57.

  9. Ward-Caviness CK, Agha G, Chen BH, et al. Analysis of repeated leukocyte DNA methylation assessments reveals persistent epigenetic alterations after an incident myocardial infarction. Clin Epigenetics. 2018;10(1):161.

  10. Westerman K, Sebastiani P, Jacques P, Liu S, Demeo D, Ordovás JM. DNA methylation modules associate with incident cardiovascular disease and cumulative risk factor exposure. Clin Epigenetics. 2019;11(1):142.

  11. Westerman K, Fernández-Sanlés A, Patil P, Sebastiani P, Jacques P, Starr JM, Deary I, Liu Q, Liu S, Elosua R, DeMeo DL, Ordovás JM. Epigenomic assessment of cardiovascular disease risk and interactions with traditional risk metrics. J Am Heart Assoc. 2020;9(8):e015299.

  12. Fernández-Sanlés A, Sayols-Baixeras S, Subirana I, Degano IRIR, Elosua R. Association between DNA methylation and coronary heart disease or other atherosclerotic events: a systematic review. Atherosclerosis. 2017;263:325–33.

  13. Fernández-Sanlés A, Sayols-Baixeras S, Curcio S, Subirana I, Marrugat J, Elosua R. DNA methylation and age-independent cardiovascular risk, an epigenome-wide approach: the REGICOR study (REgistre GIroní del COR). Arterioscler Thromb Vasc Biol. 2018;38(3):645–52.

  14. Sandoval J, Heyn H, Moran S, Serra-Musach J, Pujana MA, Bibikova M, Esteller M. Validation of a DNA methylation microarray for 450,000 CpG sites in the human genome. Epigenetics. 2011;6(6):692–702.

  15. Moran S, Arribas C, Esteller M. Validation of a DNA methylation microarray for 850,000 CpG sites of the human genome enriched in enhancer sequences. Epigenomics. 2016;8(3):389–99.

  16. Gaunt TR, Shihab HA, Hemani G, Min JL, Woodward G, Lyttleton O, Zheng J, Duggirala A, McArdle WL, Ho K, Ring SM, Evans DM, Davey Smith G, Relton CL. Systematic identification of genetic influences on methylation across the human life course. Genome Biol. 2016;17(1):61.

  17. Sayols-Baixeras S, Lluís-Ganella C, Subirana I, et al. Identification of a new locus and validation of previously reported loci showing differential methylation associated with smoking. The REGICOR study. Epigenetics. 2015;10(12):1156–65.

  18. Shenker NS, Polidoro S, van Veldhoven K, Sacerdote C, Ricceri F, Birrell MA, Belvisi MG, Brown R, Vineis P, Flanagan JM. Epigenome-wide association study in the European Prospective Investigation into Cancer and Nutrition (EPIC-Turin) identifies novel genetic loci associated with smoking. Hum Mol Genet. 2013;22(5):843–51.

  19. Gao X, Jia M, Zhang Y, Breitling LP, Brenner H. DNA methylation changes of whole blood cells in response to active smoking exposure in adults: a systematic review of DNA methylation studies. Clin Epigenetics. 2015;7(1):113.

  20. Van Der Harst P, Verweij N. Identification of 64 novel genetic loci provides an expanded view on the genetic architecture of coronary artery disease. Circ Res. 2018;122(3):433–43.

  21. Buniello A, MacArthur JAL, Cerezo M, et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47(D1):D1005–12.

  22. Svane A, Soerensen M, Lund J, Tan Q, Jylhävä J, Wang Y, Pedersen N, Hägg S, Debrabant B, Deary I, Christensen K, Christiansen L, Hjelmborg J. DNA methylation and all-cause mortality in middle-aged and elderly Danish twins. Genes. 2018;9(2):78.

  23. Ligthart S, Marzi C, Aslibekyan S, et al. DNA methylation signatures of chronic low-grade inflammation are associated with complex diseases. Genome Biol. 2016;17(1):255.

  24. Teng N, Maghzal GJ, Talib J, Rashid I, Lau AK, Stocker R. The roles of myeloperoxidase in coronary artery disease and its potential implication in plaque rupture. Redox Rep. 2017;22(2):51–73.

  25. 25.

    Yao C, Chen G, Song C, et al. Genome-wide mapping of plasma protein QTLs identifies putatively causal genes and pathways for cardiovascular disease. Nat Commun. 2018;9(1):3268.

  26.

    Chen Y-C, Chen T-W, Su M-C, et al. Whole genome DNA methylation analysis of obstructive sleep apnea: IL1R2, NPR2, AR, SP140 methylation and clinical phenotype. Sleep. 2016;39(4):743–55.

  27.

    Hlatky MA, Greenland P, Arnett DK, et al. Criteria for evaluation of novel markers of cardiovascular risk. Circulation. 2009;119(17):2408–16.

  28.

    Hemani G, Bowden J, Davey Smith G. Evaluating the potential role of pleiotropy in Mendelian randomization studies. Hum Mol Genet. 2018;27:R195–208.

  29.

    Richardson TG, Zheng J, Davey Smith G, et al. Mendelian randomization analysis identifies CpG sites as putative mediators for genetic influences on cardiovascular disease risk. Am J Hum Genet. 2017;101:590–602.

  30.

    Du P, Zhang X, Huang C-C, Jafari N, Kibbe WA, Hou L, Lin SM. Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis. BMC Bioinf. 2010;11(1):587.

  31.

    Houseman EA, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, Wiencke JK, Kelsey KT. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinf. 2012;13(1):86.

  32.

    Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28(6):882–3.

  33.

    van Iterson M, van Zwet EW, BIOS Consortium, Heijmans BT. Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution. Genome Biol. 2017;18(1):19.

  34.

    Meeks KAC, Henneman P, Venema A, et al. Epigenome-wide association study in whole blood on type 2 diabetes among sub-Saharan African individuals: findings from the RODAM study. Int J Epidemiol. 2018;48:58–70.

  35.

    Siemelink MA, van der Laan SW, Haitjema S, et al. Smoking is associated to DNA methylation in atherosclerotic carotid lesions. Circ Genomic Precis Med. 2018.

  36.

    D’Agostino RB, Vasan RS, Pencina MJ, Wolf PA, Cobain M, Massaro JM, Kannel WB. General cardiovascular risk profile for use in primary care. Circulation. 2008.

  37.

    Hemani G, Zheng J, Elsworth B, et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife. 2018.

  38.

    Nikpay M, Goel A, Won H-H, et al. A comprehensive 1000 Genomes–based genome-wide association meta-analysis of coronary artery disease. Nat Genet. 2015;47(10):1121–30.



Acknowledgements

We thank Elaine M. Lilly, PhD, for her critical reading and revision of the English text.


Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Carlos III Health Institute–European Regional Development Fund [Grant Numbers FIS PI18/00017, FIS PI15/00051, PI12/00232, CIBERCV, CIBERESP, CIBERONC]; PERIS from Agència de Gestió d’Ajuts Universitaris i de Recerca [Grant Number SLT002/16/00088]; the Government of Catalonia through the Agency for Management of University and Research Grants [Grant Numbers 2014SGR240, 2017SGR946]; the Spanish Ministry of Economy and Competitiveness [Grant Number BES-2014-069718 to AF-S]; and Carlos III Health Institute-FEDER [Grant Number IFI14/00007 to SS-B]. The Framingham Heart Study (FHS) is conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with Boston University [Contract No. N01-HC-25195 and HHSN268201500001I]. The Women’s Health Initiative (WHI) program is funded by the NHLBI [Contracts N01WH22110, 24152, 32100-2, 32105-6, 32108-9, 32111-13, 32115, 32118-32119, 32122, 42107-26, 42129-32, and 44221]. This manuscript was not prepared in collaboration with investigators of the FHS/WHI, has not been reviewed and/or approved by the FHS/WHI, and does not necessarily reflect the opinions or views of the FHS and WHI investigators or the NHLBI.

Author information

Contributions

AF-S, SS-B and RE-L contributed to the conception or design of the work. All authors contributed to the acquisition, analysis, or interpretation of data for the work. AF-S and RE-L drafted the manuscript. SS-B, IS, MS, SP-F, MC-M, ME and JM critically revised the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Roberto Elosua.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the local ethics committee (2015/6199/I; 2018/7855/I) and meets the principles expressed in the Declaration of Helsinki and the relevant Spanish legislation.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Additional material.

Additional file 2: Table S1.

Discovery stage of the EWAS on acute myocardial infarction (REGICOR-1 sample). Model 1 was adjusted for estimated cell counts and two surrogate variables. Model 2 was further adjusted for smoking status. Model 3 was additionally adjusted for diabetes, hypercholesterolemia and hypertension. Coefficients, standard errors and p-values are given for each model before and after correction of the inflation using the bacon R package. Suggestive associations (p-value < 10⁻⁵) are in bold. The total number of suggestive associations in each model is given.

Table S2. Meta-analyses of the results from the discovery (REGICOR-1) and validation (REGICOR-2) stages. Model 1 was adjusted for estimated cell counts and two surrogate variables. Model 2 was further adjusted for smoking status. Model 3 was additionally adjusted for diabetes, hypercholesterolemia and hypertension. Coefficients, standard errors and p-values of REGICOR-1 are those corrected with the bacon R package. Significant associations (p-value < 6.17 × 10⁻⁸) are highlighted in bold. The total number of significant associations in each model is given.

Table S3. Meta-analysis of the results from the follow-up association studies performed in the samples with incident cases of cardiovascular disease (CVD) and coronary heart disease (CHD). Model 1 was adjusted for age, estimated cell counts and two surrogate variables (plus ethnicity in WHI and sex in FOS). Model 2 was further adjusted for smoking status. Model 3 was additionally adjusted for diabetes, hypercholesterolemia and hypertension. Significant associations (p-value < 4.17 × 10⁻³) found in the fixed-effects meta-analysis are highlighted in bold. The total number of significant associations in each model is given.

Table S4. Utility of the methylation risk scores (MRS): association with cardiovascular (CVD) or coronary (CHD) incidence and assessment of their predictive capacity. MRSs were based on the results from model 1 of incident CHD and CVD. Analyses of the association with CVD or CHD incidence were adjusted for age, sex, total cholesterol and HDL-C levels, diabetes, smoking status, systolic blood pressure, and hypertensive treatment, which are the cardiovascular risk factors considered in the Framingham risk function. The predictive capacity of the Framingham function was compared with and without the corresponding MRS.

Table S5. Results of the Wald ratio method applied to determine the causality between the identified CpGs and coronary heart disease or myocardial infarction.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. A copy of this licence is available from Creative Commons. The Creative Commons Public Domain Dedication waiver applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Fernández-Sanlés, A., Sayols-Baixeras, S., Subirana, I. et al. DNA methylation biomarkers of myocardial infarction and cardiovascular disease. Clin Epigenet 13, 86 (2021).

Keywords

  • DNA methylation
  • Epigenome-wide association study
  • Predictive biomarkers
  • Myocardial infarction
  • Cardiovascular disease