DNA methylation in peripheral blood leukocytes for the association with glucose metabolism and invasive breast cancer
Clinical Epigenetics volume 15, Article number: 23 (2023)
Insulin resistance (IR) is a well-established factor for breast cancer (BC) risk in postmenopausal women, but the interrelated molecular pathways on the methylome are not explicitly described. We conducted a population-level epigenome-wide association (EWA) study for DNA methylation (DNAm) probes that are associated with IR and prospectively correlated with BC development, both overall and in BC subtypes among postmenopausal women.
We used data from Women’s Health Initiative (WHI) ancillary studies for our EWA analyses and evaluated the associations of site-specific DNAm across the genome with IR phenotypes by multiple regressions adjusting for age and leukocyte heterogeneities. For our analysis of the top 20 IR-CpGs with BC risk, we used the WHI and the Cancer Genomic Atlas (TCGA), using multiple Cox proportional hazards and logit regressions, respectively, accounting for age, diabetes, obesity, leukocyte heterogeneities, and tumor purity (for TCGA). We further conducted a Gene Set Enrichment Analysis.
We detected several EWA-CpGs in TXNIP, CPT1A, PHGDH, and ABCG1. In particular, cg19693031 in TXNIP was replicated in all IR phenotypes, measured by fasting levels of glucose, insulin, and homeostatic model assessment-IR. Of those replicated IR-genes, 3 genes (CPT1A, PHGDH, and ABCG1) were further correlated with BC risk; and 1 individual CpG (cg01676795 in POR) was commonly detected across the 2 cohorts.
Our study contributes to better understanding of the interconnected molecular pathways on the methylome between IR and BC carcinogenesis and suggests potential use of DNAm markers in the peripheral blood cells as preventive targets to detect an at-risk group for IR and BC in postmenopausal women.
Breast cancer (BC), the topmost leading cause of cancer incidence in women of the USA and worldwide [1, 2], is a heterogeneous disease with multiple clinical, histopathological, and molecular subtypes, which is characterized by both genetic and epigenetic alterations [3, 4]. For epigenetic events, DNA methylation (DNAm) is a well-characterized major epigenetic modification that involves mitotically heritable and reversible attachment of methyl groups at the 5′ carbon of cytosine in CpG dinucleotides (CpGs), influencing DNA transcription without altering the DNA sequence [5, 6]. Whereas several DNAm studies for BC initiation and progression support global hypomethylation [7,8,9] and focal hypermethylation, such that some tumor-suppressor genes are frequently hypermethylated at CpG islands and promoters, thus being inactivated [8, 10, 11], the role of the epigenetic mechanisms in BC tumorigenesis has not been conclusive. For example, there is no consistent trend toward an association between identified CpGs and the risk of BC across studies, suggesting the need for large population-level epigenetic studies, which prospectively evaluate BC development, specifically in BC molecular-subtype stratifications [7, 8].
In particular, among postmenopausal women, the obesity–insulin resistance (IR) connection is a well-established factor for BC risk/progression [1, 12,13,14,15], but their interrelated molecular pathways on the methylome have not been established. In detail, IR and type 2 diabetes (T2DM) are influenced by environmental and genomic factors as well as by their interplay [16,17,18,19]. In prior genome-wide association studies (GWASs), a majority of genes are associated with insulin secretion, pointing to pancreatic islet defects but does not represent impaired insulin action [19,20,21]; and these genes explain a small portion of the estimated heritability [19, 22]. The analysis of epigenetics may address these issues. For example, obesity status/adipose tissues and long-term exposure of beta cell lines to hyperglycemia altered DNAm of genes involved in glucose metabolism and their gene expression, leading to impaired insulin secretion as well as sensitivity [17, 23,24,25,26,27,28,29,30]. Thus, aberrant DNAm may directly influence the function of pancreatic beta cells as well as other organs involved in glucose homeostasis. Also, considering that age, as measured via epigenetic age [31, 32], influences DNAm, changes in DNAm that are associated with IR account for aging in the methylome and thus may be a better indicator of inter-/intra-individual genomic variability of IR. As such, DNAm can be a biomarker of decreased insulin sensitivity, but few epigenome-wide association studies (EWASs) have so far examined DNAm in IR [30, 33,34,35]. In addition, the previous EWASs for obesity/metabolic syndrome showed limited evidence owing to a lack of findings’ validation and no comparison of findings between peripheral blood and tissues.
Further, one study  for IR in T2DM pancreatic islets reported that differentially methylated genes were enriched in pathways of cancer and MAPK signaling, suggesting a close link of epigenomic mechanisms between IR and cancer. Given that even a modest change in DNAm causes a substantial effect on gene expression and that in late-onset disease, it has a large effect on disease over a long time period , DNAm markers can serve as a biomarker to detect an early at-risk group for morbid conditions, such as IR and BC, even several years before the clinical diagnosis is made.
Our study was a population-level EWAS to detect DNAm probes that are associated with IR phenotypes and that, by using data from a prospective evaluation of BC development, are further directly correlated with BC risk, both overall and in BC subtypes among postmenopausal women. DNAm is tissue specific, but the correlations between peripheral blood and tissue are gene specific. For example, the methylation levels of several genes in relation to IR, T2DM, and/or BC are highly correlated between peripheral blood and tissue [37,38,39,40]. Thus, we first conducted a peripheral blood leukocytes (PBLs)–based EWAS and compared the methylation levels of detected CpGs with those of the CpGs within BC and adjacent normal breast tissues. Corresponding to the results in a published study  of gene-methylation parallelisms between peripheral blood cells and tissues in glucose metabolism, the PBLs may be the best non-invasive alternative tissue, standing for a surrogate DNAm marker that reflects multiple glucometabolic pathways. With the detected EWA-based IR-CpGs, we further tested for the associations with BC risk in PBLs and conducted validation tests in BC tissues. This allowed us to determine whether our IR-CpGs at genome-wide significance that were associated with BC risk are systemic or tissue specific or common in both.
Materials and methods
Our EWA analysis used data from the Women’s Health Initiative (WHI) cohort, a large, prospective study of postmenopausal women, whose ages were 50–79 years at the time of enrollment between 1993 and 1998 at 40 clinical centers in the USA, consisting of 2 study arms, namely the clinical trial (CT) and the observational study (OS) . For DNAm data, we included 3 WHI ancillary studies (ASs) with available genome-wide DNAm measured in PBLs (Fig. 1): for discovery, AS315 (Epigenetic Mechanisms of Particulate Matter-Mediated Cardiovascular Disease, random minority oversample from WHI CT, n = 2,243); for validation, Broad Agency Award (BAA23, Integrative Genomics for Risk of Coronary Heart Disease [CHD] and Related Phenotypes, case–control study of CHD from WHI CT and OS, n = 2,107) combined with AS311 (Bladder Cancer and Leukocyte Methylation, matched case–control study of bladder cancer from WHI CT and OS, n = 882) [43, 44]. Racial/ethnic variation exists in BC-related DNAm [37, 45]; for the purpose of our EWA analysis, we restricted the study population to those women who reported their race or ethnicity as non-Hispanic white, a majority of the WHI ASs population, and who had available IR phenotypes assessed via fasting blood levels of glucose (FG) and insulin (FI) (n = 1,132).
For our analysis of the validated IR-CpGs with the risk of BC development, our discovery cohort included those 3 WHI ASs with available BC outcomes but excluding women (n = 46) who had been followed up for less than 1 year and/or had been diagnosed with any type of cancer at enrollment, leaving a total of 1,086 women (Fig. 1). These women had been followed up through March 6, 2021, with a mean of 17 years follow-up, and 80 of them had developed invasive BC. Our replication cohort was derived from the Cancer Genomic Atlas (TCGA) BC Study (n = 862), housing tissue-derived genome-wide DNAm data and molecular profiles of different BC subtypes from BC tissues . Our analyses for BC were restricted to women who are white and postmenopausal with available BC subtypes, but distant-metastasis free, resulting in a total of 412 (= 361 BC tissues + 51 adjacent normal breast tissues) (Fig. 1). The institutional review boards of each WHI clinical center and the University of California, Los Angeles, approved this study.
Data collection and BC outcome
Participants enrolled in the WHI completed self-administered questionnaires at screening, providing demographic information (e.g., age, race) and medical histories, such as DM. Trained staff obtained anthropometric measurements, including height, weight, and waist and hip circumferences at baseline. Invasive BC development was initially ascertained through self-report of a new cancer diagnosis by all participants, further determined by a committee of physicians on the basis of a review of the patients’ medical records and pathology and cytology reports, and coded into the central WHI database according to the National Cancer Institute’s Surveillance, Epidemiology, and End-Results guidelines . The time from enrollment until BC development, censoring, or study end-point was measured as the number of days and then converted into years.
BC patient data from TCGA used in this study include information on age, race, menopausal status, and diagnosed tumor subtype and stage. For the study purpose, data from primary invasive BC tissues and normal breast tissues adjacent to BC (either primary or metastatic) tissues were analyzed.
Epigenome-wide DNAm array and laboratory methods
Using peripheral blood leukocytes isolated from the fasting blood of the WHI participants, we extracted DNA and measured DNAm via the Illumina 450 BeadChip (Illumina Inc.; San Diego, CA) at up to 485,511 CpG sites. DNAm levels (β values) were calculated as the ratio of intensities between the methylated and unmethylated probes, ranging from 0 (completely unmethylated) to 1 (completely methylated) . DNAm was beta-mixture quantile (BMIQ)-normalized,  and batched-adjusted for stage and plate by using the empirical Bayes methods  or by using random intercept for plate and chip and a fixed effect for row. Leukocyte heterogeneities were estimated to be adjusted for in the analysis using Houseman’s method  (for CD4+ T cell, natural killer cell, monocyte, and granulocyte) and Hovarth’s method  (for plasma blast, CD8+CD28–CD45RA– T cell, and naïve CD8 T cell).
In TCGA, tissue-derived genome-wide DNAm was analyzed by using the Illumina Infinium450K array and, using minfi v.1.42.0, was normalized via normal-exponential out-of-band (Noob) background correction . The tumor purity and cell-type proportions (cancer and normal epithelial, stromal, and immune cells) of each tumor sample were estimated by using the R InfiniumPurify v. 1.3.1  and RefFreeEWAS V.2.2 , respectively.
Serum samples from the WHI participants fasting at least 8 h were drawn at enrollment by trained phlebotomists and assayed for glucose and insulin concentrations using the hexokinase method on a Hitachi 747 analyzer (Boehringer Mannheim Diagnostics, Indianapolis, IN) for glucose, and by radioimmunoassay (Linco Research, Inc., St. Louis, MO) or automated ES300 method (Boehringer Mannheim Diagnostics, Indianapolis, IN) for insulin. Results from the 2 methods for insulin measurement were comparable at insulin concentrations < 60 μIU/ml, and the intra-class correlation coefficient with repeatedly measured insulin was 0.7 . Homeostatic model assessment–IR (HOMA-IR), as a surrogate of IR, was estimated as glucose (unit: mg/dl) × insulin (unit: μIU/ml) / 405 .
For the DNAm site-specific analysis across the genome with IR phenotypes, each phenotype was log-transformed as a result of tests conducted for linear assumption and normality distribution and was also categorized as follows: FG, FI, and HOMA-IR, using 100 mg/dl, 8.6μIU/ml, and 3.0 (respectively), corresponding to the cut points of the American Heart Association/National Heart, Lung, and Blood Institute, the International Diabetes Federation, and the Adult Treatment Panel III for metabolic syndrome [58, 59]. The association between DNAm and each phenotype was evaluated via multiple linear and logistic regressions, adjusting for age and leukocyte heterogeneities. The summary of the leukocyte proportions is provided in Additional file 1: Table S1. A 2-sided p < 1E–007 (discovery) and 0.05 / number of the discovered CpGs (replication), providing Bonferroni correction, were considered statistically significant. Results were combined across discovery and replication in a meta-analysis assuming a fixed–effect model.
With the selected top 20 CpG sites that were most statistically significant after multiple-comparison corrections, we next performed in the WHI data the multiple Cox proportional hazards regression for BC development overall and within BC subtypes, with an assumption test via a Schoenfeld residual plot and rho, by accounting for age, having ever been treated for diabetes, body mass index (BMI), waist-to-hip ratio (WHR), and leukocyte heterogeneities. Using TCGA data, we further conducted validation tests of the top 20 CpGs with BC risk by using logit regression that was adjusted for age, tumor purity, and cell-type composition both overall and in the BC subtypes. For the analysis of BC risk, the modeled CpGs in both cohorts were further standardized across samples; thus, the effect size reflected a 1 standardized deviation increase in DNAm on BC risk. Given that this testing was performed on the basis of our hypothesis-driven questions (i.e., IR-DNAm in association with BC systemically or in tissues), a 2-tailed p < 0.05 was considered significant.
Differences in methylation levels of the modeled CpGs by IR phenotypes in the PBLs and by BC risk in each of PBLs and tissues, as well as differences in the DNAm status between the PBLs and tissues among women with BC and those without BC, were tested via unpaired 2-sample t tests. If β values were skewed or had outliers, Mann–Whitney/Wilcoxon’s rank-sum test was used. With the CpGs at genome-wide significance in the discovery and those of which were associated with BC risk in either TCGA or WHI, we finally conducted a Gene Set Enrichment Analysis (GSEA) by IR phenotypes and by BC subtypes, respectively, using the WebGestalt . All statistical analyses were performed using R.
Epigenome-wide association of DNAm and IR phenotypes.
Among 484,220 CpGs in the discovery data, we found several differentially methylated CpGs associated with each IR phenotype (FG, FI, and HOMA-IR) and further validated them. In detail, 19 CpGs were associated with FG, the level of which was analyzed as a continuous variable; of those, 1 CpG (cg19693031 in TXNIP) was further validated, with p < 2.6E–03(= 0.05/19) (Table 1, Figs. 2A and B). This same CpG was also replicated in the analysis for FG as a categorical variable, showing the same direction as the effect size estimated in the FG analysis as a continuous variable (Additional file 1: Table S2). Of 20 CpGs in relation to FI as a continuous variable in discovery, 7 CpGs were further validated, with p < 2.5E–03 (= 0.05/20; Table 2, Figs. 2C and D). Of those 7 CpGs, 1 CpG (cg00574957 in CPT1A) was also replicated in the analysis of FI as a categorical variable (Additional file 1: Table S3); in both linear and logistic analyses, this CpG was negatively associated with FI. For HOMA-IR as a continuous variable, 35 CpGs were detected in discovery; 7 of those were further validated (p < 1.4E–03; Table 3, Figs. 2E and F). In the analysis of HOMA-IR as a categorical variable, 4 of the validated 7 CpGs (cg14476101 in PHGDH, cg19693031 in TXNIP, cg00574958 in CPT1A, and cg06500161 in ABCG1) were also detected in discovery, yielding the same directions as those of effect sizes estimated in the linear analyses, but none of them were further validated (Additional file 1: Table S4). Finally, we conducted a meta-analysis of all the detected epigenome-wide CpGs by combining their discovery and replication data. We detected 1 CpG (cg19693031 in TXNIP) that was replicated inversely related to FG, FI, and HOMA-IR each as a continuous variable; and that CpG was also significant at the epigenome-wide level in association with FG and HOMA-IR each as a categorical variable.
Further, we conducted a subset analysis by selecting CpGs with > 5% of a mean difference in DNAm by IR phenotypes and compared their mean differences in DNAm levels by each IR phenotype across chromosome (Chr), CpG context, enhancer and/or promoter, and gene region (Additional file 1: Figure S1). The mean levels of DNAm by FG (< 100 mg/dl vs. ≥ 100 mg/dl) differed in Chr 1, 7, 8, and 16. The mean levels of DNAm by FI (≤ 8.6μIU/ml vs. > 8.6μIU/ml) and those of DNAm by HOMA-IR (< 3.0 vs. ≥ 3.0) were different in Chr 1, 7, and 8 and in Chr 4, 8, and 11, respectively. Whereas S-Shores were hypomethylated in the groups with impaired glucose metabolism measured via FG and HOMA-IR, OpenSea, N Shelf, and S Shelf were hypermethylated in those with a greater level of FI. In this group with a higher level of FI, the enhancer was hypermethylated, whereas the promoter was hypomethylated. Gene regions, including intergenic, gene body, and 5' untranslated regions (5' UTR), were hypermethylated in the groups with, respectively, greater levels of FG, FI, and HOMA-IR, but the 200–1500 bp upstream of transcription start site (TSS1500) was hypomethylated in the group with a greater level of either FI or HOMA-IR.
Association of the detected IR-DNAm with BC risk.
With the top 20 epigenome-wide IR-DNAm, we next tested for correlation with BC risk in the 2 independent cohorts, WHI and TCGA. In the WHI cohort, several CpGs were associated with BC development; their hazard ratios were consistent across the analyses both with and without adjustment for DM, BMI, and WHR (Table 4). In particular, 3 CpGs in WDR8 were detected across overall, estrogen receptor/progesterone receptor–positive (ER/PR +), and human epidermal growth factor receptor-2–negative (HER2/neu–) subtypes, with a positive association with BC risk (Table 4, Additional file 1: Figure S2). None of the CpGs replicated in the analysis of IR phenotypes were detected in the analysis for BC risk, but 2 epigenome-wide level CpGs detected in discovery (cg17058475 and cg16246545) in CPT1A and PHGDH (replicated genes in relation to IR phenotypes), respectively, were associated with the risk of BC.
In the TCGA cohort, multiple CpGs were significant across BC subtypes; specifically, 2 CpGs (cg06500161 and cg27243685) in ABCG1 (replicated gene in IR phenotypes) were significantly associated with BC risk (Table 5, Additional file 1: Figure S3). Only 1 CpG (cg01676795 in POR) was commonly detected across the WHI and TCGA analyses. This CpG with a 1 standardized deviation increase in DNAm had 75% (in the WHI) and a 5 times greater risk (in the TCGA) for the ER/PR + subtype. Further, we compared DNAm levels between the WHI and TCGA from the IR-CpGs associated with BC that are shared by the 2 cohorts in terms of Chr, CpGs, CpG context, and gene region. Whereas DNAm levels of some CpG contexts and/or gene regions differed significantly between the 2 cohorts among the non-BC subcohorts (Additional file 1: Figure S4), no significant difference in DNAm levels between the cohorts was observed within the BC subcohorts (Fig. 3), suggesting DNAm parallelisms between PBLs and tissues in IR and BC.
GSEA by IR phenotypes and by BC subtypes.
Using GSEA strategies, we conducted multiple analyses of gene ontology (GO) with biologic process, cellular component, and molecular functions; pathways with KEGG and Reactome; and diseases by using DisGeNET and GLAD4U databases. In regard to IR phenotypes (Additional file 1: Tables S5.1–S4.7), GO with biologic process identified a beta-catenin/T cell factors (TCF) complex assembly; its dysregulation is associated with cancer . Gene-enrichment pathways were involved in glucose intolerance, transcriptional mis-regulation in cancer, IR signaling (AKT2, RSK/RAS/MAPK), and lipid metabolism. Diseases involved in the IR pathways included nutritional and metabolic diseases, DM, and obesity. For BC subtypes, GO with a cellular component included a histone acetyltransferase and other transcription factors in the ER/PR + subtype. Genes were enriched in the pathways involving adipocytokine signaling and lipid metabolism in the HER2/neu– subtype and in those involving immune and insulin signaling (MAPK1/MAPK3, Rap 1) in both ER/PR + and HER2/neu– subtypes (Additional file 1: Tables S5.8–S4.11).
This is the first large population-level EWAS conducted in postmenopausal women for detecting differentially methylated CpGs in the PBLs that are associated with individual IR phenotypes and that are further prospectively evaluated for an association with BC development, both overall and in BC molecular subtypes. The methylation levels of the detected CpGs in IR and BC risk between the PBLs and the BC tissues were comparable, consistent with the findings of a gene-methylation parallelism study in glucose metabolism between peripheral blood cells and tissues . This suggests that PBLs may serve as the best source of surrogate DNAm markers in non-invasive tissues, reflecting multiple interconnected glucometabolic carcinogenesis pathways.
Several EWA-CpGs in IR phenotypes detected in our study were also reported in previous studies, supporting our study’s replication and robustness. For example, cg19693031 in TXNIP, inversely associated with FG, FI, and IR in our study, was observed in previous studies with the same direction of association [62,63,64,65,66]. Thioredoxin-interacting protein (TXNIP) plays a key role in pancreatic beta cell biology involving oxidative stress and endothelial cell inflammation and its vascular complications , and it regulates glucose homeostasis by promoting fructose absorption in the small intestine . The TXNIP gene is activated in both hyperglycemic animals and human adipose tissues , and it regulates glucometabolic pathways in human skeletal muscle . Thus, our finding of hypermethylated DNA probe in TXNIP (i.e., a negative effect on the gene expression) associated with decreased IR has been supported.
Also, cg00574957 in CPT1A was negatively associated with FI and IR in both our and previous studies [63, 64, 71], showing the biological plausibility of its association with IR, including its role in obesity, metabolic syndrome, and fatty acid metabolism . As this CpG is independent of nearby single-polymorphism nucleotides (SNPs) located within 1 Mb upstream or downstream of this locus, representing rs1369 index, the decreased CPT1A expression can be caused solely by increased methylation at this CpG site . CPT1A, 1 of the 3 isoforms of CPT-1, was found mostly in the liver, where it is involved in the regulation of mitochondrial fatty acid oxidation (FAO). CPT1A deficiency causes the metabolic disorder of FAO [74, 75]. A decrease in mitochondrial fatty acid uptake results in elevated intramuscular lipid levels, but upregulates glucose oxidation and improves whole-body insulin sensitivity in a mouse model ; this is supportive of our finding of an inverse association between increased DNAm of the CpG (i.e., reduced gene expression) and FI/IR. However, most human gene studies have reported that this gene’s function is connected to fatty acid metabolism, not to clinical glucometabolic phenotypes, warranting a future functional study.
Similarly, cg14476101 in PHGDH was inversely associated with FI and IR in our study. Previous EWASs and Mendelian Randomization studies confirmed the association between hypermethylation at that locus and lower fatty-liver risk, T2DM, and adiposity [76, 77]. Also, the role of this CpG in regulating the blood concentration of steroid hormones was upregulated by obesity . Together, these findings propose a plausible link between the PHGDH gene and lipid and adipocytokine metabolic pathways that can be altered by the methylation level of cg14476101.
In contrast, we found that cg06500161 in ABCG1 was positively associated with FI and IR. This CpG site is a well-known DNAm probe associated with glucometabolic phenotypes [30, 62, 63, 79, 80], and the gene’s expression was inversely associated with the methylation level at this CpG [30, 62]. ABCG1 is a crucial regulator of cholesterol efflux from macrophages to high density lipoprotein (HDL); thus, suppressed gene activity by increased DNAm at this site can contribute to lowering the HDL level , which is a known independent risk factor for glucometabolic disorders. Also, the link between ABCG1 and T2DM/glucose traits has been reported previously in both human and animal studies [82,83,84], supporting our finding of increased DNAm of this site’s being associated with IR phenotypes.
Of those validated IR-genes, 3 genes (CPT1A, PHGDH, and ABCG1) were further correlated with BC risk. In particular, the ABC transporter gene (ABCG1) expression associated with cholesterol efflux in the liver results in inhibition of cell proliferation and stimulation of cell apoptosis in BC cells . This highlights a potential epigenetic link between lipid–glucometabolic alteration and BC tumorigenesis and progression that deserves further study. In our study, the detected CpGs in those 3 genes were EWA-based IR-DNAm probes, which are novel with respect to their association with BC risk.
Although the methylation levels of the CpGs in relation to IR and BC that are common across Chr, CpG contexts, and the gene regions were comparable between the WHI and TCGA cohorts, only 1 individual IR-CpG (cg01676795 in POR) was common in its relationship to BC risk in both cohorts. P450 oxidoreductase (POR) gene expression has been studied in few cancer types, presenting significant overall suppression of POR expression in muscle-invasive bladder cancer  and differentially expressed gene proteins enriched in neutrophil and T cell activation in hepatocellular carcinoma ; those findings support the important role of POR in carcinogenesis via alteration of the immune tumor microenvironment. Our finding of this CpG in POR in association with BC risk is novel, which calls for a future study on the methylation in this gene linked to BC by taking into account the effects of nearby SNPs.
Our analysis for BC risk in the TCGA included BC tissues and adjacent normal tissues. Different findings could result from the analysis between BC tissues and normal tissues (obtained from patients without BC), although we adjusted for tumor purity in the analysis. A few DNAm probes from the TCGA presented an extreme risk magnitude, warranting a further replication study with a larger independent dataset. To increase the comparability of analyses between the 2 cohorts, our study did not account for lifestyle factors in a comprehensive fashion and did not consider interactions with DNAm, which may affect the relationships between DNAm, IR, and BC. The validation data reflect a small fraction of the 2 ASs (BAA23; AS311) owing to the limited availability of IR phenotypes, resulting in less strong statistical power. In addition, given that each AS had its own study purpose, samples selected for our study may not fully represent the source population. Finally, our study population was confined to white postmenopausal women, so the generalizability of our results to other populations is limited.
In conclusion, we found several differentially methylated CpGs, which are both well-established and novel, at the epigenome-wide level in relation to IR that were further correlated with BC development. Our findings warrant further validation in larger, independent epigenetic and mechanistic studies. Our study contributes to better understanding of the interconnected molecular pathways on the methylome between glucose intolerance and BC carcinogenesis and suggests the potential use of DNAm markers in PBLs as preventive targets for detecting an at-risk group for IR and BC among postmenopausal women.
Availability of data and materials
The data that support the findings of this study are available in accordance with policies developed by the NHLBI and WHI in order to protect sensitive participant information and approved by the Fred Hutchinson Cancer Research Center, which currently serves as the IRB of record for the WHI. Data requests may be made by emailing helpdesk@WHI.org.
American cancer society. Breast cancer facts & figures 2019–2020. In: Atlanta: American cancer society, Inc. 2019: https://www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-statistics/breast-cancer-facts-and-figures/breast-cancer-facts-and-figures-2019-2020.pdf.
American cancer society. Cancer fact and figures 2022. In: Atlanta: American cancer society, Inc. 2022: https://www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-statistics/annual-cancer-facts-and-figures/2022/2022-cancer-facts-and-figures.pdf.
Wood LD, Parsons DW, Jones S, et al. The genomic landscapes of human breast and colorectal cancers. Science. 2007;318(5853):1108–13.
Esteller M. Epigenetics in cancer. N Engl J Med. 2008;358(11):1148–59.
Dupont C, Armant DR, Brenner CA. Epigenetics: definition, mechanisms and clinical perspective. Seminars Reproduct Med. 2009;27(5):351–7.
Dick KJ, Nelson CP, Tsaprouni L, et al. DNA methylation and body-mass index: a genome-wide analysis. Lancet. 2014;383(9933):1990–8.
Ennour-Idrissi K, Dragic D, Durocher F, Diorio C. Epigenome-wide DNA methylation and risk of breast cancer: a systematic review. BMC Cancer. 2020;20(1):1048.
Tang Q, Cheng J, Cao X, Surowy H, Burwinkel B. Blood-based DNA methylation as biomarker for breast cancer: a systematic review. Clin Epigenetics. 2016;8:115.
Ruike Y, Imanaka Y, Sato F, Shimizu K, Tsujimoto G. Genome-wide analysis of aberrant methylation in human breast cancer cells using methyl-DNA immunoprecipitation combined with high-throughput sequencing. BMC Genomics. 2010;11:137.
Feng W, Shen L, Wen S, et al. Correlation between CpG methylation profiles and hormone receptor status in breast cancers. Breast Cancer Res. 2007;9(4):R57.
Jovanovic J, Ronneberg JA, Tost J, Kristensen V. The epigenetics of breast cancer. Mol Oncol. 2010;4(3):242–54.
Boyd DB. Insulin and cancer. Integr Cancer Ther. 2003;2(4):315–29.
Calle EE, Kaaks R. Overweight, obesity and cancer: epidemiological evidence and proposed mechanisms. Nat Rev Cancer. 2004;4(8):579–91.
Kabat GC, Kim M, Caan BJ, et al. Repeated measures of serum glucose and insulin in relation to postmenopausal breast cancer. Int J Cancer. 2009;125(11):2704–10.
Gunter MJ, Hoover DR, Yu H, et al. Insulin, insulin-like growth factor-I, and risk of breast cancer in postmenopausal women. J Natl Cancer Inst. 2009;101(1):48–60.
Weichhaus M, Broom J, Wahle K, Bermano G. A novel role for insulin resistance in the connection between obesity and postmenopausal breast cancer. Int J Oncol. 2012;41(2):745–52.
Liu J, Carnero-Montoro E, van Dongen J, et al. An integrative cross-omics analysis of DNA methylation sites of glucose and insulin homeostasis. Nat Commun. 2019;10(1):2581.
Franks PW, Mesa JL, Harding AH, Wareham NJ. Gene-lifestyle interaction on risk of type 2 diabetes. Nutr Metab Cardiovasc Dis. 2007;17(2):104–24.
McCarthy MI. Genomics, type 2 diabetes, and obesity. N Engl J Med. 2010;363(24):2339–50.
Rosengren AH, Braun M, Mahdi T, et al. Reduced insulin exocytosis in human pancreatic beta-cells with gene variants linked to type 2 diabetes. Diabetes. 2012;61(7):1726–33.
Ruchat SM, Elks CE, Loos RJ, et al. Association between insulin secretion, insulin sensitivity and type 2 diabetes susceptibility variants identified in genome-wide association studies. Acta Diabetol. 2009;46(3):217–26.
Shu X, Wu L, Khankari NK, et al. Associations of obesity and circulating insulin and glucose with breast cancer risk: a Mendelian randomization analysis. Int J Epidemiol. 2019;48(3):795–806.
Eizirik DL, Korbutt GS, Hellerstrom C. Prolonged exposure of human pancreatic islets to high glucose concentrations in vitro impairs the beta-cell function. J Clin Invest. 1992;90(4):1263–8.
Hall E, Dekker Nitert M, Volkov P, et al. The effects of high glucose exposure on global gene expression and DNA methylation in human pancreatic islets. Mol Cell Endocrinol. 2018;472:57–67.
Taneera J, Fadista J, Ahlqvist E, et al. Identification of novel genes for glucose metabolism based upon expression pattern in human islets and effect on insulin secretion and glycemia. Hum Mol Genet. 2015;24(7):1945–55.
Maedler K, Sergeev P, Ris F, et al. Glucose-induced beta cell production of IL-1beta contributes to glucotoxicity in human pancreatic islets. J Clin Invest. 2002;110(6):851–60.
Volkov P, Olsson AH, Gillberg L, et al. A genome-wide mQTL analysis in human adipose tissue identifies genetic variants associated with DNA methylation, gene expression and metabolic traits. PLoS ONE. 2016;11(6): e0157776.
Malodobra-Mazur M, Alama A, Bednarska-Chabowska D, Pawelka D, Myszczyszyn A, Dobosz T. Obesity-induced insulin resistance via changes in the DNA methylation profile of insulin pathway genes. Adv Clin Exp Med. 2019;28(12):1599–607.
Rohde K, Klos M, Hopp L, et al. IRS1 DNA promoter methylation and expression in human adipose tissue are related to fat distribution and metabolic traits. Sci Rep. 2017;7(1):12369.
Kriebel J, Herder C, Rathmann W, et al. Association between DNA methylation in whole blood and measures of glucose metabolism: KORA F4 study. PLoS ONE. 2016;11(3): e0152314.
Lee Y, Choufani S, Weksberg R, et al. Placental epigenetic clocks: estimating gestational age using placental DNA methylation levels. Aging (Albany NY). 2019;11(12):4238–53.
Dayeh T, Volkov P, Salo S, et al. Genome-wide DNA methylation analysis of human pancreatic islets from type 2 diabetic and non-diabetic donors identifies candidate genes that influence insulin secretion. PLoS Genet. 2014;10(3): e1004160.
Hidalgo B, Irvin MR, Sha J, et al. Epigenome-wide association study of fasting measures of glucose, insulin, and HOMA-IR in the genetics of lipid lowering drugs and diet network study. Diabetes. 2014;63(2):801–7.
Chambers JC, Loh M, Lehne B, et al. Epigenome-wide association of DNA methylation markers in peripheral blood from Indian Asians and Europeans with incident type 2 diabetes: a nested case-control study. Lancet Diabetes Endocrinol. 2015;3(7):526–34.
Kulkarni H, Kos MZ, Neary J, et al. Novel epigenetic determinants of type 2 diabetes in Mexican-American families. Hum Mol Genet. 2015;24(18):5330–44.
Toperoff G, Aran D, Kark JD, et al. Genome-wide survey reveals predisposing diabetes type 2-related DNA methylation variations in human peripheral blood. Hum Mol Genet. 2012;21(2):371–83.
Conway K, Edmiston SN, Tse CK, et al. Racial variation in breast tumor promoter methylation in the Carolina breast cancer study. Cancer Epidemiol Biomarkers Prev. 2015;24(6):921–30.
Brennan K, Garcia-Closas M, Orr N, et al. Intragenic ATM methylation in peripheral blood DNA as a biomarker of breast cancer risk. Cancer Res. 2012;72(9):2304–13.
Widschwendter M, Apostolidou S, Raum E, et al. Epigenotyping in peripheral blood cell DNA and breast cancer risk: a proof of principle study. PLoS ONE. 2008;3(7): e2656.
Shah UJ, Xie W, Flyvbjerg A, et al. Differential methylation of the type 2 diabetes susceptibility locus KCNQ1 is associated with insulin sensitivity and is predicted by CpG site specific genetic variation. Diabetes Res Clin Pract. 2019;148:189–99.
Bacos K, Gillberg L, Volkov P, et al. Blood-based biomarkers of age-associated epigenetic changes in human islets associate with insulin secretion and diabetes. Nat Commun. 2016;7:11089.
The Women’s Health Initiative Study Group. Design of the Women’s Health Initiative clinical trial and observational study. The Women’s Health initiative study group. Control Clin Trials. 1998;19(1):61–109.
Holliday KM, Gondalia R, Baldassari A, et al. Gaseous air pollutants and DNA methylation in a methylome-wide association study of an ethnically and environmentally diverse population of U.S adults. Environ Res. 2022;212:113360.
Do WL, Whitsel EA, Costeira R, et al. Epigenome-wide association study of diet quality in the Women’s Health Initiative and TwinsUK cohort. Int J Epidemiol. 2021;50(2):675–84.
Conway K, Edmiston SN, Parrish E, et al. Breast tumor DNA methylation patterns associated with smoking in the Carolina breast cancer study. Breast Cancer Res Treat. 2017;163(2):349–61.
Liu J, Lichtenberg T, Hoadley KA, et al. An integrated TCGA pan-cancer clinical data resource to drive high-quality survival outcome analytics. Cell. 2018;173(2):400–16.
National cancer institute. SEER program: comparative staging guide for cancer. In:June 1993.
Dunning MJ, Barbosa-Morais NL, Lynch AG, Tavare S, Ritchie ME. Statistical issues in the analysis of Illumina data. BMC Bioinformatics. 2008;9:85.
Teschendorff AE, Marabita F, Lechner M, et al. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinformatics. 2013;29(2):189–96.
Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8(1):118–27.
Houseman EA, Accomando WP, Koestler DC, et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinformatics. 2012;13:86.
Horvath S. DNA methylation age of human tissues and cell types. Genome Biol. 2013;14(10):R115.
Aryee MJ, Jaffe AE, Corrada-Bravo H, et al. Minfi: a flexible and comprehensive bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014;30(10):1363–9.
Qin Y, Feng H, Chen M, Wu H, Zheng X. InfiniumPurify: An R package for estimating and accounting for tumor purity in cancer methylation research. Genes Dis. 2018;5(1):43–5.
Houseman EA, Molitor J, Marsit CJ. Reference-free cell mixture adjustments in analysis of DNA methylation data. Bioinformatics. 2014;30(10):1431–9.
Langer RD, White E, Lewis CE, Kotchen JM, Hendrix SL, Trevisan M. The women’s health initiative observational study: baseline characteristics of participants and reliability of baseline measures. Ann Epidemiol. 2003;13(9 Suppl):S107-121.
Matthews DR, Hosker JP, Rudenski AS, Naylor BA, Treacher DF, Turner RC. Homeostasis model assessment: insulin resistance and beta-cell function from fasting plasma glucose and insulin concentrations in man. Diabetologia. 1985;28(7):412–9.
Gayoso-Diz P, Otero-Gonzalez A, Rodriguez-Alvarez MX, et al. Insulin resistance (HOMA-IR) cut-off values and the metabolic syndrome in a general adult population: effect of gender and age: EPIRCE cross-sectional study. BMC Endocr Disord. 2013;13:47.
Grundy SM, Cleeman JI, Daniels SR, et al. Diagnosis and management of the metabolic syndrome: an American heart association/national heart, lung, and blood institute scientific statement. Circulation. 2005;112(17):2735–52.
Liao Y, Wang J, Jaehnig EJ, Shi Z, Zhang B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 2019;47(W1):W199–205.
Kolligs FT, Bommer G, Goke B. Wnt/beta-catenin/tcf signaling: a critical pathway in gastrointestinal tumorigenesis. Digestion. 2002;66(3):131–44.
Nuotio ML, Pervjakova N, Joensuu A, et al. An epigenome-wide association study of metabolic syndrome and its components. Sci Rep. 2020;10(1):20567.
Hidalgo BA, Minniefield B, Patki A, et al. A 6-CpG validated methylation risk score model for metabolic syndrome: the HyperGEN and GOLDN studies. PLoS ONE. 2021;16(11): e0259836.
Al Muftah WA, Al-Shafai M, Zaghlool SB, et al. Epigenetic associations of type 2 diabetes and BMI in an Arab population. Clin Epigenetics. 2016;8:13.
Soriano-Tarraga C, Jimenez-Conde J, Giralt-Steinhauer E, et al. Epigenome-wide association study identifies TXNIP gene associated with type 2 diabetes mellitus and sustained hyperglycemia. Hum Mol Genet. 2016;25(3):609–19.
Florath I, Butterbach K, Heiss J, et al. Type 2 diabetes and leucocyte DNA methylation: an epigenome-wide association study in over 1,500 older adults. Diabetologia. 2016;59(1):130–8.
Perrone L, Devi TS, Hosoya K, Terasaki T, Singh LP. Thioredoxin interacting protein (TXNIP) induces inflammation through chromatin modification in retinal capillary endothelial cells under diabetic conditions. J Cell Physiol. 2009;221(1):262–72.
Ding XQ, Wu WY, Jiao RQ, et al. Curcumin and allopurinol ameliorate fructose-induced hepatic inflammation in rats via miR-200a-mediated TXNIP/NLRP3 inflammasome inhibition. Pharmacol Res. 2018;137:64–75.
Koenen TB, Stienstra R, van Tits LJ, et al. Hyperglycemia activates caspase-1 and TXNIP-mediated IL-1beta transcription in human adipose tissue. Diabetes. 2011;60(2):517–24.
Parikh H, Carlsson E, Chutkow WA, et al. TXNIP regulates peripheral glucose metabolism in humans. PLoS Med. 2007;4(5): e158.
Das M, Sha J, Hidalgo B, et al. Association of DNA Methylation at CPT1A Locus with Metabolic Syndrome in the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) Study. PLoS ONE. 2016;11(1): e0145789.
Schlaepfer IR, Joshi M. CPT1A-mediated Fat Oxidation, Mechanisms, and Therapeutic Potential. Endocrinology. 2020;161(2).
Irvin MR, Zhi D, Joehanes R, et al. Epigenome-wide association study of fasting blood lipids in the Genetics of lipid-lowering drugs and diet network study. Circulation. 2014;130(7):565–72.
Keung W, Ussher JR, Jaswal JS, et al. Inhibition of carnitine palmitoyltransferase-1 activity alleviates insulin resistance in diet-induced obese mice. Diabetes. 2013;62(3):711–20.
Bonnefont JP, Demaugre F, Prip-Buus C, et al. Carnitine palmitoyltransferase deficiencies. Mol Genet Metab. 1999;68(4):424–40.
Ma J, Nano J, Ding J, et al. A peripheral blood DNA methylation signature of hepatic fat reveals a potential causal pathway for nonalcoholic fatty liver disease. Diabetes. 2019;68(5):1073–83.
Wahl S, Drong A, Lehne B, et al. Epigenome-wide association study of body mass index, and the adverse outcomes of adiposity. Nature. 2017;541(7635):81–6.
Petersen AK, Zeilinger S, Kastenmuller G, et al. Epigenetics meets metabolomics: an epigenome-wide association study with blood serum metabolic traits. Hum Mol Genet. 2014;23(2):534–45.
Tobi EW, Slieker RC, Luijk R, et al. DNA methylation as a mediator of the association between prenatal adversity and risk factors for metabolic disease in adulthood. Sci Adv. 2018;4(1):4364.
Dayeh T, Tuomi T, Almgren P, et al. DNA methylation of loci within ABCG1 and PHOSPHO1 in blood DNA is associated with future type 2 diabetes risk. Epigenetics. 2016;11(7):482–8.
Westerterp M, Bochem AE, Yvan-Charvet L, Murphy AJ, Wang N, Tall AR. ATP-binding cassette transporters, atherosclerosis, and inflammation. Circ Res. 2014;114(1):157–70.
Kruit JK, Wijesekara N, Westwell-Roper C, et al. Loss of both ABCA1 and ABCG1 results in increased disturbances in islet sterol homeostasis, inflammation, and impaired beta-cell function. Diabetes. 2012;61(3):659–64.
Mauldin JP, Nagelin MH, Wojcik AJ, et al. Reduced expression of ATP-binding cassette transporter G1 increases cholesterol accumulation in macrophages of patients with type 2 diabetes mellitus. Circulation. 2008;117(21):2785–92.
Sturek JM, Castle JD, Trace AP, et al. An intracellular role for ABCG1-mediated cholesterol transport in the regulated secretory pathway of mouse pancreatic beta cells. J Clin Invest. 2010;120(7):2575–89.
El Roz A, Bard JM, Huvelin JM, Nazih H. LXR agonists and ABCG1-dependent cholesterol efflux in MCF-7 breast cancer cells: relation to proliferation and apoptosis. Anticancer Res. 2012;32(7):3007–13.
Baker SC, Arlt VM, Indra R, et al. Differentiation-associated urothelial cytochrome P450 oxidoreductase predicates the xenobiotic-metabolizing activity of “luminal” muscle-invasive bladder cancers. Mol Carcinog. 2018;57(5):606–18.
Fang Y, Yang H, Hu G, et al. The POR rs10954732 polymorphism decreases susceptibility to hepatocellular carcinoma and hepsin as a prognostic biomarker correlated with immune infiltration based on proteomics. J Transl Med. 2022;20(1):88.
Part of the data for this project was provided by the WHI program, which is funded by the National Heart, Lung, and Blood Institute, the National Institutes of Health, and the U.S. Department of Health and Human Services through 75N92021D00001, 75N92021D00002, 75N92021D00003, 75N92021D00004, and 75N92021D00005. The datasets used for the analyses described in this manuscript were obtained from dbGaP at http://www.ncbi.nlm.nih.gov/sites/entrez?db=gap through dbGaP accession (phs000200.v11.p3).
Program Office: National Heart, Lung, and Blood Institute, Bethesda, MD: Jacques Rossouw, Shari Ludlam, Dale Burwen, Joan McGowan, Leslie Ford, and Nancy Geller.
Clinical Coordinating Center: Fred Hutchinson Cancer Research Center, Seattle, WA: Garnet Anderson, Ross Prentice, Andrea LaCroix, and Charles Kooperberg.
Investigators and Academic Centers: JoAnn E. Manson, Brigham and Women's Hospital, Harvard Medical School, Boston, MA; Barbara V. Howard, MedStar Health Research Institute/Howard University, Washington, DC; Marcia L. Stefanick, Stanford Prevention Research Center, Stanford, CA; Rebecca Jackson, The Ohio State University, Columbus, OH; Cynthia A. Thomson, University of Arizona, Tucson/Phoenix, AZ; Jean Wactawski-Wende, University at Buffalo, Buffalo, NY; Marian Limacher, University of Florida, Gainesville/Jacksonville, FL; Robert Wallace, University of Iowa, Iowa City/Davenport, IA; Lewis Kuller, University of Pittsburgh, Pittsburgh, PA; and Sally Shumaker, Wake Forest University School of Medicine, Winston-Salem, NC.
This study was supported by the National Institute of Nursing Research of the National Institutes of Health under Award Number K01NR017852.
Ethics approval and consent to participate
Every contributing clinical site reviewed all study protocols and consent documentation and approved submission of cases to TCGA. The ethics committee of the TCGA in the National Cancer Institute has verified the institutional review board approval. All the participant data for the WHI used in this study were deidentified by the NHLBI and WHI, and consent was obtained from the participants at the source. The study was approved by the institutional review boards of each participating clinical center of the WHI and the University of California, Los Angeles.
Consent for publication
The content of this manuscript has not been previously published and is not under consideration for publication elsewhere.
All authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Jung, S.Y., Bhatti, P. & Pellegrini, M. DNA methylation in peripheral blood leukocytes for the association with glucose metabolism and invasive breast cancer. Clin Epigenet 15, 23 (2023). https://doi.org/10.1186/s13148-023-01435-7