Skip to main content

Methylome and transcriptome analyses reveal insights into the epigenetic basis for the good survival of hypomethylated ER-positive breast cancer subtype



Breast cancer (BRCA) is a heterogeneous disease, characterized by different histopathological and clinical features and responses to various therapeutic measures. Despite the research progress of DNA methylation in classification and diagnosis of BRCA and the close relationship between DNA methylation and hormone receptor status, especially estrogen receptor (ER), the epigenetic mechanisms in various BRCA subtypes and the biomarkers associated with diagnostic characteristics of patients under specific hormone receptor status remain elusive.


In this study, we collected and analyzed methylation data from 785 invasive BRCA and 98 normal breast tissue samples from The Cancer Genome Atlas (TCGA) database. Consensus classification analysis revealed that ER-positive BRCA samples were constitutive of two distinct methylation subgroups; with the hypomethylated subgroup showing good survival probability. This finding was further supported by another cohort of ER-positive BRCA containing 30 subjects. Additionally, we identified 977 hypomethylated CpG loci showing significant associations with good survival probability in ER-positive BRCA. Genes with these loci were enriched in cancer-related pathways (e.g., Wnt signaling pathway). Among them, the upregulated 47 genes were also in line with good survival probability of ER-positive BRCA, while they showed significantly negative correlations between their expression and methylation level of certain hypomethylated loci. Functional assay in numerous literatures provided further evidences supporting that some of the loci have close links with the modulation of tumor-suppressive mechanisms via regulation gene transcription (e.g., SFRP1 and WIF1).


Our study identified a hypomethylated ER-positive BRCA subtype. Notably, this subgroup presented the best survival probability compared with the hypermethylated ER-positive and hypomethylated ER-negative BRCA subtypes. Specifically, we found that certain upregulated genes (e.g., SFRP1 and WIF1) have great potential to suppress the progression of ER-positive BRCA, concurrently exist negative correlations between their expression and methylation of corresponding hypomethylated CpG loci. Therefore, our study indicates that different epigenetic mechanisms likely exist in ER-positive BRCA and provides novel clinical biomarkers specific to ER-positive BRCA diagnosis and therapy.


Breast cancer (BRCA) most frequently occurs in females, accounting for the greatest number of cancer-related deaths among women [1, 2]. Despite progress in BRCA therapy, the incidence of BRCA has continued to increase substantially [3] and remains a major public health burden worldwide [4]. BRCA is a complex heterogeneous disease [5, 6], containing about 20 morphologically distinct subtypes, such as inflammatory, pregnancy-associated, and comedo [7]. As different subtypes of BRCA exhibit distinct clinical outcomes and require subtype-specific treatment [6, 8, 9], the pathogenic mechanisms and diagnostic biomarkers under molecular-based BRCA subtypes were broadly concerned.

Recent years, transcriptional profile variations have been used to classify cancers into distinct subtypes [10], which are associated with different biological characteristics and clinical outcomes. Based on the expression status of several key genes, BRCA can be divided into at least four categories, including luminal A, luminal B, HER2-type, and basal-like [11]. For the time being, the crucial role of epigenetic mechanisms in modulating gene transcription has been of growing interest. Epigenetics involves the regulation of gene expression without changes in the genomic sequence [12]. To date, DNA methylation, a most studied epigenetic modification, is closely linked with tumorigenesis via gene expression modulation [13, 14]. In addition, evidence has revealed the difference in the molecular subtypes of certain cancer based on gene expression and DNA methylation [15]. Notably, a growing number of studies have shown that different BRCA subtypes display distinct methylation profiles [16,17,18,19]. The methylation status of certain CpG sites has been effectively used to evaluate the survival potential of BRCA [20]. Notably, hormone receptor, especially estrogen receptor (ER), is considered crucial factors leading to the different clinical outcomes of BRCA [21, 22]. Although abundant evidence has revealed the effects of hormone receptor on dynamic DNA methylation changes in the pathogenesis of BRCA [18, 23, 24], the DNA methylation signatures associated with the pathological development and diagnosis of BRCA under the context of certain hormone receptor have not yet been fully researched.

In the present study, we performed methylation-based BRCA classification using the Illumina Infinium Human Methylation450K dataset, which contained 785 invasive BRCA and 98 normal breast tissue samples from The Cancer Genome Atlas dataset (TCGA) using the UCSC Xena portal ( [25]. In accordance with previous studies [18], our results strengthened current evidence supporting the crucial effects of hormone receptor (i.e., ER) on methylation-based clusters in BRCA. Interestingly, we identified two methylation-based subgroups in ER-positive BRCA and further determined that the hypomethylated subgroup presents a good survival probability, supported by another independent dataset of ER-positive BRCA. Moreover, the hypomethylated loci in this subgroup may play roles to suppress tumor progression via promoting the transcription of putative suppressor genes in ER-positive BRCA subgroup.


Data collection and pre-processing

The methylation dataset of samples and associated clinical information were obtained TCGA database using the UCSC Xena platform ( [25]. Based on the “01” and “11” barcodes in TCGA, we collected a dataset containing 785 BRCA and 98 normal breast tissue samples detected by the Illumina Infinium HumanMethylation450 BeadChips platform, which covered 485,577 CpG loci. The methylation levels of the CpG sites in each sample were represented as a β value ranging from 0 to 1. We first filtered the CpG sites with any missing values across the samples and retained 363,870 CpG sites for subsequent analysis. In addition, the sample-matched RNA-seq data, which quantified the gene expression value by log2(x + 1) transformed RSEM (RNA-Seq by Expectation-Maximization) normalized count, was also downloaded from TCGA using same data portal. In this study, the methylation and expression values were normalized by the ‘normalize.quantiles’ function in R package “preprocessCore”. Another methylation dataset of BRCA was collected from NCBI’s Gene Expression Omnibus (GEO) portal with accession number GSE37754, in which only the 30 ER-positive tumor samples were considered. Moreover, 124 samples from 5 different types of normal tissues with methylation and RNA-Seq datasets were downloaded from TCGA (

Consensus cluster analysis

Initially, we extracted 18,870 most informative (variable) CpG sites using a standard deviation (SD) threshold larger than 0.2 in the tumor samples. To perform methylation-based consensus classification of BRCA, we utilized the R package “ConsensusClusterPlus”, which provides quantitative and visual stability evidence for estimating the number of unsupervised clusters in a dataset [26]. In the clustering processes, a k-means algorithm and 1-Pearson correlation distance metric were used. To obtain stable clusters, we performed 100 iterations with a parameter of maxK = 20. In each iteration, 80% of tumor samples were sampled.

Gene annotation and enrichment analysis

Gene annotation analysis of the CpG sites was obtained in the data descriptions of Illumina HumanMethylation450 BeadChips (GPL13534) from the National Center for Biotechnology Information (NCBI, Gene set enrichment analysis was performed using web tool Metascape [27]. Protein-protein interaction networks were constructed using STRING (v11.0) with default parameters ( [28].

Statistical analysis

The statistical analyses were performed on the R platform ( We analyzed the CpG sites with methylation differences between the two groups using Student’s t tests and performed the methylation-gene expression correlation analysis using Pearson correlation coefficients. For survival analyses in 10 years’ interval, we used the “survival” package with the cox model in R using sample data, including death or last follow-up, from TCGA. When calculating the association of the methylation status of CpG sites and survival probability, we considered the samples with mean methylation level greater than 75% quantile as hypermethylated subgroup while those with value lesser than 25% quantile as hypomethylated subgroup. We drew the heatmaps and survival curves using the “pheatmap” and “survminer” packages, respectively.


Identification of two major DNA methylation-based clusters in BRCA

We collected methylation data screened by Infinium HumanMethylation450 BeadChips (covering 485,577 CpG loci) from the TCGA database and analyzed a total of 785 BRCA and 98 normal breast tissue samples. After removing the CpG sites with missing methylation β values in the 883 samples, 363,870 CpG sites were retained. We then set a threshold of tumor methylation variation with a standard deviation (SD) > 0.2 and extracted 18,870 CpG sites for subsequent analyses.

To determine the methylation-based clusters of BRCA, we performed unsupervised hierarchical cluster analysis of the cancer samples using 1-Pearson correlation distance with the ‘ConsensusClusterPlus’ package [25]. When setting the cluster threshold to K = 6, we identified two major clusters that contained 90.0% of all cancer samples, with 415 and 292 samples in cluster 1 and cluster 2, respectively (Fig. 1a). We then utilized a stringent threshold of K = 20 and found that most samples in the two clusters remained stable, retaining 337 (81.2%, 337/415) and 259 (88.7%, 259/292) samples in cluster 1 and cluster 2, respectively (Fig. 1b). In addition, we calculated the average methylation levels of the 18,504 CpG sites in cluster 1, cluster 2, and normal samples. Compared with the normal samples, the samples in cluster 1 had the highest methylation score, whereas those in cluster 2 had an intermediate methylation score (Fig. 1c). The heatmap also revealed markedly distinct methylation profiles between clusters 1 and 2 of the BRCA samples (Fig. 1d).

Fig. 1
figure 1

DNA methylation-based clusters of BRCA samples. a Consensus clustering of BRCA samples with K = 6. b Consensus clustering of BRCA samples with K = 20. c Mean methylation levels of cluster 1, cluster 2, and normal samples. d Heatmap of BRCA methylation differences between cluster 1, cluster 2, and normal samples

A hypomethylated ER-positive BRCA subgroup with good survival probability

We next investigated the histological types (e.g., infiltrating ductal carcinoma, infiltrating lobular carcinoma) of the BRCA samples in clusters 1 and 2. Results showed that the two clusters exhibited similar histological composition ratios (Fig. 2a). In addition, we carried out a survival analysis to explore the overall survival probability of BRCA in clusters 1 and 2. No significant survival differences were found between the two clusters (p = 0.30) (Fig. 2b).

Fig. 2
figure 2

Clinical signatures of clusters in BRCA samples. a Histological types of BRCA in cluster 1 and cluster 2. b Kaplan-Meier survival curve of BRCA in cluster 1 and cluster 2. c Estrogen receptor status of BRCA in cluster 1 and cluster 2. d Kaplan-Meier survival curve of BRCA in hyper-ER-positive, hypo-ER-positive and hypo-ER-negative subgroups

As hormone receptor status, especially ER, has a close relationship with tumor progression and methylation modification [24, 29], we explored the status of ER in clusters 1 and 2 based on the information recorded in the clinical dataset from TCGA. Results showed that most (93.2%; 314/337) samples in cluster 1 were ER-positive BRCA. Interestingly, the samples in cluster 2 were more complex and included 43.2% (112/259) ER-positive BRCA and 52.1% (135/259) ER-negative BRCA (Fig. 2c). Similarly, the overall methylation level of ER-positive tumors in cluster 2 found to be lower than that of ER-positive tumors in cluster 1 (Additional file 1: Figure S1). Thus, according to the hormone receptor status and methylation profiles, we divided the tumors into three subgroups, namely hypermethylated ER-positive BRCA (hyper-ER-positive), hypomethylated ER-positive BRCA (hypo-ER-positive), and hypomethylated ER-negative BRCA (hypo-ER-negative). We then analyzed the survival probability in the three subgroups and found a significantly good survival probability for hypo-ER-positive BRCA compared with hyper-ER-positive and hypo-ER-negative BRCA (p < 0.05) (Fig. 2d).

We then investigated the distributions of PAM50 gene signature-based BRCA subtypes (e.g., luminal A and luminal B) on these 3 methylation-based subgroups [30]. Results showed that hypo/hyper-ER-positive BRCA subgroups contain a number of luminal A and luminal B tumors, while the hypo-ER-negative subgroup mainly consists of basal tumors (Additional file 1: Figure S2A). Here, we focused on the ER-positive BRCA patients and found that the hypo-ER-positive subgroup contains a small number of luminal B breast cancers compared with that in the hyper-ER-positive subgroup (9.8% vs. 27.4%), echoing the fact of worse survival for luminal B BRCA patients (Additional file 1: Figure S2B). In addition, we also observed a similar distribution pattern for the progesterone receptor (PR) status in the methylation-based BRCA subgroups (Additional file 1: Figure S2C). These findings indicated a complex relationship in subtyping the BRCA patients based on gene expression and DNA methylation, which has been observed in other tumors [15].

A similar observation in another independent population

To test whether our findings can be observed in other populations, we collected and analyzed another methylation dataset containing 30 ER-positive BRCA samples. Similarly, consensus clustering analysis still divided the cancer samples into two major subgroups (Fig. 3a), which were confirmed by the heatmap plot (Fig. 3b). Consistently, further analysis showed that the hypomethylated ER-positive BRCA subgroup (i.e., ER+, cluster 2) indeed presents a good survival probability (Fig. 3c, d).

Fig. 3
figure 3

Consensus clustering analysis of BRCA in an independent population. a Consensus clustering of ER-positive BRCA with K = 10. b Heatmap plot of methylation differences of ER-positive cancers in cluster 1 and cluster 2. c Mean methylation level of the two major BRCA subgroups. d Kaplan-Meier survival curve of the two subtyping ER-positive cancers

Abundant CpG sites with methylation differences between hypo-ER-positive and hyper-ER-positive BRCA subtypes

As the ER-positive BRCA had two methylation-based subgroups showing a significant difference in survival probability, we focused on those CpG sites that contributed to the classification of ER-positive BRCA. Using a threshold BH-adjusted p value <  0.01 and |log2 (fold-change)| > 1, we identified 3032 differentially methylated CpG sites (DMCs) between the hyper-ER-positive and hypo-ER-positive BRCA subgroups. Among these DMCs, approximately 91.4% (2772/3032) were hypomethylated in the hypo-ER-positive subgroup (Fig. 4a). Then, we also divided the ER-positive BRCA into two subgroups based on the mean methylation of the 2772 hypomethylated DMCs (hypo-DMCs; see “Materials and methods”). The survival analysis further revealed that the ER-positive cancer samples with low methylation content have a good survival probability (p = 0.0008) (Fig. 4b and (Additional file 1: Figure S3A). Whereas, no significant association was observed between the methylation content of these hypo-DMCs and the survival of hypo-ER-negative BRCA subgroup (Additional file 1: Figure S3B).

Fig. 4
figure 4

CpG sites with methylation differences between the two ER-positive BRCA subgroups. a Percentage of hypermethylated and hypomethylated sites between hypo-ER-positive and hyper-ER-positive cancer subgroups. b Survival curve in the two subgroups based on the methylation level of the 2772 hypo-DMCs. c Distribution of hypo-DMCs in various genomic sequences. d Genes with hypo-DMCs enriched in pathways related with cancer

According to the gene annotation analysis, the hypo-DMCs were located on the TSS1500 (509), TSS200 (542), 5’UTR (319), 1stExon (305), Body (550), 3’UTR (43), and intergenic regions (504) (Fig. 4c). In addition, most hypo-DMCs (94.5%; 2619/2772) were located on regions associated with CpG islands (Additional file 1: Figure S4). To explore the roles of the hypo-DMCs in tumorigenesis, we performed gene set enrichment analysis. Results showed that 1190 genes associated with the hypo-DMCs were significantly enriched in pathways, such as Wnt (hsa04310), Hippo (has04309), Rap1 (hsa04015), basal cell carcinoma (hsa05217), transcriptional misregulation in cancer (hsa05202), and breast cancer (hsa05224) (q < 0.05) (Fig. 4d). These pathways indeed play important roles in cancer progression [31,32,33], indicating the potential roles of these hypo-DMCs in inhibiting ER-positive BRCA progression.

Tumor suppression by key hypo-DMCs in ER-positive BRCA subgroup

To further confirm the potential tumor-suppressing roles of the hypo-DMCs, we reanalyzed the association between methylation level and survival probability. Among them, 979 CpG loci have significant associations between methylation level and survival in ER-positive BRCA (p < 0.05). As expectedly, 977 loci present the relationship between hypomethylation and good survival probability, with the hazard ratio (HR) lesser than 0.48 (Additional file 1: Figure S5A). Similarly, genes with these loci were still enriched in cancer-related pathways, such as Wnt, Rap1, microRNA in cancer and transcriptional misregulation in cancer (q < 0.05) (Additional file 1: Figure S5B). We then performed multivariate cox model analysis to correct the effects of covariates (i.e., hormone receptor, histological type, pathological stage, and PAM50 gene signatures based subtype) and found that 68.3% (667/977) of the loci still display significant associations between their methylation level and survival of BRCA patients.

To test whether the hypo-DMCs have the potential to promote transcription of genes with tumor-suppressing roles, we performed methylation and expression correlation analysis and identified a total of 178 genes that displaying negative associations between their expression and methylation of certain hypo-DMCs (Fig. 5a). Further protein-protein interaction analysis showed that the proteins have significantly more interactions than expected (Fig. 5b). Similarly, the proteins in the network were also enriched in pathways including NF-kappa B, Wnt, microRNAs in cancer, focal adhesion and transcriptional misregulation in cancer (FDR < 0.05) (Fig. 5b).

Fig. 5
figure 5

Hypo-DMCs showing significant associations between hypomethylation and good survival probability in ER-positive BRCA. a Heatmap of upregulated genes with negative associations between their expression and methylation of the hypo-DMCs. b Protein-protein interaction network of genes with the hypo-DMCs

We then analyzed the correlation between the expression level of genes and survival probability of ER-positive BRCA and identified 47 (47/178) genes with high expression and good survival probability (Fig. 6a and (Additional file 1: Table S1). Consistently, these 47 genes were enriched in pathways including Wnt signaling pathway, NF-kappa B signaling pathway, microRNA in cancer and transcriptional misregulation in cancer (p < 0.05) (Fig. 6b).

Fig. 6
figure 6

Epigenetic activation of putative tumor suppressor genes in improving survival of ER-positive BRCA. a Forest plot for assesses of the 47 genes on survival in ER-positive BRCA (bars represent the 95% CI). b KEGG pathways enriched by the 47 genes. c Methylation differences in cg24319902 between hypo-ER-positive and hyper-ER-positive BRCA subgroups (p < 0.05). d Association between methylation level of cg24319902 and expression level of SFRP1. e Survival plot of SFRP1 expression with survival in ER-positive cancer. f Methylation differences in cg21383810 between hypo-ER-positive and hyper-ER-positive cancer (p < 0.05). g Association between methylation level of cg21383810 and expression level of WIF1. h Survival plot of WIF1 expression with survival in ER-positive cancer


BRCA is classified into distinct subtypes by various molecular features, with consequently different treatment strategies [34, 35]. Abundant research has revealed that hormone receptor (i.e., ER) status is one of the most important features of BRCA [36, 37]. Epidemiological research has shown that about 70% of BRCA patients are ER-positive [38]. Nevertheless, ER-positive BRCA still constitutes a more heterogeneous group for diagnosis and treatment [39, 40]. For instance, according to the HER2 status and KI67 protein level, ER-positive BRCA also consists of two major subtypes, i.e., luminal A and luminal B, with luminal B tumors presenting a worse prognosis than luminal A subtype [35, 40]. Currently, accumulating evidence has shown that epigenetic modifications, in especial DNA methylation, play important roles in BRCA progression and present close associations with the BRCA heterogeneity [41, 42]. Therefore, the therapeutic targets and biomarkers on epigenetic layers specific to molecular-based BRCA subclasses (e.g., ER-positive) remain to be investigated.

In the present study, we identified a hypomethylated ER-positive BRCA subgroup, with a considerable number of samples displaying similar methylation levels as ER-negative BRCA. More remarkably, this subgroup presented the best survival probability compared with the hyper-ER-positive and hypo-ER-negative BRCA. This finding is in contrast to the general view that ER-positive BRCA always exhibits higher methylation and better survival than that of ER-negative BRCA [18, 29].

Additionally, we identified hundreds of hypo-DMCs that showing significant associations between hypomethylation and good survival in the hypo-ER-positive BRCA. Indeed, further analysis revealed that the genes with these sites preferentially locate on cancer-related pathways, such as the Wnt and Rap1 signaling pathways [31, 32]. These findings imply that the hypo-DMCs play crucial roles in suppressing tumor progression via regulation of gene activity in pathways associated with ER-positive BRCA progression. Moreover, our observations were supported by the survival analysis of certain DMCs and corresponding genes in the ER-positive BRCA. By scanning the literatures associated function of corresponding genes, some cases indeed supported the roles of hypo-DMCs in inhibition of tumor promotion by regulating gene transcription. For example, cg24319902 is a CpG site located in the TSS200 region of SFRP1, loss expression of which is reported to be associated with the progression of BRCA [43]. The methylation level of this CpG site decreased in hypo-ER-positive BRCA subgroup compared with that in hyper-ER-positive BRCA subgroup (Fig. 6c). Due to the negative correlation between its methylation and the expression level of SFRP1 (Fig. 6d and (Additional file 1: Figure S6A), the hypomethylation of cg21790626 likely reduced the severity of BRCA in hypo-ER-positive subgroup. Indeed, the higher expression level of SFRP1 was associated with a good survival of ER-positive cancers in the TCGA dataset (Fig. 6e). In addition, cg21383810 was also hypomethylated in hypo-ER-positive BRCA subgroup (Fig. 6f). This CpG site, located in the TSS1500 region of WIF1, was found to be negatively associated with the expression of WIF1 (Fig. 6g and (Additional file 1: Figure S6B). Given that the contribution of epigenetic silencing WIF1 to breast tumorigenesis as reported in the previous study [44], it is likely that the loss of methylation in the hypo-ER-positive cancer samples will suppress disease acceleration, which was also supported by the association between its expression and patients’ chances of survival (Fig. 6h). Taken together, our study indicates that the hypomethylation of ER-positive BRCA in some CpG loci has contributions to the modulation of certain tumor suppression mechanisms. This research provides potential epigenetic diagnostic biomarkers and therapeutic targets specific to ER-positive BRCA. And yet for that, further clinical and experimental studies will facilitate the extending of these findings.





Breast cancer


Confidence interval


Differentially methylated CpG site


Estrogen receptor


Hazard ratio


RNA-Seq by Expectation-Maximization


Standard deviation


Secreted frizzled-related protein 1


The Cancer Genome Atlas


WNT inhibitory factor 1


  1. Hortobagyi GN, de la Garza SJ, Pritchard K, Amadori D, Haidinger R, Hudis CA, et al. The global breast Cancer burden: variations in epidemiology and survival. Clin Breast Cancer. 2005;6:391–401.

    Article  PubMed  Google Scholar 

  2. Bray F, McCarron P, Parkin DM. The changing global patterns of female breast cancer incidence and mortality. Breast Cancer Res. 2004;6:229–39.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Smith BD, Smith GL, Hurria A, Hortobagyi GN, Buchholz TA. Future of Cancer incidence in the United States: burdens upon an aging, Changing Nation. J Clin Oncol. 2009;27:2758–65.

    Article  PubMed  Google Scholar 

  4. Siegel RL, Miller KD, Jemal A. Cancer statistics. CA Cancer J Clin. 2018;68:7–30.

    Article  Google Scholar 

  5. Polyak K. Heterogeneity in breast cancer. J Clin Invest. 2011;121:3786–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Viale G. The current state of breast cancer classification. Ann Oncol. 2012;23:x207–10.

    Article  PubMed  Google Scholar 

  7. Hsiao Y-H, Chou M-C, Fowler C, Mason JT, Man Y. Breast cancer heterogeneity: mechanisms, proofs, and implications. J Cancer. 2010;1:6–13.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Kravchenko J, Akushevich I, Seewaldt VL, Abernethy AP, Lyerly HK. Breast cancer as heterogeneous disease: contributing factors and carcinogenesis mechanisms. Breast Cancer Res Treat. 2011;128:483–93.

    Article  CAS  PubMed  Google Scholar 

  9. Rakha EA, Reis-Filho JS, Baehner F, Dabbs DJ, Decker T, Eusebi V, et al. Breast cancer prognostic classification in the molecular era: the role of histological grade. Breast Cancer Res. 2010;12:207.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Perou CM, Sørlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, et al. Molecular portraits of human breast tumours. Nature. 2000;406:747–52.

    Article  CAS  PubMed  Google Scholar 

  11. Tamimi RM, Colditz GA, Hazra A, Baer HJ, Hankinson SE, Rosner B, et al. Traditional breast cancer risk factors in relation to molecular subtypes of breast cancer. Breast Cancer Res Treat. 2012;131:159–67.

    Article  CAS  PubMed  Google Scholar 

  12. Jones PA. The role of DNA methylation in mammalian epigenetics. Science. 2001;293:1068–70.

    Article  CAS  PubMed  Google Scholar 

  13. Jones PA, Baylin SB. The fundamental role of epigenetic events in cancer. Nat Rev Genet. 2002;3:415–28.

    Article  CAS  PubMed  Google Scholar 

  14. Tsai H-C, Baylin SB. Cancer epigenetics: linking basic biology to clinical medicine. Cell Res. 2011;21:502–17.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Ren Z, Wang W, Li J. Identifying molecular subtypes in human colon cancer using gene expression and DNA methylation microarray data. Int J Oncol. 2015;48:690–702.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  16. Feng W, Shen L, Wen S, Rosen DG, Jelinek J, Hu X, et al. Correlation between CpG methylation profiles and hormone receptor status in breast cancers. Breast Cancer Res. 2007;9:R57.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  17. Bediaga NG, Acha-Sagredo A, Guerra I, Viguri A, Albaina C, Ruiz Diaz I, et al. DNA methylation epigenotypes in breast cancer molecular subtypes. Breast Cancer Res. 2010;12:R77.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  18. Fackler MJ, Umbricht CB, Williams D, Argani P, Cruz L-A, Merino VF, et al. Genome-wide methylation analysis identifies genes specific to breast Cancer hormone receptor status and risk of recurrence. Cancer Res. 2011;71:6195–207.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Bardowell SA, Parker J, Fan C, Crandell J, Perou CM, Swift-Scanlan T. Differential methylation relative to breast cancer subtype and matched normal tissue reveals distinct patterns. Breast Cancer Res Treat. 2013;142:365–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Conway K, Edmiston SN, May R, Kuan PF, Chu H, Bryant C, et al. DNA methylation profiling in the Carolina breast cancer study defines cancer subclasses differing in clinicopathologic characteristics and survival. Breast Cancer Res. 2014;16:450.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  21. Mcguirem WL, Pearson H, Secalof A. Current status of estrogen and progesterone receptors in breast cancer. Cancer. 1977;39:2934–47.

    Article  Google Scholar 

  22. Britton JA, Gammon MD, Schoenberg JB, Stanford JL, Coates RJ, Swanson CA, et al. Risk of breast cancer classified by joint estrogen receptor and progesterone receptor status among women 20-44 years of age. Am J Epidemiol. 2002;156:507–16.

    Article  PubMed  Google Scholar 

  23. Benevolenskaya EV, ABMMK I, Ahsan H, Kibriya MG, Jasmine F, Wolff B, et al. DNA methylation and hormone receptor status in breast cancer. Clin Epigenetics. 2016;8:7.

    Article  CAS  Google Scholar 

  24. Widschwendter M, Siegmund KD, Müller HM, Fiegl H, Marth C, Müller-Holzner E, et al. Association of breast cancer DNA methylation profiles with hormone receptor status and response to tamoxifen. Cancer Res. 2004;64:3807–13.

    Article  CAS  PubMed  Google Scholar 

  25. Goldman M, Craft B, Brooks A, Zhu J, Haussler D. The UCSC Xena Platform for cancer genomics data visualization and interpretation. bioRxiv. 2018; 326470.

  26. Wilkerson MD, Hayes DN. ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinformatics. 2010;26:1572–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10:1523.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  28. Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47:D607–13.

    Article  CAS  PubMed  Google Scholar 

  29. Han G. ER, PR and HER2 testing in breast cancer. Diagn Histopathol. 2014;20:440–5.

    Article  Google Scholar 

  30. Liu MC, Pitcher BN, Mardis ER, Davies SR, Friedman PN, Snider JE, et al. PAM50 gene signatures and breast cancer prognosis with adjuvant anthracycline- and taxane-based chemotherapy: correlative analysis of C9741 (Alliance). Breast Cancer. 2016;2:15023.

    CAS  PubMed  Google Scholar 

  31. Lindvall C, Bu W, Williams BO, Li Y. Wnt signaling, stem cells, and the cellular origin of breast Cancer. Stem Cell Rev. 2007;3:157–68.

    Article  CAS  PubMed  Google Scholar 

  32. Howe LR, Brown AMC. Wnt signaling and breast cancer. Cancer Biol Ther. 2004;3:36–41.

    Article  CAS  PubMed  Google Scholar 

  33. Mo J-S, Park HW, Guan K-L. The hippo signaling pathway in stem cell biology and cancer. EMBO Rep. 2014;15:642–56.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Haque R, Ahmed SA, Inzhakova G, Shi J, Avila C, Polikoff J, et al. Impact of breast cancer subtypes and treatment on survival: an analysis spanning two decades. Cancer Epidem Biomar. 2012;21:1848–55.

    Article  Google Scholar 

  35. Dai X, Li T, Bai Z, Yang Y, Liu X, Zhan J, et al. Breast cancer intrinsic subtype classification, clinical use and future trends. Am J Cancer Res. 2015;5:2929–43.

    CAS  PubMed  PubMed Central  Google Scholar 

  36. Dunnwald LK, Rossing MA, Li CI. Hormone receptor status, tumor characteristics, and prognosis: a prospective cohort of breast cancer patients. Breast Cancer Res. 2007;9:R6.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  37. Mason BH, Holdaway IM, Mullins PR, Yee LH, Kay RG. Progesterone and estrogen receptors as prognostic variables in breast cancer. Cancer Res. 1983;43:2985–90.

    CAS  PubMed  Google Scholar 

  38. Kumar MM, Ponvijay K, Nandhini R, Nagarajan R, Jose J, Srinivas G, et al. A mouse model for luminal epithelial like ER positive subtype of human breast cancer. BMC Cancer. 2007;7:180.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  39. Yersal O, Barutca S. Biological subtypes of breast cancer: prognostic and therapeutic implications. World J Clin Oncol. 2014;5:412–24.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Hashmi AA, Aijaz S, Khan SM, Mahboob R, Irfan M, Zafar NI, et al. Prognostic parameters of luminal a and luminal B intrinsic breast cancer subtypes of Pakistani patients. World J Surg Oncol. 2018;16:1.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Basse C, Arock M. The increasing roles of epigenetics in breast cancer: implications for pathogenicity, biomarkers, prevention and treatment. I J Cancer. 2015;137:2785–94.

    CAS  Google Scholar 

  42. Szyf M. DNA methylation signatures for breast cancer classification and prognosis. Genome Med. 2012;4:26.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Klopocki E, Kristiansen G, Wild P, Klaman I, Castanosvelez E, Singer G, Stohr R, Simon R, Sauter G, Leibiger H. Loss of SFRP1 is associated with breast cancer progression and poor prognosis in early stage tumors. Int J Oncol. 2004;25:641–9.

    CAS  PubMed  Google Scholar 

  44. Ai L, Tao Q, Zhong S, Fields CR, Kim W-J, Lee MW, et al. Inactivation of Wnt inhibitory factor-1 (WIF1) expression by epigenetic silencing is a common event in breast cancer. Carcinogenesis. 2006;27:1341–8.

    Article  CAS  PubMed  Google Scholar 

Download references


We would like to thank Dr. Christine Watts for help in editing the manuscript.


This study was supported by the National Natural Science Foundation of China (31672317), National Key Research and Development Program of China (2016YFD0200900), and Competitive Planning Projects of Hubei Academy of Agricultural Sciences (2016jzxjh012).

Author information

Authors and Affiliations



YP and FHX designed and supervised the project. XQC performed data analyses and wrote the paper. FZ, QCS, and CZ revised the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Fu-Hui Xiao or Yu Peng.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Ethics Committee at Hubei University.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing financial interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Figure S1.

Mean methylation level of BRCA samples in cluster 1: ER-positive, cluster 2: ER-positive, and cluster 2: ER-negative. Figure S2. Relationship of the PAM50 gene signatures and methylation-based BRCA subgroups. Figure S3. Survival analysis based on the 2722 hypo-DMCs. Figure S4. Distribution of sites in regions related to CpG islands. Figure S5. The 977 loci with relationships between hypomethylation and good survival probability of ER-positive breast cancer. Figure S6. Methylation-gene expression associations for the cg24319902-SFRP1 and cg21383810-WIF1 pairs in normal tissues. Table S1. Information of hypo-DMC activating genes that presenting a significant association between high expression and good survival probability in ER-positive breast cancer.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, XQ., Zhang, F., Su, QC. et al. Methylome and transcriptome analyses reveal insights into the epigenetic basis for the good survival of hypomethylated ER-positive breast cancer subtype. Clin Epigenet 12, 16 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: