- Open Access
Genome-wide DNA methylation profiles of low- and high-grade adenoma reveals potential biomarkers for early detection of colorectal carcinoma
Clinical Epigenetics volume 12, Article number: 56 (2020)
Abnormal DNA methylation is a hallmark of human cancers and may be a promising biomarker for early diagnosis of human cancers. However, the majority of DNA methylation biomarkers that have been identified are based on the hypothesis that early differential methylation regions (DMRs) are maintained throughout carcinogenesis and could be detected at all stages of cancer.
In this study, we identified potential early biomarkers of colorectal cancer (CRC) development by genome-wide DNA methylation assay (Illumina infinium450, 450 K) of normal (N = 20) and pre-colorectal cancer samples including 18 low-grade adenoma (LGA) and 22 high-grade adenoma (HGA), integrated with GEO and ArrayExpress datasets (N = 833).
We identified 209 and 8692 CpG sites that were significantly hyper-methylated in LGA and HGA, respectively. Pathway analysis identified nervous system-related methylation changes that are significantly associated with early adenoma development. Integration analysis revealed that DNA methylation in the promoter region of ADHFE1 has the most potential for being an early diagnostic biomarker for colorectal adenoma and cancer (sensitivity = 0.96, specificity = 0.95, area under the curve = 0.97).
Overall, we demonstrated that DNA methylation have been shown significant changes in the stage of LGA and HGA in the development of colon cancer. Genome-wide DNA methylation to LGA and HGA provided an important proxy to identify promising early diagnosis biomarkers for colorectal cancer.
Colorectal cancer (CRC) is the third leading cause of cancer-related deaths worldwide [1, 2]. Current evidence indicates that genetic mutations and epigenetic alterations progressively accumulate in the tumor genome during carcinogenesis, and these alterations may serve as primary biomarkers for early detection and treatment of cancer . Abnormal alterations in methylation status specifically hyper-methylation or hypo-methylation have also been associated with abnormal tissue differentiation. Altered methylation has been observed in the promoter regions of tumor suppressor genes and miRNA have been observed in almost all cancer types [4, 5]. Over the past decades, detection of altered DNA methylation has been widely studied to develop cancer biomarkers  and the majority that have been identified are based on the hypothesis that early differential methylation regions (DMRs) are maintained throughout carcinogenesis and could be detected at all stages of cancer. For example, altered methylation patterns have been detected with hepatic disease progression in the context of hepatitis, cirrhosis, and hepatocellular carcinoma (HCC) . Moreover, recent evidence demonstrated that cell-free DNA (cfDNA) methylation can be used for early cancer diagnosis and tissue-of-origin mapping for metastatic cancer .
Abnormal alterations of DNA methylation have been recognized as an important event in cancer development . Global hypo-methylation arises early in carcinogenesis and has been linked to chromosomal instability and loss of imprinting [9, 10]. Generally, during cancer development, hundreds of genes are silenced or activated [11,12,13]. Although silencing of some genes in cancers occurs by mutation, a large proportion of carcinogenic gene silencing is a result of altered DNA methylation . DNA methylation-based silencing in cancer typically occurs at multiple CpG sites in the CpG islands present in the promoters of protein-coding genes . Although extensive epigenetic alterations have been defined over the past years, CRC is still not well understood at the molecular level . Against a background of whole-genome hypo-methylation, gene-specific promoter hyper-methylation has been found to promote CRC by downregulating the expression of key tumor suppressor genes such as CDKN2A, MLH1, and CDH1 [16,17,18]. CRC is a heterogeneous disease that typically originates from a pre-cancerous lesion, often in the form of an adenoma, eventually progressing to a malignant cancer within a temporal window that may exceed 10 years . Because CRC exceeds many other cancers in both incidence and mortality, capacity to detect and monitor molecular changes during the colorectal adenoma (AD) stage provides an excellent opportunity to prevent cancer progression and improve survival outcomes . While a large number of studies have focused on CRC, a subset of them has focused specifically on the adenoma as an intermediate stage which required more specific molecular definition. For instance, a ten-gene methylation signature in adenoma was found to change with pathologic progress . Notably, colorectal adenoma has two pathologic stages: low-grade adenoma (LGA) and high-grade adenoma (HGA) . Our research compared and defined differences of genome-wide profiling of DNA methylation, especially changes across these two pre-cancerous stages that had not been reported . We hypothesized that these alterations in LGA methylation represented candidates as potential early diagnostic biomarkers. We further posit that comprehensive understanding of the genome-wide DNA methylation profile for early stage pre-cancerous lesions (LGA and HGA) will provide important resources, early diagnosis, and candidate biomarkers for potential oncogenic progression.
In this study, we conducted a series of genome-wide DNA methylation array of 18 LGA and 22 HGA and compared the frequency, location, and pattern of methylation status of 20 normal tissue samples. Dynamic DNA methylation changes were identified for LGA and HGA, and we found that methylation changes that appeared in LGA were increased or maintained in HGA and cancer. Enrichment analyses to DMRs were performed to further investigate the potential influence of DNA methylation on functional difference in adenoma initiation and development. Moreover, we separated different methylation sites (DMSs) between LGA and normal into hyper-DMS and hypo-DMS and evaluated their respective performance for CA and CRC prediction. To validate our findings, we compared them to genome-wide DNA methylation profiles of 833 samples from public database. Finally, we describe the identification and analysis of one functional methylation signature at the promotor region of ADHFE1 as a potential biomarker for early CRC development.
Landscape of DNA methylation of pre-cancerous lesions
We profiled DNA methylation at the single-base level for 18 LGA, 22 HGA, and 20 normal tissues. We found significant genome-wide DNA methylation differences among normal-, low-, and high-grade adenoma (Fig. 1a, b). Compared to normal tissue, LGA had genome-wide hypo-methylation (P = 5.2 × 10−5, rank sum test) which was even lower in HGA (P = 3.7 × 10−6, compared with normal, rank sum test, Fig. 1c). Methylation levels of all target sites in the array demonstrated the known bimodal distribution in normal, LGA, and HGA (Fig. 1d), and the amount of fully methylated sites of lesions decreased with increasing degree of malignancy (right peak, Fig. 1d, e). Almost all DMSs in LGA compared to normal tissues kept at least an equivalent methylation level if not higher than in HGA and cancer (Additional file 1: Fig. S1). The 209 significantly hyper-methylated sites in LGA were further hyper-methylated in 22 HGA and 504 cancer samples collected from public databases (Fig. 1f and Additional file 1: Fig. S2, Table S1), and hypo-DMSs had a diametric tendency (Additional file 1: Fig. S3) suggesting that DNA demethylation may occur very early in pre-cancerous lesions. Over 60% of DMRs that were observed in both LGA (71.4%, 314/440) and HGA (61.9%, 4,213/6,805) were hypo-methylated compared to normal tissues (Fig. 1g, Additional file 1: Table S2 and S3). However, with LGA as the reference, most DMRs observed in HGA were hyper-methylated (76.0%, 660/868) (Fig. 1g, Additional file 1: Table S4). In addition, there were limited overlaps between genes with DMRs in LGA compared to normal tissues and those compared to HGA, suggesting different epigenetic process (Fig. 1h) .
Nervous system processes were associated with adenoma development
Enrichment analysis of 603 DMRs which were located between HGA and LGA, and most highly enriched functional terms, included the nervous system and those associated with signal transduction (Fig. 2a), specifically dopaminergic synapse and serotonergic synapse pathways, which play a role in the gut–brain axis model of signaling cross-talk between organ systems . These results correspond to gene methylation findings in Fig. 1g where HGA vs normal includes almost all genes that are listed in LGA vs normal and HGA vs LGA DMRs. To figure out the potential function changes from LGA to HGA, Gene Ontology (GO) enrichment was performed for 275 genes that were significantly different in methylation status between LGA vs normal and HGA vs normal without considering the differences in methylation status between HGA vs LGA. Five hundred seventy-one significantly different methylated genes were highlighted in HGA vs LGA and HGA vs normal without LGA vs normal (Fig. 2b). For the 275 genes with significantly different methylation patterns in only the LGA vs normal and HGA vs normal comparisons, GO analysis selected the top enriched terms of proteolysis as well as extracellular matrix disassembly, inorganic anion transport, and cobalamin metabolic processes. Cell adhesion, positive regulation of positive chemotaxis, and neuropeptide signaling pathway were term hits on the overlapping part between LGA vs normal and HGA vs LGA. Genes that were significantly different in methylation status between LGA and HGA were enriched for chemical synaptic transmission, transmission of nerve impulse, calcium ion transmembrane transport, and similar neural processing terms. Like the DMR enrichment analysis, terms related to the nervous system were selected yet exhibited different term patterns between HGA vs LGA compared to LGA vs normal.
Hyper-methylated CpG sites exhibited better discrimination between normal, pre-cancerous, and cancerous tissues than the hypo-methylated pattern for CRC
To distinguish the discriminatory ability of DNA methylation patterns for normal tissue, CA, and CRC, we collected 833 genome-wide DNA methylation datasets from GEO and ArrayExpress, public datasets which included 278 normal tissue samples, 51 adenoma samples, and 504 cancer samples. We separated DMSs of LGA vs normal into two groups including hyper-DMSs (209 sites) and hypo-DMSs (441 sites). We found both hyper-DMSs and hypo-DMSs could effectively distinguish methylation pattern differences between disease (adenoma and cancer) and normal samples (Fig. 3a, b). Meanwhile, we also conducted two machine learning-based predictions with the DMSs identified in our dataset and observed that hyper-methylated sites can better distinguish between normal samples and disease samples via random forest and neural network methods (Table 1). For hyper-methylated sites, the area under the curves (AUCs) of receiver operating characteristic (ROC) curves were 0.91 and 0.85, respectively. For hypo-methylated sites, AUCs of ROC curves were lower at 0.72 and 0.76, respectively (Fig. 3c, d). Unsupervised t-SNE cluster analysis produced the same result (Fig. 3e, f). To avoid inconsistent results caused by unstable methylation based on single CpG sites, we compared the mean beta value (mBV) of these sites. We found that hyper-methylated mBVs were significantly different between normal tissue and CRC (P < 2.2 × 10−16); however, there was no significant difference between the adenoma and cancer (P = 0.29, Fig. 3g) in which the average mBV of the normal tissue, adenoma, and cancer are 0.22, 0.54, and 0.57, respectively. We observed similar results for hypo-methylation sites in which the average mBV of the normal tissue, adenoma, and cancer were 0.70, 0.44, and 0.50, respectively (Fig. 3g). Finally, we found the AUCs of ROC curves with hyper-mBV and hypo-mBV were 0.98 and 0.95, respectively. Permutation analysis based on a bootstrap strategy indicated that the model based on hyper-methylated sites had better discriminatory power than the model of hypo-methylated sites (P < 2.2 × 10−8, Fig. 3h).
The promoter of ADHFE1 may be a potential biomarker for colorectal adenoma and cancer
Next, we grouped the DMRs of normal tissue and LGA into hyper- and hypo-DMRs and performed enrichment analysis by Ingenuity Pathway Analysis (IPA). The top enriched functional term for hyper-DMRs was ethanol degradation II (P = 5.4 × 10−3) which was mostly contributed to methylation sites on two genes, ADHFE1 and ACSS3, which can facilitate the conversion from ethanol to acetaldehyde and from acetic acid to acetyl-CoA, respectively (Fig. 4a). The expression of both genes were downregulated in colonic and rectal cancer tissue compared with normal tissue (P < 0.01), a result consistent with the DNA methylation changes between LGA and HGA (R2 = − 0.49 and − 0.59, Fig. 4b, c). We found that the average methylation level of CpG sites located in CpG islands within the promoter regions of ADHFE1 and ACSS3 were significantly increased in cancer samples compared to normal samples (∆mBVs = 0.2 and 0.18, respectively). We further analyzed the promoter region within the CpG island of the two genes to distinguish between normal and disease tissues. When setting the cutoff at 0.25 for the ADHFE1 promoter, the minimal error rate was only 4.68% (39/833, Fig. 4d); the heatmap of sites within the region reflected the same result (Fig. 4e). ROC curve analysis of mBV of the ADHFE1 promoter for all 833 samples produced an AUC of 0.97 with specificity and sensitivity at 0.95 and 0.96 (Fig. 4f). For cancer samples, an AUC as high as 0.98 was determined (Additional file 1: Fig. S4). For ACSS3, the minimal error rate of its promoter was 16.68% (139/833) with a cutoff set at 0.42 (Fig. 4g) which performed inferiorly to ADHFE1 in terms of discrimination power. Meanwhile, we also compared ADHFE1 with SEPT9, an FDA-approved methylation-based biomarker for CRC screening. The correlation of the two genes was 0.77, and we determined that ADHFE1 had a better prediction power than SEPT9 (Fig. 5a and Additional file 1: Fig. S5) . Furthermore, we observed ADHFE1 to have a much better separation boundary compared to SEPT9 (Fig. 5b). In view of most detected cfDNA being actually the fragments from white blood cells, we checked DNA methylation status of ADHFE1 promoter in 656 whole blood cases from public data. As expected, all sites in the promoter were consistently at low methylation level (Additional file 1: Fig. S6).
Whole-genome DNA hypo-methylation and hyper-methylation analysis of the promoter regions of cancer-related genes is regarded as a common method of characterizing diverse cancers . In our study, we found that whole-genome DNA hypo-methylation may start at the LGA stage and lead to further hypo-methylation at HGA and CRC (Fig. 1c). As many previous studies have reported, a bimodal distribution can characterize DNA methylation pattern, and we noted that the hyper-methylated peak can clearly reflect progressive hypo-methylation (Fig. 1d, e) . We identified 440 and 6805 DMRs in low- and hyper-grade adenoma, respectively, and of these DMRs, 314 (71.4%) in LGA and 4213 (61.9%) in HGA were hypo-methylated compared to normal tissues. On the contrary, most DMR (660/868, 76.0%) differences between HGA and LGA were hyper-methylated. Aside from a little overlap between HGA genes, significantly distinct DMRs were located between LGA vs normal and HGA vs LGA which indicates that LGA vs normal and HGA vs LGA are possibly not the same process with a degree difference but two different epigenetic processes. These genome-wide demethylation patterns may indicate that though hypo-methylation dominates the carcinogenesis of CRC, hyper-methylation sites may contribute more to the distinct malignancy of these lesions.
To find functional differences between differing methylation patterns in normal, pre-cancerous, and cancerous tissues, enrichment analysis was applied to 603 genes with DMRs between HGA and LGA which determined that most enriched terms were related to nervous system and signal transduction (Fig. 2a). The term gut–brain-axis describes an integrative physiology concept that incorporates all, including afferent and efferent neural, endocrine, nutrient, and immunological signals, cross-talk between the central nervous system, and the gastrointestinal system that may be dysregulated during carcinogenesis . Our Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis further highlighted the significance of dopaminergic synapse and serotonergic synapse to CRC development. Serotonin (5-hydroxytryptamine; 5-HT) is popularized as a contributor to feelings of well-being and happiness though its actual biological function is complex and multifaceted with roles in modulating cognition, reward, learning, memory, and numerous physiological processes . Brain 5-HT gets much more respect, and certainly more press and research, than the vastly larger store of 5-HT in the gut though both are important for physiological functions . Dopamine (3,4-dihydroxyphenethylamine; DA) is an organic chemical of the catecholamine and phenethylamine families that functions as both a hormone and a neurotransmitter and plays several important roles in the brain and body . In the brain, dopamine functions as a neurotransmitter to send signals to other nerve cells . Outside the central nervous system, dopamine functions primarily as a local paracrine messenger to reduce gastrointestinal motility and protect the intestinal mucosa . The interaction of tumor and the nervous system has also been found in gastric cancer and liver cancer [33, 34]. Our study suggests that the gut–brain axis and related molecules may be important contributors to the development and progression of CRC even at the adenoma stage.
DNA methylation has always been considered as a potential biomarker for many diseases due to its tissue specificity and stability . Here, we analyzed DNA methylation patterns as a mechanism to distinguish disease samples (including adenoma and cancer) from normal samples during CRC development. We identified 209 hyper-methylated sites and 441 hypo-methylated sites between LGA and normal samples and noted that both hyper- and hypo-methylated sites could effectively distinguish between normal and CRC tissues. Further validation with random forest and neural network analyses confirmed our observations. Specifically, AUCs of ROC curves for our prediction model using hyper-methylated sites were larger than those using hypo-methylated sites, despite the observation that hypo-methylated sites were more than twice the number of hyper-methylated ones. Since tumors are known to have whole-genome hypo-methylation, we speculate that gene hyper-methylation at several key sites and/or global hypo-methylation during early CA may be the driver events for CRC. To reduce bias caused by unstable methylation on single CpG sites, we compared mBV of these sites among tissue groups. We found that hyper-methylated mBVs were significantly different between normal tissue and cancers (P < 2.2 × 10−16), while no significance was found between the adenoma and CRC (P = 0.288, Fig. 3g). Permutation analysis based on bootstrap strategy suggest that the model based on hyper-methylated sites has better discrimination power than the model of hypo-methylated sites (P < 2.2 × 10−8, Fig. 3h) which may lend support to the theory that hyper-methylation at several key sites may trigger widespread hypo-methylation throughout the genome during cancer development.
Colorectal adenoma is considered the middle stage between normal status and cancer; therefore, our study focused on identifying and comparing the differences in DNA methylation patterns among normal, pre-cancerous, and cancerous colorectal tissues. IPA enrichment analysis of hyper-DMRs identified in very early stage cancers selected Ethanol degradation II as the top term for functional impact, in which ADHFE1 and ACSS3 were hit. Intense early changes in DNA methylation patterns at the promotor region of these genes support their potential use as adenoma biomarker. It is known that ADHFE1 encodes for hydroxyacid-oxoacid transhydrogenase which is responsible for the oxidation of 4-hydroxybutyrate in mammalian tissues . Some studies have also reported that the gene is associated with cell proliferation and differentiation [36,37,38]. In CRC tissue, ADHFE1 is hyper-methylated in the promoter region corresponding to downregulation of expression that may facilitate tumor growth . Our results suggest that the DNA methylation of the ADHFE1 promoter is a potential biomarker for distinguishing colorectal adenoma and cancer from normal tissue.
As the only FDA-approved liquid biopsy marker for DNA methylation, SEPT9 has been applied for colon cancers screening . Actually the detection signal of SEPT9 has been shown to be more distinguishable in tissues than at cfDNA samples . The better performance of ADHFE1 than SEPT9 at tissue level made it a promising liquid biopsy biomarker for CRC. Further efforts with a larger, more diverse sample population are needed to validate the predictive efficacy of this biomarker at cfDNA.
In addition, a recent study found a promising biomarker cg10673833 which distinguished tumor patients from healthy people by cfDNA . However, the methylation level of this marker showed only a slight upward trend from normal tissues to adenoma and cancer, in our samples as well as in public data. In view of the very low methylation of cg10673833 in the blood, most likely its detection of cancer was mainly due to largely increased metabolism of the tumor tissue that caused increased shedding of ctDNA. Comparing with cg10673833, the better discrimination of normal to adenoma and cancer by ADHFE1 raises a great potential for this candidate as a methylation marker to indicate pathological changes.
Besides ADHFE1, we obtained a group of 209 hyper-methylated DMSs in our LGA samples. For their potential being candidates of methylation markers, we examined these sites in 656 cases of whole blood from GEO. As shown in the heatmap of Additional file 1: Fig. S7, 207 out of 209 sites showed their low methylation level as < 0.3 in average, implying the potential of these sites deserving further investigation for early diagnosis.
Adenoma samples are perfect proxy for colorectal carcinoma early biomarker identification. Our study focused on adenoma, in order to get the earliest clue to detect colorectal disease. DNA methylation is a promising biomarker for cancer diagnosis and surveillance for its tissue specificity and robustness. We established the DNA methylation landscape of LGA and HGA and noted the hyper-methylated peak has a regular decrease companied with disease procession. Furthermore, we found the development of adenoma is associated with functions of nervous system, while the initiation of the adenoma is more associated with cell biological functions. Another relatively independent work was based on the precious finding in LGA, in which we found ADHFE1 is a potential early diagnosis biomarker of colorectal carcinoma and adenoma. Eight hundred thirty-three samples from the public database strongly support the gene is a promising biomarker.
Sample collection and pathological confirmation
In the Department of Gastroenterology of Peking University Third hospital from March 2015 to June 2016, we collected 18 LGA and 22 HGA specimens from patients who underwent endoscopic treatment for CA removal and obtained adjacent normal tissue specimens from 20 patients with adenoma during the treatment. Tissue specimens were embedded in paraffin, sectioned and stained with hematoxylin and eosin, and confirmed by pathologist by light microscopy. All the patients were treatment naive before their surgeries. Clinical information of patients, and sample position in corresponding microarray are provided in Additional file 1: Table S5 and S6.
DNA isolation and bisulfite conversion
DNA was isolated using QIAmp DNA Mini Kit (Qiagen, Hilden, Germany) according to the manufacturer’s protocol. Bisulfite conversion was performed using the EZ DNA Methylation-Gold Kit according to the instruction manual (Zymo Research, Irvine, CA, USA).
Methylation data processing
Epigenome-wide DNA methylation assessment for this study was performed using the Illumina Infinium Human Methylation 450 BeadChip (Illumina, San Diego, CA, USA), which simultaneously profiles the methylation status for > 485,000 CpG sites at single-nucleotide resolution and covers 96% of CpG islands with additional coverage of island shores (< 2 Kb from CpG Islands), island shelves (2–4 Kb from CpG islands), and regions flanking them. The raw data from the array was processed using the GenomeStudio Methylation (version 1.8, Illumina) module which calculated methylation levels. The GenomeStudio is the software for array data processing of Illumina, which integrates data normalization, background adjustment, and methylation calculation. Normalization was performed by comparing control probes when the option was set as controls, and background adjustment was performed automatically by the software selecting Subtract Background. The distribution of beta values before and after normalization across all was analyzed (Additional file 1: Fig. S8), and multi-dimensional scaling (MDS) according to 10,000 most variable positions showed the homogeneity of samples and their clustering according to pathological groups. Beta MDS were also analyzed according to 1000 and 20,000 most variable positions for all samples before and after normalization (Additional file 1: Fig. S9). The methylation status for each CpG site was calculated as the ratio of fluorescent signals (β = Max(M,0)/[Max(M,0) + Max(U,0) + 100]), ranging from 0 to 1 using the average probe intensity for the methylated (M) and unmethylated (U) alleles. β = 1 indicates complete methylation; β = 0 represents no methylation. Probes located on sex chromosomes or failed detection P value testing of at least one sample or SNP (single-nucleotide polymorphism) were removed from analysis using R package IMA (vision 3.1.2) . DMRs were defined as rank sum test following false discovery rate (FDR) adjusted P value < 0.05 and |∆β| > 0.15, and DMSs were defined as rank sum test following FDR adjusted P value < 0.05 and |∆β| > 0.20. Promoter regions were defined as 5′UTR, TSS200, TSS1500, and first exons.
Public datasets and processing
To ensure consistency of data processing, we only compared our samples with publically accessible samples with raw idat files. GSE68060, GSE68838, GSE77954, GSE77965, GSE81211, GSE101764, GSE107352, and GSE75546 were collected from GEO while E-MTAB-6450 was collected from ArrayExpress [43,44,45,46,47,48] (Additional file 1: Table S6). Some cell line samples and metastatic cancer samples were removed upon further study. In total, we collected 278 normal samples, 51 adenoma samples, and 504 cancer samples. All datasets using raw idat files were preprocessed using R package minfi (vision 1.28.4) . The sites which failed detection at P = 0.01 were rewritten to the nearest neighbor average to ensure an adequate number of sites for analysis. Six hundred fifty-six cases of whole blood data were collected from GEO (accession number GSE40279).
Comparison of the ability of discrimination between normal, LGA, HGA, and CRC tissue
For random forest prediction, we used R package randomForest (vision 4.6.14) with the number of trees set at 5000 . For neural network prediction, we used R package nnet (vision 7.3.12) with number of units in the hidden layer as 2, weight decay as 10−4, and with a maximum number of iterations at 400 . The R package pROC (vision 1.14.0) was used for ROC analysis to compare the abilities of various models to distinguish between hyper- and hypo-methylated sites by the area under the curve (AUC) analysis .
t-SNE analysis, PCA analysis, and gene enrichment analysis
t-Distributed stochastic neighbor embedding (t-SNE) analysis was performed by R package t-sne (vision 0.1-3) . PCA was performed by R function princomp and visualized by first two principal components. KEGG and GO enrichment were analyzed online by DAVID 6.8 (https://david.ncifcrf.gov) [54, 55]. Ingenuity Pathway Analysis (IPA) was also used for enrichment analysis for more elaborate results with the P value cutoff set at 0.05 .
Availability of data and materials
All methylation array data are available at GEO under accession number GSE139404. Other public data involved in this study included GSE68060, GSE68838, GSE77954, GSE77965, GSE81211, GSE101764, GSE107352, GSE75546, GSE40279, and E-MTAB-6450.
LGA vs Normal
Comparison of low-grade adenoma with normal tissue
HGA vs Normal
Comparison of high-grade adenoma with normal tissue
HGA vs LGA
Comparison of high-grade adenoma with low-grade adenoma
Different methylation region
Different methylation site
Receiver operating characteristic
Area under the curve
Ingenuity Pathway Analysis
Kyoto Encyclopedia of Genes and Genomes
t-distributed stochastic neighbor embedding
Principal components analysis
Mean beta values
False discovery rate
5′ Untranslated region
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2018. CA Cancer J Clin. 2018;68(1):7–30.
Chen W, Zheng R, Baade PD, Zhang S, Zeng H, Bray F, Jemal A, Yu XQ, He J. Cancer statistics in China, 2015. CA Cancer J Clin. 2016;66(2):115–32.
Kuipers EJ, Grady WM, Lieberman D, Seufferlein T, Sung JJ, Boelens PG, van de Velde CJ, Watanabe T. Colorectal cancer. Nat Rev Dis Primers. 2015;1:15065.
Guo S, Diep D, Plongthongkum N, Fung HL, Zhang K, Zhang K. Identification of methylation haplotype blocks aids in deconvolution of heterogeneous tissue samples and tumor tissue-of-origin mapping from plasma DNA. Nat Genet. 2017;49(4):635–42.
Wang X, Wang L, Guo S, Bao Y, Ma Y, Yan F, Xu K, Xu Z, Jin L, Lu D, et al. Hypermethylation reduces expression of tumor-suppressor PLZF and regulates proliferation and apoptosis in non-small-cell lung cancers. FASEB journal : official publication of the Federation of American Societies for Experimental Biology. 2013;27(10):4194–203.
Guo S, Yan F, Xu J, Bao Y, Zhu J, Wang X, Wu J, Li Y, Pu W, Liu Y, et al. Identification and validation of the methylation biomarkers of non-small cell lung cancer (NSCLC). Clin Epigenetics. 2015;7:3.
Zhao Y, Xue F, Sun J, Guo S, Zhang H, Qiu B, Geng J, Gu J, Zhou X, Wang W, et al. Genome-wide methylation profiling of the different stages of hepatitis B virus-related hepatocellular carcinoma development in plasma cell-free DNA reveals potential biomarkers for early detection and high-risk monitoring of hepatocellular carcinoma. Clin Epigenetics. 2014;6(1):30.
Patai AV, Molnár B, Kalmár A, Schöller A, Tóth K, Tulassay Z. Role of DNA methylation in colorectal carcinogenesis. Dig Dis. 2012;30(3):310–5.
Grady WM, Carethers JM. Genomic and epigenetic instability in colorectal cancer pathogenesis. Gastroenterology. 2008;135(4):1079–99.
Hidaka H, Higashimoto K, Aoki S, Mishima H, Hayashida C, Maeda T, Koga Y, Yatsuki H, Joh K, Noshiro H, et al. Comprehensive methylation analysis of imprinting-associated differentially methylated regions in colorectal cancer. Clin Epigenetics. 2018;10(1):150.
Shi YX, Wang Y, Li X, Zhang W, Zhou HH, Yin JY, Liu ZQ. Genome-wide DNA methylation profiling reveals novel epigenetic signatures in squamous cell lung cancer. BMC Genomics. 2017;18(1):901.
Lindqvist BM, Wingren S, Motlagh PB, Nilsson TK. Whole genome DNA methylation signature of HER2-positive breast cancer. Epigenetics. 2014;9(8):1149–62.
Raggi C, Invernizzi P. Methylation and liver cancer. Clin Res Hepatol Gastroenterol. 2013;37(6):564–71.
Jones PA. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet. 2012;13(7):484–92.
Morris MR, Latif F. The epigenetic landscape of renal cancer. Nat Rev Nephrol. 2017;13(1):47–60.
Herman JG, Merlo A, Mao L, Lapidus RG, Issa J-PJ, Davidson NE, Sidransky D, Baylin SB. Inactivation of the CDKN2/p16/MTS1 gene is frequently associated with aberrant DNA methylation in all common human cancers. Cancer Res. 1995;55(20):4525.
Kane MF, Loda M, Gaida GM, Lipman J, Mishra R, Goldman H, Jessup JM, Kolodner R. Methylation of the hMLH1 promoter correlates with lack of expression of hMLH1 in sporadic colon tumors and mismatch repair-defective human tumor cell lines. Cancer Res. 1997;57(5):808.
Yoshiura K, Kanai Y, Ochiai A, Shimoyama Y, Sugimura T, Hirohashi S. Silencing of the E-cadherin invasion-suppressor gene by CpG methylation in human carcinomas. Proc Natl Acad Sci. 1995;92(16):7416.
Witold K, Anna K, Maciej T, Jakub J. Adenomas - genetic factors in colorectal cancer prevention. Rep Pract Oncol Radiother. 2018;23(2):75–83.
Zauber AG, Winawer SJ, O'Brien MJ, Lansdorp-Vogelaar I, van Ballegooijen M, Hankey BF, Shi W, Bond JH, Schapiro M, Panish JF, et al. Colonoscopic polypectomy and long-term prevention of colorectal-cancer deaths. N Engl J Med. 2012;366(8):687–96.
Patai Á, Valcz G, Hollósi P, Kalmár A, Péterfia B, Patai Á, Wichmann B, Spisák S, Barták BK, Leiszter K, et al. Comprehensive DNA methylation analysis reveals a common ten-gene methylation signature in colorectal adenomas and carcinomas. PLoS One. 2015;10(8):e0133836.
Schlemper RJ, Riddell RH, Kato Y, Borchard F, Cooper HS, Dawsey SM, Dixon MF, Fenoglio-Preiser CM, Flejou JF, Geboes K, et al. The Vienna classification of gastrointestinal epithelial neoplasia. Gut. 2000;47(2):251–5.
Rex DK, Johnson DA, Anderson JC, Schoenfeld PS, Burke CA, Inadomi JM. American College of G: American College of Gastroenterology guidelines for colorectal cancer screening 2009 [corrected]. Am J Gastroenterol. 2009;104(3):739–50.
Perez-Silva JG, Araujo-Voces M, Quesada V. nVenn: generalized, quasi-proportional Venn and Euler diagrams. Bioinformatics. 2018;34(13):2322–4.
Clemmensen C, Muller TD, Woods SC, Berthoud HR, Seeley RJ. Tschop MH: gut-brain cross-talk in metabolic control. Cell. 2017;168(5):758–74.
Kramer A, Green J, Pollard J Jr, Tugendreich S. Causal analysis approaches in Ingenuity Pathway Analysis. Bioinformatics. 2014;30(4):523–30.
Church TR, Wandell M, Lofton-Day C, Mongin SJ, Burger M, Payne SR, Castanos-Velez E, Blumenstein BA, Rosch T, Osborn N, et al. Prospective evaluation of methylated SEPT9 in plasma for detection of asymptomatic colorectal cancer. Gut. 2014;63(2):317–25.
Kulis M, Esteller M. DNA methylation and cancer. Adv Genet. 2010;70:27–56.
Straussman R, Nejman D, Roberts D, Steinfeld I, Blum B, Benvenisty N, Simon I, Yakhini Z, Cedar H. Developmental programming of CpG island methylation profiles in the human genome. Nat Struct Mol Biol. 2009;16(5):564–71.
Swami T, Weber HC. Updates on the biology of serotonin and tryptophan hydroxylase. Curr Opin Endocrinol Diabetes Obes. 2018;25(1):12–21.
Xiaolong G, Junhai P, Yichang L, Hongkan W, Wei Z, Xianfa W. Intestinal crosstalk between microbiota and serotonin and its impact on gut motility. Curr Pharm Biotechnol. 2018;19(3):190–5.
Berke JD. What does dopamine mean? Nat Neurosci. 2018;21(6):787–93.
Jeong S, Zheng B, Wang H, Xia Q, Chen L. Nervous system and primary liver cancer. Biochim Biophys Acta Rev Cancer. 2018;1869(2):286–92.
Wang K, Zhao XH, Liu J, Zhang R, Li JP. Nervous system and gastric cancer. Biochim Biophys Acta Rev Cancer. 1873;2020(1):188313.
Pan Y, Liu G, Zhou F, Su B, Li Y. DNA methylation profiles in cancer diagnosis and therapeutics. Clin Exp Med. 2018;18(1):1–14.
Deng Y, Wang Z, Gu S, Ji C, Ying K, Xie Y, Mao Y. Cloning and characterization of a novel human alcohol dehydrogenase gene (ADHFe1). DNA Seq. 2002;13(5):301–6.
Moon JW, Lee SK, Lee YW, Lee JO, Kim N, Lee HJ, Seo JS, Kim J, Kim HS, Park SH. Alcohol induces cell proliferation via hypermethylation of ADHFE1 in colorectal cancer cells. BMC Cancer. 2014;14:377.
Tae CH, Ryu KJ, Kim SH, Kim HC, Chun HK, Min BH, Chang DK, Rhee PL, Kim JJ, Rhee JC, et al. Alcohol dehydrogenase, iron containing, 1 promoter hypermethylation associated with colorectal cancer differentiation. BMC Cancer. 2013;13:142.
Tóth K, Sipos F, Kalmár A, Patai AV, Wichmann B, Stoehr R, Golcher H, Schellerer V, Tulassay Z, Molnár B. Detection of methylated SEPT9 in plasma is a reliable screening method for both left- and right-sided colon cancers. PLoS One. 2012;7(9):e46000.
Tóth K, Wasserkort R, Sipos F, Kalmár A, Wichmann B, Leiszter K, Valcz G, Juhász M, Miheller P, Patai Á, et al. Detection of methylated septin 9 in tissue and plasma of colorectal patients with neoplasia and the relationship to the amount of circulating cell-free DNA. PLoS One. 2014;9(12):e115415.
Luo H, Zhao Q, Wei W, Zheng L, Yi S, Li G, Wang W, Sheng H, Pu H, Mo H, et al. Circulating tumor DNA methylation profiles enable early diagnosis, prognosis prediction, and screening for colorectal cancer. Sci Transl Med. 2020;12:524.
Wang D, Yan L, Hu Q, Sucheston LE, Higgins MJ, Ambrosone CB, Johnson CS, Smiraglia DJ, Liu S. IMA: an R package for high-throughput analysis of Illumina's 450 K Infinium methylation data. Bioinformatics. 2012;28(5):729–30.
Qu X, Sandmann T, Frierson H Jr, Fu L, Fuentes E, Walter K, Okrah K, Rumpel C, Moskaluk C, Lu S, et al. Integrated genomic analysis of colorectal cancer progression reveals activation of EGFR through demethylation of the EREG promoter. Oncogene. 2016;35(50):6403–15.
consortium B. Quantitative comparison of DNA methylation assays for biomarker development and clinical applications. Nat Biotechnol. 2016;34(7):726–37.
Kang K, Bae JH, Han K, Kim ES, Kim TO, Yi JM. A genome-wide methylation approach identifies a new hypermethylated gene panel in ulcerative colitis. Int J Mol Sci. 2016;17:8.
Barrow TM, Klett H, Toth R, Bohm J, Gigic B, Habermann N, Scherer D, Schrotz-King P, Skender S, Abbenhardt-Martin C, et al. Smoking is associated with hypermethylation of the APC 1A promoter in colorectal cancer: the ColoCare Study. J Pathol. 2017;243(3):366–75.
Damaso E, Castillejo A, Arias MDM, Canet-Hermida J, Navarro M, Del Valle J, Campos O, Fernandez A, Marin F, Turchetti D, et al. Primary constitutional MLH1 epimutations: a focal epigenetic event. Br J Cancer. 2018;119(8):978–87.
Bormann F, Rodriguez-Paredes M, Lasitschka F, Edelmann D, Musch T, Benner A, Bergman Y, Dieter SM, Ball CR, Glimm H, et al. Cell-of-origin DNA methylation signatures are maintained during colorectal carcinogenesis. Cell Rep. 2018;23(11):3407–18.
Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta C, Feinberg AP, Hansen KD, Irizarry RA. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014;30(10):1363–9.
Wiener ALaM. Classification and regression by randomForest. R News. 2002;2:18–22.
Ripley WNVaBD. Modern applied statistics with S, Fourth edn. New York: Springer; 2002.
Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, Muller M. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77.
Hinton GE. Visualizing high-dimensional data using t-SNE. J Mach Learn Res. 2008;9(2):2579–605.
da Huang W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57.
Huang DW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37(1):1–13.
The authors gratefully acknowledge Dr. Steven J Schrodi, Dr. Emily A. Andreae and Dr. Ingrid Glurich from the Center for Precision Medicine Research (CPMR), Marshfield Clinic Research Institute (MCRI), for reviewing, commenting on, and editing their manuscript.
This study is funded by the Youth Innovation Promotion Association CAS (2016098), Major State Basic Research Development Program (2014CB542006), Key Research Program of the Chinese Academy of Sciences (KJZD-EW-L14), and National Key Research and Development Plan of China (2016YFA0201404).
Ethics approval and consent to participate
The study protocol conformed to the ethical guidelines of the 1975 Declaration of Helsinki and was approved by the Ethics Committee of Peking University Third hospital (IRB number: 206H005). Informed written consent was obtained from all the patients and volunteers prior to the procedure.
Consent for publication
The authors disclose no potential competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Fan, J., Li, J., Guo, S. et al. Genome-wide DNA methylation profiles of low- and high-grade adenoma reveals potential biomarkers for early detection of colorectal carcinoma. Clin Epigenet 12, 56 (2020). https://doi.org/10.1186/s13148-020-00851-3
- DNA methylation
- Low-grade adenoma
- High-grade adenoma
- Colorectal cancer