Skip to main content


Genome-wide DNA methylation profiling integrated with gene expression profiling identifies PAX9 as a novel prognostic marker in chronic lymphocytic leukemia



In chronic lymphocytic leukemia (CLL), epigenomic and genomic studies have expanded the existing knowledge about the disease biology and led to the identification of potential biomarkers relevant for implementation of personalized medicine. In this study, an attempt has been made to examine and integrate the global DNA methylation changes with gene expression profile and their impact on clinical outcome in early stage CLL patients.


The integration of DNA methylation profile (n = 14) with the gene expression profile (n = 21) revealed 142 genes as hypermethylated-downregulated and; 62 genes as hypomethylated-upregulated in early stage CLL patients compared to CD19+ B-cells from healthy individuals. The mRNA expression levels of 17 genes identified to be differentially methylated and/or differentially expressed was further examined in early stage CLL patients (n = 93) by quantitative real time PCR (RQ-PCR). Significant differences were observed in the mRNA expression of MEIS1, PMEPA1, SOX7, SPRY1, CDK6, TBX2, and SPRY2 genes in CLL cells as compared to B-cells from healthy individuals. The analysis in the IGHV mutation based categories (Unmutated = 39, Mutated = 54) revealed significantly higher mRNA expression of CRY1 and PAX9 genes in the IGHV unmutated subgroup (p < 0.001). The relative risk of treatment initiation was significantly higher among patients with high expression of CRY1 (RR = 1.91, p = 0.005) or PAX9 (RR = 1.87, p = 0.001). High expression of CRY1 (HR: 3.53, p < 0.001) or PAX9 (HR: 3.14, p < 0.001) gene was significantly associated with shorter time to first treatment. The high expression of PAX9 gene (HR: 3.29, 95% CI 1.172–9.272, p = 0.016) was also predictive of shorter overall survival in CLL.


The DNA methylation changes associated with mRNA expression of CRY1 and PAX9 genes allow risk stratification of early stage CLL patients. This comprehensive analysis supports the concept that the epigenetic changes along with the altered expression of genes have the potential to predict clinical outcome in early stage CLL patients.


Chronic lymphocytic leukemia (CLL) arises from a malignant clone of B cells due to altered control of apoptosis and dysregulated rate of proliferation. Its progression is characterized by clonal proliferation and accumulation of mature neoplastic CD5+ B lymphocytes [1]. The clinical course of CLL patients is extremely variable with some patients progressing rapidly as compared to others and ultimately, requiring therapeutic intervention. Several biomarkers including immunoglobulin heavy chain variable (IGHV) gene mutations that segregate CLL patients into low and high-risk clinical groups are widely used to assess the prognosis of these patients. Low-risk patients generally display mutated IGHV gene, low CD38, and low ζ chain associated protein kinase-70 (ZAP-70) expression, while high-risk cases exhibit the reverse pattern [2,3,4,5,6].

Altered DNA methylation is one of the hallmark events in cancer. The first evidence of DNA methylation in CLL was presented by Wahlfors et al. [7] in which a global loss of methylation was reported. In addition to global hypomethylation, hypermethylation of individual gene promoters has also been reported in CLL [7,8,9,10,11]. Methylation of TWIST2 and ZAP-70 exhibited a strong association with the IGHV-mutated status [9, 12] whereas methylation of HOXA4 gene was predominantly associated with the IGHV unmutated status [13]. Further studies employing genome wide methylation profiling technologies have revealed association of differential methylation patterns with prognostic subgroups based on the IGHV mutation status [14,15,16], CD38 levels [17], ZAP-70 levels [16], immunogenetic subsets [18], and 17p-deletion status [19].

Earlier, DNA hypermethylation was thought to affect the expression of a gene negatively but the emerging research has suggested that the function and effect of DNA methylation is contextual, and the relationship between DNA methylation and transcription is more complex [20]. In CLL, although association of differential methylation patterns with specific prognostic subgroups in earlier reports highlights the potential of altered gene methylation as a tool to predict clinical outcome, further research is required to establish the relationship between the epigenome and the transcriptome. The present study was carried out to correlate the DNA methylation patterns with gene expression profile and to assess the prognostic implications of such correlations on clinical outcome in 93 early stage CLL patients.


Patient selection

Treatment naive early stage (Rai 0-II) CLL patients (n = 100) were enrolled in the study after obtaining informed consent as per the guidelines of the institute ethics committee. According to the staging criteria outlined by Rai et al. [21], 24 patients were in stage 0, 33 were in stage I and 43 were in stage II. Fourteen randomly selected CLL samples and pooled CD19+ B-cells from 10 healthy individuals were profiled for methylation. Gene expression profiling was carried out in 21 CLL samples and pooled CD19+ B-cells from 10 healthy individuals. All the CLL samples had at least ≥65% CLL phenotype cells. The clinical and laboratory characteristics of the CLL patients analysed using methylation and gene expression arrays are provided in Table 1. The mRNA expression of 17 of the genes identified to be differentially methylated and /or differentially expressed was validated using SYBR-green based RQ-PCR in 93 (Unmutated = 39, Mutated = 54) CLL patients. The median age of the CLL patients was 60 years (range 35–80 years). With a median follow-up time of 22 months (range 1-124 months), 46 patients required treatment [median time to treatment: 14 months (range 0–92 months)] and 18 patients died. On the basis of international prognostic index (IPI) score [22], 11/93 patients were assigned as low risk, 34/93 as intermediate risk, 43/93 as high risk, and 5/93 as very high risk patients.

Table 1 Clinical and laboratory characteristics of the CLL patients evaluated using methylation and gene expression arrays

IGHV mutation status

IGHV gene family usage was evaluated as per BIOMED-2 protocol [23] and the patients were assigned to IGHV mutated or unmutated subgroups based on the IGHV sequence homology (cut-off = 98%) as determined by the international ImmunoGeneTics database (IMGT; http://, Montpellier, France).

Methylated CpG island microarrays

Genomic DNA was extracted from the peripheral blood mononuclear cells (PBMC) of CLL patients (n = 14) and CD19+ sorted cells pooled from 10 healthy individuals. To isolate the CD19+ cells, mononuclear cells isolated from peripheral blood of healthy individuals were incubated with CD19 + magnetic microbeads and processed according to the manufacturer’s protocol (Milteneyi Biotech, Gladbach, Germany). In healthy individuals, CD19+ cells constitute 2-3% of the leukocyte fraction and therefore, sorted CD19+ B-cells from healthy individuals were used. In the CLL samples evaluated for microarrays, CD19+ cells constituted at least ≥65% of the leukocytes and the PBMC fraction from CLL patients was used for the study.

For methylated CpG island microarrays, 6 μg of genomic DNA was digested with Mse I restriction enzyme (New England Biolabs Inc., Ipswich, MA, USA) and labelled with anti-5 methyl cytidine antibody (Abcam, Cambridge, UK). One fraction of the labelled DNA was immunoprecipitated while the other was used as input DNA. Both the input and immunoprecipitated fractions were purified followed by whole genome amplification (WGA, Sigma Aldrich, St. Louis, MO, USA), labelled with Cy3- and Cy5-dUTP, respectively, and hybridized on 1x244K human promoter chIP-on-chip microarray slides as per the manufacturer’s recommendations (Agilent Technologies, Santa Clara, CA, USA). The slides were washed and scanned on the Agilent DNA microarray scanner D and the data was extracted with Feature Extraction® software FE version 11.5 (Agilent Technologies, Santa Clara, CA, USA).

Gene expression microarray

Total RNA obtained from PBMC of CLL patients (n = 21) and CD19+ sorted cells pooled from 10 healthy individuals was amplified and simultaneously labelled with Cy3-CTP using low input quick amp labelling kit (Agilent Technologies, Santa Clara, CA, USA). The labelled product was finally hybridized to SurePrint G3 Human Gene Expression 8x60K microarray slide as per manufacturer’s recommendation (Agilent Technologies, Santa Clara, CA, USA). The slides were washed and scanned on the Agilent DNA microarray scanner D and the data was extracted with Feature Extraction® software FE version 11.5 (Agilent Technologies, Santa Clara, CA, USA). These samples included seven CLL samples profiled for DNA methylation status.

Bisulfite genome sequencing

Genomic DNA (2 μg) was bisulfite modified and purified using Epitect Bisulfite kit as per the manufacturer’s instructions (Qiagen, Hilden , Germany). The bisulfite converted DNA was amplified for two CpG islands in PAX9 gene as depicted in Fig. 1 and sequenced with BigDye Terminator v3.1 Cycle Sequencing kit (Applied Biosystems, CA, USA) with primers designed using MethPrimer ( The percent methylation levels were computed and further analysed with Bisulfite Sequencing DNA Methylation Analysis (BISMA) software (

Fig. 1

Location of CpG islands studied for PAX9 gene methylation. a UCSC browser view of PAX9 gene (chromosome 14q13.3). The probes used for methylation microarrays were specific for CpG islands 121, 129, 39, and 76. b MethPrimer based CpG prediction and primer design for bisulfite gene sequencing for two CpG islands (3 and 7) located at PAX9 upstream region. c Bisulfite sequencing of CpG islands 3 and 7 was performed in 21 and 23 CLL patients respectively and in five healthy controls. A representative electropherogram depicting two methylated (C) or unmethylated (T) CpG sites in island 3 located in 5’ region of PAX9 is shown

Real-time quantitative PCR (RQ-PCR)

The mRNA expression based microarray findings were validated using RQ-PCR in an independant cohort of 93 early stage CLL patients for 17 genes with gene-specific primers (Additional file 1: Table S1). The experiments were performed using SYBR Green Master Mix according to the manufacturer’s protocol on Mx3005P (Agilent Technologies, Santa Clara, CA, USA). The fold change was calculated using 2-ΔΔCt method with beta-actin as an endogenous control. The Receiver’s operating characteristic (ROC) curve-derived cut-off values were used to define high or low mRNA expression levels.

Bioinformatics analysis and statistics

Methylation array data was analyzed using Genomic Workbench version 7.0 (Agilent Technologies, Santa Clara, CA, USA). On the basis of melt temperature, log-ratio data for each probe was normalized. By taking into account the Gaussian-fit curves, Z score was generated for each sample and p values were calculated. The p values were then used to determine the log-odds score for each probe. The differentially hypermethylated and hypomethylated probes between groups were filtered based on the minimum value of log2-fold change (log2FC) between the groups =0.25, p < 0.05 and the false discovery rates (FDR) of 0.2 [24].The probes with log2FC ≤ (-)0.25 were considered hypomethylated and ≥ (+)0.25 were considered hypermethylated.

The gene expression data across all arrays was log2 transformed and normalized using quantile normalization and analyzed by the Lima library from R-Bioconductor. Probes with an adjusted p-value less than 0.05 and log2FC of 1 were selected.

The correlation of log-odds values obtained from the DNA methylation arrays (p < 0.05, log2FC = 0.25) and the expression arrays for the identified genes was used as an indicator of the correlation between DNA methylation and gene expression. The probes showing hypomethylation (log2FC ≤ (-)0.25, p < 0.05) in conjunction with higher expression (log2 FC > 1, p < 0.05) between any two compared conditions were identified. Similarly, the probes exhibiting hypermethylation (log2FC ≥ 0.25, p < 0.05) in conjunction with lower expression (log2 FC < (-)1, p < 0.05) were also identified.

Receiver’s operating characteristic curve was used to calculate the cut-off value to determine the low and high expression of a particular gene. The differences in mRNA expression between the groups as obtained from RQ-PCR were compared using the Mann-Whitney Rank Sum test or Kruskal-Wallis One Way Analysis of Variance on Ranks. The relative risk (RR) of treatment initiation was assessed using the Chi-square statistic with Yate’s continuity correction. The time to first treatment (TTFT) and overall survival (OS) were compared between the groups using the Kaplan-Meier survival analysis followed by the log-rank test. Hazard ratio (HR) for each variable was calculated using the Cox proportional hazard regression (Sigma Plot Version 13.0, Systat Software, Inc.).

Data access

The DNA methylation as well as the mRNA expression data generated in the study have been submitted to the NCBI Gene Expression Omnibus (GEO) ( under accession number GSE81937.


Methylation profile

A comparison of differential methylation between CLL (n = 14) and normal CD19+ B-cells identified a total of 6129 probes to be differentially methylated which were further classified as hypermethylated (5254 probes, 2505 genes) or hypomethylated (875 probes, 753 genes). The differentially methylated probes that represented unknown genes, non-coding RNAs, hypothetical proteins, chromosomal loci, predicted open reading frames, and probes associated with sex-chromosomes were excluded from the downstream analysis. Among the differentially methylated probes, 53.8% of hypermethylated probes were located inside known gene bodies, 38.2% in the promoters, 2.6% in divergent promoters and 5.2% were located downstream of the known genes (Fig. 2a). The frequency distribution of the hypomethylated probes (Fig. 2b) was comparable to the hypermethylated probes. Of the differentially methylated probes, CpG sites were found in 73% of the hypermethylated probes and in 81% of the hypomethylated probes. The details pertaining to these probes, including the gene name, chromosomal location and distribution are provided in Additional file 1: Tables S2A and S2B.

Fig. 2

Distribution of a hypermethylated and b hypomethylated probes in CLL Vs. CD19+ normal controls

On the basis of gene functions, the CpG islands in the promoter regions of the tumor suppressor genes (KLF4, PTCH1, PAX5, PCDH10, RASSF10, IRX1, TBX5, ID4, SOX7, SLIT2) and the transcription factors (TWIST1, KLF4, TAL1, PAX2, PAX9, NR2F2, IRX4, MEIS1) were found to be hypermethylated. Approximately, 10% of the hypermethylated CpG promoters were located within the homeobox genes. Promoter regions of genes such as FOXD3, FOXE1, FOXG1, ID4, SLIT2, BNC1, SALL1, RIPK4, HAND2, SOX9, SOX11, NR2F2, TAL1, SIM2, PAX9, and TBX2 were also found to be hypermethylated in sync with earlier reported results in CLL [16, 19, 25]. In addition, hypomethylation was observed in the promoter region of NFATC1 and inside gene body in NOTCH1, SFRP1, and GPS as has been reported in earlier studies in CLL [19, 25]. Using the Database for Annotation, Visualization and Integrated Discovery (DAVID) functional analysis tool, the differentially methylated genes were evaluated for the overrepresented Gene Ontology (GO) categories and the most significant overrepresented GO biological processes were found to be related to regulation of transcription (p < 0.0001) [26, 27].

To identify the association of differential methylation profile with the IGHV mutation status, the methylation array data from 9 IGHV unmutated and 5 IGHV mutated cases was compared. This analysis elicited a distinct signature of 56 hypermethylated (p < 0.05, log2FC ≥ 0.25) and 2402 hypomethylated probes (p < 0.05, log2FC ≤ (-)0.25) in unmutated CLL. The hypermethylated probes were distributed across 46 genes and spanned promoter regions of 10 genes (Additional file 1: Table S3A).Similarly, the hypomethylated probes spread across 1332 genes and spanned promoter regions of 399 genes (Additional file 1: Table S3B). Differential methylation of several genes previously reported in the IGHV mutation based subgroups [NCOR2, KCNJ2, SIX3, CHRM1, [16]], [NRF1, CRY1, KCNJ2, SOX5 [28]] was also noticed in the present study. In addition, differential CpG promoter hypomethylation of genes already known to influence clinical outcome in other malignancies was observed and includes EMILIN2 [29], TBX5 [30], CBX8 [31],OLIG2 [32], and PCDH10 [33]. The DAVID database was used to identify biological pathways for the differentially methylated genes. Four of the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways including circadian rhythm pathway (p = 0.002), calcium signalling pathway (p = 0.03), axon guidance (p = 0.02), and gap junction pathway (p = 0.04) were found to be significantly affected in the IGHV unmutated Vs. mutated subgroup.

Correlation of methylation and gene expression analysis

To investigate the possible influence of CpG methylation status on the expression level of the corresponding genes, the gene expression profiles were integrated with the DNA methylation profiles and co-analyzed. On comparing the data of CLL patients with healthy individuals, a negative correlation in CpG methylation and gene expression was observed for 211genes (Additional file 1: Table S4). Of these, 149 genes were hypermethylated and downregulated and 62 genes were hypomethylated and upregulated including AXIN2, ID4, EBF1, SOX4, SOX7, TAL1, PMEPA1, SPRY1, CDK6, and MEIS1. Pathway analysis using the genes having negative correlation for DNA methylation and gene expression in CLL Vs. normal CD19+ cells identified significant enrichment of three KEGG pathways which included p53 signalling pathway (p = 0.002), pathways in cancer (p = 0.005), and the cell cycle pathway (p = 0.007).

A comparison of the CpG methylation and gene expression profiles among the IGHV unmutated Vs. mutated patients identified 64 differentially expressed genes (Fig. 3) including BMPR2, CRY1, FGFR2, FOSB, INPP4B, PLD5, PAX9, RGS2, RIC8B, and VIPR1 (Additional file 1: Table S5).

Fig. 3

Supervised hierarchical clustering of hypermethylated-dowregulated and hypomethylated -upregulated genes selected on the basis of significant log2FC values in IGHV mutated (n = 05) Vs. IGHV unmutated (n = 09) CLL. The Euclidean hierarchical clustering was performed using Gene Spring Gx software version 13.5 and is based on normalized intensity values. Each row represents an individual patient and each column represents a gene. A gradient color scale ranging between blue (hypermethylated and downregulated) and red (hypomethylated and upregulated) is included

Of the various genes found to be differentially methylated and/or differentially expressed, a total of 17 genes (Table 2) were validated using RQ-PCR on a cohort of 93 (22 female: 71 male) early stage CLL patients and pooled CD19+ B cells from 10 healthy volunteers. The criteria for selection of these genes was negative correlation between CpG promoter methylation and gene expression in CLL Vs. normal (MEIS1, PMEPA1, SOX7, SPRY1, CDK6, ID4, AXIN2, TNRC18) and in the IGHV unmutated Vs. mutated subgroup (CRY1, VIPR1, PAX9, RIC8B). Other genes selected for validation included NFATC1 (hypomethylated in CLL), TBX2, TSHZ3 (hypermethylated in CLL), SPRY2 (upregulated in CLL) and BIK (downregulated in CLL). We focused on these five genes as they had previously been shown in the literature to be epigenetically influenced in CLL [NFATC1 [11]], or in other malignancies [BIK [34], SPRY2 [35], TBX2 [36, 37], TSHZ3 [38, 39]]. As expected, MEIS1, PMEPA1, SOX7, SPRY1, CDK6, TBX2 were significantly downregulated (p < 0.05) while SPRY2 (p = 0.016), VIPR1 (p = 0.04) and ID4 (p = 0.03) were significantly upregulated in CLL cells as compared to healthy B-cells. Though not significant, AXIN2 was upregulated and TNRC18, NFATC1 and BIK were downregulated in CLL as compared to healthy CD19+ cells (Table 2). The expression of only CRY1 and PAX9 differed significantly (p < 0.05) with respect to the IGHV mutation status (Table 2, Fig. 4)

Table 2 Comparison of mRNA levels of selected genes (median ∆Cq and median fold change) as assessed by real time RQ-PCR in CLL (n = 93; unmutated = 39; mutated = 54) and CD19+ sorted B-cells from healthy individuals (n = 10)
Fig. 4

Box-plot representation of mRNA expression fold change as assessed by RQ-PCR for a CRY1 gene in CLL and its IGHV mutated and unmutated subgroups b PAX9 gene in CLL and its IGHV mutated and unmutated subgroups. Box-plot graphs show median (middle line), interquartile range (box), 25–75th percentile (whiskers) and statistically significant difference (p value) estimated in comparison between different groups

The status of hypomethylation of PAX9 among unmutated CLL was confirmed through bisulfite genome sequencing of CpG island 3 in close proximity to CpG110 (Fig. 1).While CpG island 7 did not reveal any significant difference in methylation levels, the average % methylation at CpG island 3 was found to be 52.74% in mutated while 24.72% among unmutated CLL. This further corroborates with the reduced expression of PAX9 in unmutated group of CLL patients as established through microarray-based observations.

Association between gene expression and clinical outcome

Of the 17 genes evaluated for mRNA expression, CRY1 (p = 0.008) and PAX9 (p < 0.001) were expressed at higher levels in Rai stage I and II as compared to stage 0. A progressive increase in the expression of CRY1 (p = 0.004) and PAX9 (p < 0.001) was observed in increasing IPI score categories ranging from 1 to 4. We further explored the association between expression level of candidate genes with relative risk of treatment initiation, TTFT and OS (Table 3). The relative risk of treatment initiation was significantly higher with high expression of PAX9 (p = 0.001) or CRY1 (p = 0.005). The high expressions of both PAX9 (HR 3.14, 95% CI 1.589–6.205, p < 0.001) as well as CRY1 (HR 3.53, 95% CI 1.789–6.987, p < 0.001) were significantly associated with shorter TTFT (Fig. 5). However, high expression of only PAX9 gene (HR 3.29, 95% CI 1.172–9.272, p = 0.016) was significantly associated with shorter OS (Fig. 6).

Table 3 Association of relative risk of treatment initiation, time to first treatment and overall survival with the mRNA expression of 17 genes selected for validation in early stage CLL patients (n = 93)
Fig. 5

Kaplan-Meier survival curves representing time to first treatment (TTFT) in early stage CLL patients with a low (n = 47) and high (n = 46) mRNA expression of CRY1 b low (n = 46) and high (n = 46) mRNA expression of PAX9 and; c unmutated (n = 39) and mutated IGHV genes (n = 54)

Fig. 6

Kaplan-Meier survival curves representing overall survival in early stage CLL patients with a low (n = 47) and high (n = 46) mRNA expression of CRY1 b low (n = 46) and high (n = 46) mRNA expression of PAX9 and; c unmutated (n = 39) and mutated IGHV genes (n = 54)


Extremely variable clinical course of early stage CLL patients highlights the importance of well-described prognostic markers for clinical management of these patients. Various prognostic markers that are currently in use include IGHV mutational status [2], genomic abnormalities [3], expression of ZAP-70 [4], and CD38 [2]. Recent studies have associated specific DNA methylation signatures with specific prognostic subgroups in CLL [14,15,16, 19]. The present study has dealt with the methylation profiling of early stage CLL patients on the basis of their IGHV mutational status.

The study has identified differential methylation of several genes such as NCOR2, SIX3, CHRM1, NRF1, CRY1, KCNJ2, and SOX5 that have been reported earlier to be differentially methylated in the IGHV-gene based subgroups [16, 28]. Besides, an association of promoter hypomethylation of MYLK with the IGHV unmutated cases was also observed in the current study. Since, higher expression of MYLK is known to be significantly correlated with poor clinical outcome [40], it is plausible that promoter hypomethylation of MYLK in the IGHV unmutated cases might be associated with poor prognosis. Furthermore, differential CpG promoter hypomethylation of two important hematopoietic transcription factors MEIS1 and TAL1 which are known methylation targets in B-cell ALL was also observed [41].

An analysis of signalling pathway network for genes with perturbed methylation profiles observed among IGHV unmutated patients indicated the involvement of calcium signalling pathway. Previous studies have suggested that altered Ca2+ signalling contributes to major tumor progression events including proliferation, migration, invasion, and metastasis [42, 43]. Recently, Muggen et al. [44] demonstrated an association of the IGHV mutational status with the level of basal Ca2+ signalling in CLL. The present study provides evidence that aberrant methylation of genes involved in the calcium signalling pathway might be one of the mechanisms responsible for net differences in the basal Ca2+ signalling events.

In the present analysis, an inverse correlation between methylation and gene expression was observed for 209 genes in CLL including transcription factors (ID4, NFATC1, TBX2, TAL1, MEIS1), SPRY family members (SPRY1, SPRY2) and SOX family members (SOX4, SOX7). Correlation of promoter methylation of ID4 gene with shortened patient survival has been already documented in CLL [45]. An association of methylation of TBX2 [37] and SPRY2 [35] with disease progression has also been demonstrated in bladder cancer and B-cell diffuse lymphoma, respectively, but so far, no studies have been reported in CLL.

Screening of inversely correlated genes associated with IGHV mutation status revealed only 21 gene promoters to be significantly hypomethylated and upregulated in unmutated cases (Additional file 1: Table S5). One of these genes encodes for bone morphogenetic protein (BMP) receptor II which is a serine/threonine receptor kinase and has previously been shown to be involved in molecular pathogenesis of hematological malignancies including acute myelomonocytic leukemia, acute promyelocytic leukemia, multiple myeloma as well as CLL [46, 47]. Cell surface expression of BMP receptors (BMPRIA and BMPRIB) have been shown to be elevated in advanced stages of CLL [47]. In-vitro studies have shown that co-expression of BMPRII facilitates BMP binding to its receptors and therefore contributes to downstream biological functions [48, 49]. This is in line with the results of the present study wherein upregulated BMPRII gene expression and hypomethylation of BMPRII gene was noticed among unmutated subgroup of CLL patients.

Alterations in methylation status and associated gene expression levels of another gene CRY1 have also been reported in prognostically distinct subsets of CLL [50] as well as in CML [51]. Our study confirms the possible influence of hypomethylation and upregulated expression of CRY1 in prognostically poor IGHV unmutated CLL and further emphasises its role as potential biomarker for relative risk of treatment initiation and TTFT in early stage CLL. In addition to CRY1, three other circadian rhythm genes NPAS2, BHLHE40, and ARNTL were also observed to be hypomethylated in the unmutated subgroup [52].

PAX9 is one of the nine members of "paired box” (PAX)-containing transcription factor family and its inhibition has been shown to induce apoptosis with increased cleavage of caspase-3 and PARP, increased expression of BAX and decreased expression of BCL-2 in oral squamous cell carcinoma [53]. In the recent years, it has emerged as one of the biomarkers of cell proliferation in lung cancer [54]. A significant association of PAX9 expression with stage, IPI score, relative risk of treatment initiation, TTFT and OS in the present study strengthens its role as an important marker of prognosis in CLL as well.

Since levels of expression of either PAX9 or CRY1 did not show significant difference in CLL patients when compared to healthy controls but rather between patients subgrouped on the basis of the IGHV mutational status, it is plausible that these two genes may be involved in progression of CLL rather than development of the disease. This explanation is further supported by progressively increasing gene expression levels of PAX9 and CRY1 in coherence with advanced Rai stage and higher IPI scores. The mechanism(s) underlying such an influence of these two genes in CLL pathogenesis are not known but might involve apoptotic [53, 55,56,57], or analogous pathways involved in cancer.

Besides, several aberrantly methylated genes were also identified in IGHV mutational status based subgroups which could serve as potential markers in CLL. The major limitation of the present study was that a limited number of genes were evaluated in a small cohort of early stage CLL patients. Further studies on large cohorts of early stage CLL patients for expression patterns of additional set of genes are required that may help in characterizing the functional role of the genes identified in the present study. Identification of relevant epigenetically influenced genes that have an impact on gene expression as well as clinical outcome may pave way for identification and development of therapeutically relevant drug targets.


The present study confirms the prognostic role of CRY1 in CLL as its aberrant methylation and expression is associated with high risk of treatment initiation and shorter time to first treatment. In addition, this study highlights PAX9 as a novel marker of prognostication in CLL as its expression was significantly associated with high risk of treatment initiation, shorter time to first treatment and overall survival.


  1. 1.

    Chiorazzi N, Rai KR, Ferrarini M. Chronic lymphocytic leukemia. N Engl J Med. 2005;352:804–15.

  2. 2.

    Damle RN, Wasil T, Fais F, Ghiotto F, Valetto A, Allen SL, et al. IgV gene mutation status and CD38 expression as novel prognostic indicators in chronic lymphocytic leukemia. Blood. 1999;94:1840–7.

  3. 3.

    Krober A, Seiler T, Benner A, Bullinger L, Brückle E, Lichter P, et al. V (H) mutation status, CD38 expression level, genomic aberrations, and survival in chronic lymphocytic leukemia. Blood. 2002;100:1410–6.

  4. 4.

    Dürig J, Nückel H, Cremer M, Führer A, Halfmeyer K, Fandrey J, et al. ZAP-70 expression is a prognostic factor in chronic lymphocytic leukemia. Leukemia. 2003;17:2426–34.

  5. 5.

    Gentile M, Mauro FR, Calabrese E, De Propris MS, Giammartini E, Mancini F, et al. The prognostic value of CD38 expression in chronic lymphocytic leukemia patients studied prospectively at diagnosis: a single institute experience. Br J Haematol. 2005;130:549–57.

  6. 6.

    Eisele L, Haddad T, Sellmann L, Dührsen U, Dürig J. Expression levels of CD38 on leukemic B cells but not on non-leukemic T cells are comparably stable over time and predict the course of disease in patients with chronic lymphocytic leukemia. Leuk Res. 2009;33:775–8.

  7. 7.

    Wahlfors J, Hiltunen H, Heinonen K, Hämäläinen E, Alhonen L, Jänne J. Genomic hypomethylation in human chronic lymphocytic leukemia. Blood. 1992;80:2074–80.

  8. 8.

    Rush LJ, Raval A, Funchain P, Johnson AJ, Smith L, Lucas DM, et al. Epigenetic profiling in chronic lymphocytic leukemia reveals novel methylation targets. Cancer Res. 2004;64:2424–33.

  9. 9.

    Raval A, Lucas DM, Matkovic JJ, Bennett KL, Liyanarachchi S, Young DC, et al. TWIST2 demonstrates differential methylation in immunoglobulin variable heavy chain mutated and unmutated chronic lymphocytic leukemia. J Clin Oncol. 2005;23:3877–85.

  10. 10.

    Raval A, Byrd JC, Plass C. Epigenetics in chronic lymphocytic leukemia. Semin Oncol. 2006;33:157–66.

  11. 11.

    Liu TH, Raval A, Chen SS, Matkovic JJ, Byrd JC, Plass C. CpG island methylation and expression of the secreted frizzled-related protein gene family in chronic lymphocytic leukemia. Cancer Res. 2006;66:653–8.

  12. 12.

    Corcoran M, Parker A, Orchard J, Davis Z, Wirtz M, Schmitz OJ, et al. ZAP-70 methylation status is associated with ZAP-70 expression status in chronic lymphocytic leukemia. Haematologica. 2005;90:1078–88.

  13. 13.

    Strathdee G, Sim A, Parker A, Oscier D, Brown R. Promoter hypermethylation silences expression of the HoxA4 gene and correlates with IgVh mutational status in CLL. Leukemia. 2006;20:1326–9.

  14. 14.

    Kanduri M, Cahill N, Goransson H, Enström C, Ryan F, Isaksson A, et al. Differential genome-wide array-based methylation profiles in prognostic subsets of chronic lymphocytic leukemia. Blood. 2010;115:296–305.

  15. 15.

    Cahill N, Bergh AC, Kanduri M, Göransson-Kultima H, Mansouri L, Isaksson A, et al. 450 K-array analysis of chronic lymphocytic leukemia cells reveals global DNA methylation to be relatively stable over time and similar in resting and proliferative compartments. Leukemia. 2013;27:150–8.

  16. 16.

    Ronchetti D, Tuana G, Rinaldi A, Agnelli L, Cutrona G, Mosca L, et al. Distinct patterns of global promoter methylation in early stage chronic lymphocytic leukemia. Genes Chromosomes Cancer. 2014;53:264–73.

  17. 17.

    Rahmatpanah FB, Carstens S, Hooshmand SI, Welsh EC, Sjahputera O, Taylor KH, et al. Large-scale analysis of DNA methylation in chronic lymphocytic leukemia. Epigenomics. 2009;1:39–61.

  18. 18.

    Kanduri M, Marincevic M, Halldórsdóttir AM, Mansouri L, Junevik K, Ntoufa S, et al. Distinct transcriptional control in major immunogenetic subsets of chronic lymphocytic leukemia exhibiting subset-biased global DNA methylation profiles. Epigenetics. 2012;7:1435–42.

  19. 19.

    Tong WG, Wierda WG, Lin E, Kuang SQ, Bekele BN, Estrov Z, et al. Genome-wide DNA methylation profiling of chronic lymphocytic leukemia allows identification of epigenetically repressed molecular pathways with clinical impact. Epigenetics. 2010;5:499–508.

  20. 20.

    Jones PA. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet. 2012;13:484–92.

  21. 21.

    Rai K. A critical analysis of staging in CLL. In: Gale R, Rai K, editors. Chronic Lymphocyte Leukemia: Recent progress and future direction. New York, NY: Alan R Liss; 1987. p. 59–253.

  22. 22.

    Kutsch N, Bahlo J, Byrd J C, Dohner H, Eichhorst B, Else M, et al. The international prognostic index for patients with CLL (CLL-IPI): An international meta-analysis. Journal of Clinical Oncology, ASCO Annual Meeting Abstracts. 2015; 33, suppl; abstr 7002.

  23. 23.

    van Dongen JJ, Langerak AW, Brüggemann M, Evans PA, Hummel M, Lavender FL, et al. Design and standardization of PCR primers and protocols for detection of clonal immunoglobulin and T-cell receptor gene recombinations in suspect lymphoproliferations: report of the BIOMED-2 Concerted Action BMH4-CT98-3936. Leukemia. 2003;17:2257–317.

  24. 24.

    Provençal N, Suderman MJ, Guillemin C, Vitaro F, Côté SM, Hallett M, et al. Association of childhood chronic physical aggression with a DNA methylation signature in adult human T- cells. PLoS One. 2014;9, e89839.

  25. 25.

    Pei L, Choi J, Liu J, Lee E, McCarthy B, Wilson JM, et al. Genome-wide DNA methylation analysis reveals novel epigenetic changes in chronic lymphocytic leukemia. Epigenetics. 2012;7:567–78.

  26. 26.

    da Huang W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resource. Nat Protoc. 2009;4:44–57.

  27. 27.

    da Huang W, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37:1–13.

  28. 28.

    Kulis M, Heath S, Bibikova M, Queirós A, Navarro A, Clot G, et al. Epigenomic analysis detects widespread gene-body DNA hypomethylation in chronic lymphocytic leukemia. Nat Genet. 2012;44:1236–42.

  29. 29.

    Hill V, Hesson LB, Dansranjavin T, Dallol A, Bieche I, Vacher S, et al. Identification of 5 novel genes methylated in breast and other epithelial cancers. Mol Cancer. 2010;9:51.

  30. 30.

    Yu J, Ma X, Cheung KF, Li X, Tian L, Wang S, et al. Epigenetic inactivation of T-box transcription factor 5, a novel tumor suppressor gene, is associated with colon cancer. Oncogene. 2010;29:6464–74.

  31. 31.

    Xiao W, Ou C, Qin J, Xing F, Sun Y, Li Z, et al. CBX8, a novel DNA repair protein, promotes tumorigenesis in human esophageal carcinoma. Int J ClinExp Pathol. 2014;7:4817–26.

  32. 32.

    Teneng I, Tellez CS, Picchi MA, Klinge DM, Yingling CM, Snider AM, et al. Global identification of genes targeted by DNMT3b for epigenetic silencing in lung cancer. Oncogene. 2015;34:621–30.

  33. 33.

    Narayan G, Freddy A, Xie D, Liyanage H, Clark L, Kisselev S, et al. Promoter methylation-mediated inactivation of PCDH10 in acute lymphoblastic leukemia contributes to chemotherapy resistance. Genes Chromosomes Cancer. 2011;50:1043–53.

  34. 34.

    Hatzimichael E, Dasoula A, Dranitsaris G, Vassou A, Papoudou-Bai A, Stebbing J, et al. BIK (Bcl2-Interacting Killer) CpG methylation status as a potential predictive biomarker of relapsed/refractory multiple myeloma disease. Journal of Clinical Oncology, ASCO Annual Meeting Abstracts.2010; Vol 28, No 15 suppl (May 20 Supplement), 8118.

  35. 35.

    Sanchez A, Setien F, Martınez N, Oliva JL, Herranz M, Fraga MF, et al. Epigenetic inactivation of the ERK inhibitor Spry2 in B-cell diffuse lymphomas. Oncogene. 2008;27:4969–72.

  36. 36.

    Kandimalla R, van Tilborg AA, Kompier LC, Stumpel DJ, Stam RW, Bangma CH, et al. Genome-wide analysis of CpG island methylation in bladder cancer identified TBX2, TBX3, GATA2, and ZIC4 as pTa-Specific prognostic markers. Eur Urol. 2012;61:1245–56.

  37. 37.

    Beukers W, Kandimalla R, Masius R, Vermeij M, Kranse R, Leenders G, et al. Stratification based on methylation of TBX2 and TBX3 into three molecular grades predicts progression in patients with pTa-bladder cancer. Mod Pathol. 2015;28:515–22.

  38. 38.

    Yamamoto M, Cid E, Bru S, Yamamoto F. Rare and frequent promoter methylation, respectively, of TSHZ2 and 3 genes that are both downregulated in expression in breast and prostate cancers. PLoS One. 2011;6, e17149.

  39. 39.

    Dunwell T, Hesson L, Rauch TA, Wang L, Clark RE, Dallol A, et al. A genome-wide screen identifies frequently methylated genes in haematological and epithelial cancers. Mol Cancer. 2010;9:44.

  40. 40.

    Shukla A, Chaturvedi N, Ahrens A, Cutucache C, Mittal A, Bierman P, et al. Stromal tumor microenvironment in chronic lymphocytic leukemia: regulation of leukemic progression. J Leuk (Los Angel). 2013;1:113.

  41. 41.

    Musialik E, Bujko M, Kober P, Wypych A, Gawle-Krawczyk K, Matysiak M, et al. Promoter methylation and expression levels of selected hematopoietic genes in pediatric B-cell acute lymphoblastic leukemia. Blood Res. 2015;50:26–32.

  42. 42.

    Monteith GR, McAndrew D, Faddy HM, Roberts-Thomson SJ. Calcium and cancer: targeting Ca2+ transport. Nat Rev Cancer. 2007;7:519–30.

  43. 43.

    Monteith GR, Davis FM, Roberts-Thomson SJ. Calcium channels and pumps in cancer: changes and consequences. J Biol Chem. 2012;287:31666–73.

  44. 44.

    Muggen AF, Pillai SY, Kil LP, van Zelm MC, van Dongen JJ, Hendriks RW, et al. Basal Ca (2+) signalling is particularly increased in mutated chronic lymphocytic leukemia. Leukemia. 2015;29:321–8.

  45. 45.

    Chen SS, Claus R, Lucas DM, Yu L, Qian J, Ruppert AS, et al. Silencing of the inhibitor of DNA binding protein 4 (ID4) contributes to the pathogenesis of mouse and human CLL. Blood. 2011;117:862–71.

  46. 46.

    Grcević D, Marusić A, Grahovac B, Jaksić B, Kusec R. Expression of bone morphogenetic proteins in acute promyelocytic leukemia before and after combined all trans-retinoic acid and cytotoxic treatment. Leuk Res. 2003;27:731–8.

  47. 47.

    Dzietczenia J, Wróbel T, Jaźwiec B, Mazur G, Butrym A, Poręba R, et al. Expression of bone morphogenetic proteins (BMPs) receptors in patients with B-cell chronic lymphocytic leukemia (B-CLL). Int J Lab Hematol. 2010;32:e217–21.

  48. 48.

    Gilboa L, Nohe A, Geissendörfer T, Sebald W, Henis YI, Knaus P. Bone morphogenetic protein receptor complexes on the surface of live cells: a new oligomerization mode for serine/threonine kinase receptors. Mol Biol Cell. 2000;11:1023–35.

  49. 49.

    Chen D, Ji X, Harris MA, Feng JQ, Karsenty G, Celeste AJ, et al. Differential roles for bone morphogenetic protein (BMP) receptor type IB and IA in differentiation and specification of mesenchymal precursor cells to osteoblast and adipocyte lineages. J Cell Biol. 1998;142:295–305.

  50. 50.

    Hanoun M, Eisele L, Suzuki M, Greally J, Hu¨ttmann A, Aydin S, et al. Epigenetic silencing of the circadian clock gene CRY1 is associated with an indolent clinical course in chronic lymphocytic leukemia. PLoS One. 2012;7:e34347.

  51. 51.

    Yang MY, Chang JG, Lin PM, Tang KP, Chen YH, Lin HY, et al. Downregulation of circadian clock genes in chronic myeloid leukemia: alternative methylation pattern of hPER3. Cancer Sci. 2006;97:1298–307.

  52. 52.

    Liu Y, Miao Y, Wang J, Lin X, Wang L, Xu HT, et al. DEC1 is positively associated with the malignant phenotype of invasive breast cancers and negatively correlated with the expression of claudin-1. Int J Mol Med. 2013;31:855–60.

  53. 53.

    Lee JC, Sharma M, Lee YH, Lee NH, Kim SY, Yun JS, et al. PAX9 mediated cell survival in oral squamous carcinoma cell enhanced by c-myb. Cell Biochem Funct. 2008;26:892–9.

  54. 54.

    Wielscher M, Vierlinger K, Kegler U, Ziesche R, Gsur A, Weinhäusel A. Diagnostic performance of plasma DNA methylation profiles in lung cancer, pulmonary fibrosis and COPD. EBio Medicine. 2015;2:927–34.

  55. 55.

    Gorbacheva VY, Kondratov RV, Zhang R, Cherukuri S, Gudkov AV, Takahashi JS, et al. Circadian sensitivity to the chemotherapeutic agent cyclophosphamide depends on the functional status of the CLOCK/BMAL1 transactivation complex. Proc Natl Acad Sci U S A. 2005;102:3407–12.

  56. 56.

    Kondratov RV, Kondratova AA, Lee C, GorbachevaVY CMV, Antoch MP. Post-translational regulation of circadian transcriptional CLOCK(NPAS2)/BMAL1 complex by cryptochromes. Cell Cycle. 2006;5:890–5.

  57. 57.

    Gauger MA, Sancar A. Cryptochrome, circadian cycle, cell cycle checkpoints, and cancer. Cancer Res. 2005;65:6828–34.

Download references


We acknowledge Mr. Dhritiman Dan for the technical support.


The financial support was provided by the Department of Biotechnology (BT/PR11106/GBD/27/145/2008, BT/PR15438/MED/30/606/2011 and BT/PR8680/AGR/36/754/2013), Ministry of Science and Technology, Government of India; and All India Institute of Medical Sciences, New Delhi (8-60/A060/ 2011/RS) to RG for carrying out this research work.

Availability of data and materials

The DNA methylation as well as expression data generated in the study have been submitted to the NCBI Gene Expression Omnibus (GEO) ( under accession number GSE81937.

Authors’ contributions

LR and NM performed all the experiments under the guidance of RG, analyzed the experimental data, and wrote the manuscript; RG designed the study, analyzed clinical and experimental data, and wrote the manuscript; GK contributed in validation experiments and reviewed the revised manuscript; AG, LK and AS evaluated the clinical data, and reviewed the manuscript; DS analyzed the experimental data and reviewed the manuscript; JKD helped DS in analysis of experimental data. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

The study was approved by Institute Ethics Committee (AIIMS). All the patients were enrolled in the study after taking informed consent as per the guidelines of the institute ethics committee. The study numbers provided by the committee were: A-03/25.07.2007, IEC/NP-129/2011, IEC/NP-25/2012 and IEC/NP-424/2013.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Correspondence to Ritu Gupta.

Additional file

Additional file 1: Table S1.

List of primers used in RQ-PCR studies. Table S2A. List of probes hypermethylated in CLL in comparison to CD19+ cells from healthy individuals. Table S2B. List of probes hypomethylated in CLL in comparison to normal CD19+ cells. Table S3A. List of probes hypermethylated in unmutated in comparison to mutated CLL. Table S3B. List of probes hypomethylated in unmutated CLL in comparison to mutated CLL. Table S4. List of genes having negative correlation for methylation and gene expression in CLL as compared to normal 19+ cells. Table S5. List of genes having negative correlation for methylation and gene expression in unmutated CLL as compared to mutated CLL. (XLS 1339 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Rani, L., Mathur, N., Gupta, R. et al. Genome-wide DNA methylation profiling integrated with gene expression profiling identifies PAX9 as a novel prognostic marker in chronic lymphocytic leukemia. Clin Epigenet 9, 57 (2017).

Download citation


  • Promoter methylation
  • PAX9
  • Circadian rhythm
  • Transcription factors