Epigenome-wide analysis of T-cell large granular lymphocytic leukemia identifies BCL11B as a potential biomarker
Clinical Epigenetics volume 14, Article number: 148 (2022)
The molecular pathogenesis of T-cell large granular lymphocytic leukemia (T-LGLL), a mature T-cell leukemia arising commonly from T-cell receptor αβ-positive CD8+ memory cytotoxic T cells, is only partly understood. The role of deregulated methylation in T-LGLL is not well known. We analyzed the epigenetic profile of T-LGLL cells of 11 patients compared to their normal counterparts by array-based DNA methylation profiling. For identification of molecular events driving the pathogenesis of T-LGLL, we compared the differentially methylated loci between the T-LGLL cases and normal T cells with chromatin segmentation data of benign T cells from the BLUEPRINT project. Moreover, we analyzed gene expression data of T-LGLL and benign T cells and validated the results by pyrosequencing in an extended cohort of 17 patients, including five patients with sequential samples.
We identified dysregulation of DNA methylation associated with altered gene expression in T-LGLL. Since T-LGLL is a rare disease, the samples size is low. But as confirmed for each sample, hypermethylation of T-LGLL cells at various CpG sites located at enhancer regions is a hallmark of this disease. The interaction of BLC11B and C14orf64 as suggested by in silico data analysis could provide a novel pathogenetic mechanism that needs further experimental investigation.
DNA methylation is altered in T-LGLL cells compared to benign T cells. In particular, BCL11B is highly significant differentially methylated in T-LGLL cells. Although our results have to be validated in a larger patient cohort, BCL11B could be considered as a potential biomarker for this leukemia. In addition, altered gene expression and hypermethylation of enhancer regions could serve as potential mechanisms for treatment of this disease. Gene interactions of dysregulated genes, like BLC11B and C14orf64, may play an important role in pathogenic mechanisms and should be further analyzed.
T-LGLL is a mature T-cell leukemia arising commonly from T-cell receptor (TCR) αβ-positive CD3+ memory cytotoxic T cells (CD8-positive T cells) . Co-expression of CD57 is frequently observed . Patients can present with severe neutropenia and/or anemia with transfusion dependency. In neutropenic patients, life-threatening recurrent infections are observed. Morbidity and mortality are mostly due to the accompanying cytopenias . Lymphocyte infiltration of the liver, spleen, or the bone marrow is observed. Cytopenias are not mediated by bone marrow infiltration of leukemic cells but are related to immunological dysfunction . T-LGLL often arises in the context of autoimmune diseases, such as rheumatoid arthritis occurring in approximately one-third of patients with T-LGLL . The median age at diagnosis is 60 years and around 25% of adult patients are under 50 years old .
Many patients can be asymptomatic or exhibit an indolent clinical course. If treatment is necessary, immunosuppressive therapy is used as first-line treatment but does not represent a curative option. Allogeneic stem cell transplantation is currently the only curative option for patients not responding to conventional therapy . The median overall survival was reported to be around nine to ten years .
In T-LGLL patients, insufficient response to activation-induced cell death leading to efficient apoptosis is considered a major pathogenetic mechanism driving leukemogenesis. Various pro-apoptotic and pro-survival signaling pathways, including FAS/FASL, JAK2/STAT3, NF-κB, RAS/MEK/ERK, and PI3K/AKT pathways, are frequently deregulated in T-LGLL .
However, the molecular pathogenesis of T-LGLL is still only partly understood. Recurrent numeric and/or structural chromosomal imbalances are rare in T-LGLL . Only a few recurrent somatic mutations have been discovered. Activating somatic mutations in the SRC homology 2 (SH2) domain of the signal transducer and activator of transcription 3 (STAT3) gene have been identified in 23–40% of T-LGLL patients [11, 12]. In contrast to other mature T-cell lymphomas, mutations in the STAT5B gene have been reported only in 2% of T-LGLL cases . No other recurrent mutations were detected in an exome sequencing study of six T-LGLL cases . Recently, we identified recurrent non-synonymous alterations in the TNFAIP3 tumor suppressor gene in 8% of T-LGLL cases, likely contributing to deregulated NF-κB activity in this entity . Nevertheless, the genome of T-LGLL appears to be stable and the pathogenesis of this T-cell leukemia remains largely unknown.
Therefore, we aimed to characterize the epigenetic profile of T-LGLL by array-based DNA methylation profiling and compared it to that of non-neoplastic T cells of various maturation stages, e.g., naive, central memory, and effector memory T cells, to identify molecular events driving the pathogenesis of T-LGLL. To characterize which genomic regulatory elements show T-LGLL-specific changes in DNA methylation, we compared the differentially methylated loci between the T-LGLL cases and normal T cells with chromatin segmentation data of benign T cells from the BLUEPRINT project . Moreover, we analyzed gene expression data of T-LGLL and benign T cells and validated the results by pyrosequencing in an extended cohort, including five patients with sequential samples.
DNA methylation signatures segregate T-LGLL cells from their benign counterpart
Since the DNA methylation signature varies according to tissues and cell types [16, 17], identifying appropriate non-neoplastic samples for comparison is crucial for detecting de novo DNA methylation changes in tumor samples. We performed genome-wide DNA methylation profiling of FACS-sorted tumor cell subsets from eleven T-LGLL cases. Six subsets of T cells of healthy donors, including CD4+ naïve cells, CD4+ central memory and CD4+ effector memory cells, CD8+ naïve cells, CD8+ central memory, and CD8+ effector memory cells were used for the comparison to non-neoplastic human T-cell subsets. We compared the malignant T-LGLL samples to the mature T-cell subsets, since it is thought that the malignant T-LGLL cells arise from mature T cells. Unsupervised Principal Component Analysis (PCA) of all normal T-cell subsets and leukemia samples based on the 39,930 CpG loci with the highest variation in beta-values (sd/sdmax> = 0.4) clearly segregated the dataset into the different subsets of normal and malignant T cells (Fig. 1). Both principal components (PC1 (92% of variance) and PC2 (5% of variance)) separate T-LGLL samples from benign T cells and the T-LGLL cells cluster closest to CD8-positive effector and central memory T cells. Due to this pattern and previous literature about the cell of origin of T-LGLL cells, we decided to primarily compare the T-LGLL samples to CD8-positive memory T cells, which is the one also closest to the described cell of origin of T-LGLL . Thus, we compared the DNA methylation profile of 11 T-LGLL (for patient characteristics, see Additional file 1: Table S1) to eight non-neoplastic CD8-positive memory T-cell samples of healthy donors.
Two thousand two hundred and sixteen CpG probes showed an absolute differential methylation between T-LGLL and CD8-positive memory samples of over 25% (delta > 0.25). Of these, 1479 CpG loci were hypermethylated and 737 were hypomethylated in T-LGLL samples in comparison with the normal T cells (Fig. 2). None of them were probes interrogating Ch-loci (non-CpG loci). Differentially methylated positions (DMPs) were annotated with matching genomic features (Additional file 1: Table S9).
Identification of differentially methylated gene regions and candidate genes
We annotated the 2216 DMPs to identify candidate genes potentially involved in the pathobiology of T-LGLL. Then, we ranked the genes according to the proportion of significant DMPs for each gene, taking into account the methylation status (hyper or hypo) of CpGs. We found 267 genes hypermethylated and 90 genes hypomethylated in T-LGLL samples (Additional file 1: Tables S10 and S11). Among the hypermethylated genes were BCL11B, TTC39C, THEMIS, and NKX6-2 (Additional file 1: Table S10) and the hypomethylated genes included ZEB2, MAST3, RIN3, SLC15A4, and FAM53B (Additional file 1: Table S11).
We additionally performed a methylation differential analysis by region to find candidate genes. This has become of interest lately as it has been described to be associated with phenotypic changes in cancer [19, 20]. We found a total of 1096 differentially methylated regions (DMRs) with lengths from 3 to 5321 bp containing between 2 and 70 CpGs (Additional file 1: Table S12). Ordering DMRs by the mean difference of beta-values between the T-LGLL and normal T-cell samples, we found the most hypermethylated regions in the genes BCL11B, TNRC6B, TTC39C, THEMIS, and LYN; and the most hypomethylated regions in the genes MEF2A, RIN3, ZEB2, DENND3, RFC2, and RUNX3.
Teramo et al.  have reported SOCS3 being down-modulated in T-LGLL. We therefore tested methylation differences in CpG loci of the SOCS3 promoter. We did not observe significant differences between T-LGLL and healthy donor-derived CD8 memory cells, but in some cases a trend was visible for promoter CpG hypermethylation related with downregulated expression (Additional files 2 and 3: Fig. S2).
Functional enrichment analysis reveals enrichment in pathways related to immune response
We hypothesized that genes with significant changes in DNA methylation in promoters and enhancers will constitute most of the functional biological change associated with the CpG hyper- or hypomethylation. In order to find the most relevant functional changes, we performed a gene ontology analysis on genes hyper- and hypomethylated in promoters and enhancers. These genes were selected from hyper- and hypomethylated genes shown in Additional file 1: Tables S10 and S11, selecting those with UCSC_RefGene_Group assigned to TSS200 (200 bp upstream the transcription start site, TSS) or 5’UTR or assigned to Enhancer category in the 450 k annotation (Additional file 1: Tables S13–S16).
A statistical enrichment (p value < 0.01) was observed in promoters of hypermethylated genes in pathways related to the immune response, especially from T cells (Additional files 1 and 4: Table S15 and Fig. S3A). For hypomethylated genes, promoter pathways were also enriched in immune pathways, although more related to humoral responses (Additional files 1 and 5: Table S16 and Fig. S3B).
Mapping of differentially methylated CpGs and regions reveals enrichment of hypomethylated and hypermethylated loci in active enhancer sites
We mapped the differentially methylated CpGs from T-LGLL samples to the chromatin states determined in CD8-positive memory cells by the BLUEPRINT/IHEC Project (Additional file 1: Tables S7 and S17). T-LGLL hypomethylated and hypermethylated loci were enriched in active enhancer sites (“E9” state, 4.35- and 4.71-fold change, p = 2.52 × 10–65 and p = 1.99 × 10–152, respectively, Fig. 3A and 3B, Additional file 1: Table S13) defined by the activating marks H3K4me1 and H3K27Ac. A similar enrichment was found when considering significant CpGs in genes with hyper- or hypomethylated promoters/enhancers, as defined in the previous section (“E9” state, 2.99- and 4.80-fold change, p = 2 × 10–3 and p = 6.46 × 10–21, respectively, Additional file 1: Tables S15 and S16).
Gene expression analysis by qPCR reveals significant differences in candidate genes between T-LGLL and healthy donor-derived CD8+ T cells
We analyzed transcript levels of nine genes by qPCR to identify whether significantly differential methylation at CpG loci correlates with gene expression differences of candidate genes. GAPDH was used as a housekeeping gene (Additional file 1: Table S8). The selection of genes was based on (a) the most frequent differentially methylated genes and (b) genes of interest known to be relevant in pathways involved in T-cell development or lymphomagenesis.
In the group of hypermethylated genes, BCL11B, THEMIS, C14orf64 (alternative name LINC01550), and DNMT3A were analyzed using qPCR. All four genes exhibited a significant lower expression in T-LGLL compared to normal CD8 memory cells. For BCL11B expression differences were significant only for CD8 central memory cells. For the long intergenic non-protein coding (LINC) LINC01550, bulk CD8+ T cells were available for comparison. Gene expression of this gene in T-LGLL was significantly lower compared to the benign T cells (Fig. 4).
In the group of hypomethylated genes, ZEB2, IL10, MAST3, and PTEN were analyzed. ZEB2 and IL10 displayed a higher expression in T-LGLL cells than in normal CD8+ T cells (Fig. 4). MAST3 and PTEN did not show any significant differences in gene expression levels. We also studied IL6, as the corresponding protein is known to be highly expressed in T-LGLL. We could confirm significantly higher IL6 levels in T-LGLL compared to CD8-positive T cells from healthy donors (Additional file 6: Fig. S4).
In summary, we observed significant differences in the gene expression of hyper- and hypomethylated genes between T-LGLL cells and healthy donor-derived memory T cells.
BCL11B is strongly epigenetically changed in T-LGLL cells
Remarkably, among the top eight hypermethylated regions four were located in BCL11B (mean difference in beta-values of 0.78, 0.68, 0.52, 0.47) (Additional file 1: Table S10). 21 CpGs—from a total of 62 interrogated—annotated to this gene were found as DMPs with an FDR of ≤ 0.5%, all of them hypermethylated in T-LGLL samples (Additional file 1: Table S9). The four significant DMRs were located in the gene body and assigned as enhancers by ENCODE (Additional file 1: Tables S10 and S15), and these match enhancers in CD8-positive memory cells (Additional files 7 and 8: Fig. S5A and S5B). Furthermore, expression analysis using qPCR (Fig. 4) showed a significant decrease in BCL11B expression in T-LGLL samples compared to CD8-positive central memory T cells.
Hypermethylated CpGs in BCL11B and C14orf64 are epigenetic hallmarks of T-LGLL
Exploratory analysis in promoter capture Hi-C data to identify long-range interacting regions of 31,253 promoters in 17 human primary hematopoietic cell types from the BLUEPRINT/IHEC Project  showed a promoter—promoter interaction between BCL11B and C14orf64 in CD8-positive T cells (see Fig. 5). Interestingly, the long intergenic non-coding RNA C14orf64 shows a highly hypermethylated region in our analysis (mean difference in beta-values of 0.32) (Additional file 1: Table S9). BCL11B and C14orf64 both showed lower expression in T-LGLL. The close interaction in the nucleus of these genes in CD8-positive T cells and the apparent coordination of their regulation by methylation mechanisms in T-LGLL suggest a potentially relevant role for the pathogenesis and clinical behavior of T-LGLL cells.
To validate the results, we performed pyrosequencing on selected CpG loci of BCL11B and C14orf64 to estimate the potential of these genes as DNA methylation biomarkers (Additional file 1: Table S18). We selected six highly differentially methylated CpG sites located at enhancer sites of BCL11B and four CpG sites of C14orf64. DNA methylation levels of these ten CpGs were measured by BPS in all T-LGLL samples and three normal T-cell samples with sufficient material. High correlation was observed between pyrosequencing and methylation arrays as measured by bivariate correlation analysis (average Pearson’s coefficient of 0.95, p < 0.001) for each CpG site (Additional files 1 and 9: Table S6 and Fig. S1). Changes of DNA methylation level at these ten CpG sites (Additional file 1: Table S5) were evaluated during the course of disease with and without treatment in five T-LGLL patients. We applied the BPS analysis of the ten CpG sites to samples obtained at three to six different time points. One out of five patients (patient 2) achieved a complete remission while the other patients presented active disease during the follow-up period and had high methylation values in the ten CpGs assayed (Additional file 1: Table S19). Interestingly, the only patient exhibiting a complete response showed a decrease in the methylation values after initiation of therapy, in line with response to therapy (Fig. 6).
The cell of origin in CD8-expressing T-LGLL is assumed to be a terminal effector-type cytotoxic T cell . As analyzed by transcriptome sequencing and gene expression arrays, these cells are characterized by down-modulation of CD28 and overexpression of perforin, granzymes, and CD57, which is in line with their cytotoxic phenotype . Our PCA of array-based DNA methylation profiling of CD8-positive T-LGLL samples and healthy donor-derived T cell subsets revealed the closest similarity between T-LGLL and CD8+ memory T cells. Based on these methylation analyses, our results confirm this subset as the non-neoplastic counterpart as described before by transcriptomic analyses . DNA methylation analyses did not allow further differentiation between CD8 central memory and/or effector memory T cells, because PCA revealed a homogeneous clustering of these two subsets and differential methylation analyses of these subsets did not reveal any significant differences.
Only few publications report on methylation changes in human T-LGLL  and no comprehensive methylation profiling has been reported until now. Mapping of differentially methylated CpGs between T-LGLL and the non-neoplastic counterpart revealed major differences in methylation profiles, pointing to an important role for aberrant methylation in this disease.
In the group of differentially hypermethylated genes, the genes BCL11B and THEMIS were among the genes with the highest ratio of differentially methylated CpGs. The BAF chromatin remodeling complex subunit BCL11B gene is located on chromosome 14q32.2. The encoded protein, B-cell lymphoma/leukemia 11B, functions as a C2H2 zinc finger transcription factor in early T cell development . Its specific role in human mature T cells is not fully understood and needs further investigation. It is known to act as a tumor suppressor gene in various malignancies, while in some cases it has been reported to have oncogenic features [23, 24]. Low expression of BCL11B has been associated with poor prognosis in T-acute lymphoblastic leukemia . Hypermethylated BCL11B showed significantly lower mRNA expression in T-LGLL compared to normal CD8+ T cells, likely contributing to disease promotion.
THEMIS, Thymocyte Selection Associated, located on chromosome 6q22.33, encodes a protein that regulates T cell selection during late thymocyte development and is necessary for lineage commitment and maturation of T cells. More specifically, in mice Themis controls lineage selection of double positive thymocytes into either CD4- or CD8-positive T cells . Post-selection deletion of Themis reduces the homeostatic response of peripheral CD8-positive T cells to self-peptide–major histocompatibility complex in mice . Its role in T-cell leukemia/ lymphoma is largely unknown, but THEMIS was recently shown to be downregulated by promoter methylation in HTLV-1-infected CD4-positive cells of adult T-cell leukemia . Importantly, we also observed hypermethylation of THEMIS in T-LGLL resulting in its downregulation at the transcriptomic level. In aggregate, these findings suggest that epigenetic deregulation of THEMIS may play a common role in the molecular pathogenesis of different T-cell neoplasms and thus deserves further investigation. Furthermore, our data provide a rationale for testing hypomethylating agents in T-LGLL.
Analyzing the most significantly hypomethylated genes, IL10 and ZEB2 were of particular interest. IL10, produced by monocytes and lymphocytes, is involved in the regulation of the JAK/STAT pathway. Binding of IL10 to its receptor leads to phosphorylation of STAT3 and hence activates the JAK/STAT pathway . IL10 signaling is known to induce SOCS3, a negative regulator of the JAK/STAT pathway [30, 31]. SOCS3 is known to be epigenetically silenced in JAK/STAT-dependent tumors [32, 33] and can be induced by IL6, for which very high levels in T-LGLL are described . Teramo et al. have reported SOCS3 being unresponsive to IL6 triggering in T-LGLL. IL6 mediated SOCS3 expression could be restored by application of demethylation agents leading to apoptosis of T-LGLL cells . Testing of methylation differences in CpGs of the SOCS3 promoter revealed no significant differences between T-LGLL and healthy donor-derived CD8 memory cells, although variability between subjects can be observed (Additional files 2 and 3: Fig. S2). This can be explained by using different methods and statistics in the methylation analyses that tend to homogenize differences between groups hiding variability between subjects. The role of SOCS3 methylation as a hallmark for treatment personalization should be analyzed in further studies. In line with hypomethylation of IL10 in T-LGLL, we identified a significant overexpression of IL10 at the mRNA level compared to healthy donor-derived CD8 memory cells, pointing to a further potentially novel mechanism of JAK/STAT-activation in T-LGLL.
ZEB2, Zinc Finger E-Box Binding Homeobox 2, among others, regulates the terminal differentiation of CD8-positive cytotoxic T cells . It is overexpressed in several tumor entities including T-cell malignancies where it functions as an oncogene [35, 36]. We confirmed overexpression of ZEB2 by qRT-PCR in T-LGLL samples pointing to a potential oncogenic role of this gene also in T-LGLL.
Serial analyses of BCL11B methylation in individual patients suggest that methylation changes in T-LGLL appear to be stable over time. Disappearance of clonal T-LGLL cells in response to successful treatment with immunosuppressive agents resulted in the normalization of methylation states (Fig. 6). Together, this observation suggests that hypermethylation of BCL11B may be considered a novel molecular hallmark of T-LGLL and corresponds to the tumor cell content. To confirm, that monitoring of BCL11B methylation could be used both as a marker in the diagnostic process and for monitoring minimal residual disease, the validation in a larger cohort with sequential sampling of T-LGLL patients is needed.
Furthermore, our data in combination with in vitro studies reported by Teramo and colleagues  provide a rationale for the development of novel epigenetic treatment modalities in T-LGLL employing demethylating agents.
Array-based analysis of the overall methylation profile of T-LGLL samples in comparison to healthy donor-derived CD8-positive T cells showed that differentially methylated loci were enriched in enhancer sites whereas hypomethylation was not present in repressed polycomb promoter regions. These results suggest that regulation of gene expression by DNA methylation changes is acting through the modulation of enhancers. Compared to benign CD8-positive T cells, heterochromatin-typical H3K27me3 states are enriched in T-LGLL with hypermethylated regions and depleted in T-LGLL with hypomethylated regions including promoters. H3K27me3-rich genomic regions can function as silencers to repress gene expression via chromatin interactions, e.g., proximity or looping . H3K27me3 is known to be accumulated in promoters of many cancer types, where it is associated with gene repression  and a poor clinical outcome .
Methylation patterns of T-LGLL patients showed a regulation of BCL11B expression most likely through methylation of the gene. qPCR revealed low expression of this gene in T-LGLL compared to its non-neoplastic counterpart. Low BCL11B expression was an independent indicator for shorter overall survival and time to recurrence for patients with hepatocellular carcinoma . BCL11B has been confirmed as a tumor suppressor in hepatocellular carcinoma with inhibitory effects on proliferation, cell cycle progression, apoptosis, and mobility . In our analysis, aberrantly methylated CpGs were not located in the promoter but in the gene body in introns 2 and 3, a region in which an enhancer was identified based on chromatin annotation data from CD8 memory cells. Previously published data on murine Bcl11b studies have revealed DMRs in the 5′-flanking region, identified as both T cell-specific enhancer and silencer. Introns 1 and 3 were supposed to contain cis-regulatory elements , underlining the role of intronic regulatory elements in the regulation of this gene.
Identification of long-range interacting regions by promoter capture Hi-C data from the BLUEPRINT/IHEC Project  on CD8-positive T cells showed a promoter—promoter interaction between the genomic regions of BCL11B and C14orf64 (LINC01550) (Fig. 5). C14orf64 is a long intergenic non-protein coding RNA, the overexpression of which is known to be associated with poor survival in lung adenocarcinoma patients . By contrast, elevation of LINC01550 induces apoptosis and cell cycle arrest leading to a better outcome of patients with malignant melanoma . We identified a highly hypermethylated region in this LINC RNA (Additional file 1: Table S6). Visualization of the 3D chromatin contacts displayed as a network revealed an interaction of the promoters of BCL11B and LINC01550 in the nucleus of CD8-positive cells. We observed a high correlation between both genes based on expression in many cell types (Additional file 10: Fig. S6). The apparent coordination of their expression regulated by methylation mechanisms in T-LGLL suggests a relevant role for the pathogenesis and clinical behavior of the disease.
We identified dysregulation of DNA methylation associated with altered gene expression in T-LGLL. Since T-LGLL is a rare disease, the samples size is low and monitoring of BCL11B methylation in more sequential T-LGLL patient samples is needed. Hypermethylation of T-LGLL cells at various CpG sites located at enhancer regions is a hallmark of this disease. The interaction of BLC11B and C14orf64 as suggested by in silico data analysis could provide a novel pathogenetic mechanism that needs further experimental investigation.
Material and methods
Patient cohort and non-neoplastic human T-cell samples
Seventeen T-LGLL patients with high-quality DNA extracted from sorted tumor cells were included in this study. The mean tumor cell content after tumor cell enrichment was 96% (range 95–98%) as determined by FACS. For eleven T-LGLL cases, genome-wide DNA methylation profiling was performed at diagnosis. Results were confirmed by pyrosequencing in all 17 patients; for five patients, sequential samples were available during their clinical course. Patient characteristics and sample overview are described in Additional file 1: Tables S1 and S3. Gene expression analysis was performed on leukemic cells of eleven patients, depending on material availability. Samples of 16 healthy donors were included for comparison, six of them in DNA methylation profiling, three in pyrosequencing, and ten in the gene expression analysis. Non-neoplastic TCR αβ+ T cell subsets were isolated from healthy donors, matching age and gender. Healthy blood donors of benign T cells sorted for methylation array analysis were anonymized. For two healthy donors, only bulk CD8 T cells were available for gene expression analysis. Cell isolation information for non-neoplastic samples and a sample overview are described in Additional file 1: Tables S2 and S3.
The diagnosis of T-LGLL was based on (a) flow cytometry analyses from peripheral blood (PB) showing > 400/µl of CD3+CD8+CD57+ cells, and (b) evidence of clonal TCR gene rearrangements.
Sample preparation and sorting of T cell subsets
PB mononuclear cells were isolated by Pancoll density centrifugation (PAN-Biotech, Aidenbach, Germany). Leukemic cells were isolated as CD3+CD8+CD57+ T cells by fluorescence-activated cell sorting (FACS, ARIA III, BD Biosciences; Heidelberg, Germany). T cells from healthy donors were initially enriched (> 98% purity) for either CD4+ or CD8+ T cells by magnetic cell separation using the MACS system (Miltenyi Biotec, Bergisch Gladbach, Germany). Afterward, the T cell subsets from healthy donors were isolated by FACS using the following strategies: CD8 naïve T cells: CD8+CD45RA+CD27+, CD8 central memory T cells: CD8+CD45RA−CD27+, CD8 effector memory T cells: CD8+CD45RA−CD27−, CD4 naïve T cells: CD4+CD45RA+CD27+, CD4 central memory T cells: CD4+CD45RA−CD27+, and CD4 effector memory T cells: CD4+CD45RA−CD27−. FACS confirmed a 90–99% purity of the T cell subsets. The following antibodies were used: anti-CD3-PerCP, anti-CD4-PE, anti-CD8-APC-Cy7, anti-CD27-APC, and anti-CD45RA-FITC (all Miltenyi Biotech). Genomic DNA was extracted from all samples using the DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany).
Analysis of differential gene expression by reverse transcription real-time PCR (qPCR) and Pyrosequencing is described in Additional file 11.
Array-based DNA methylation analysis
Bisulfite conversion was performed using the EZ-DNA Methylation Kit® (Zymo Research Corporation, Irvine, CA, USA), followed by DNA methylation analysis with the Infinium HumanMethylation450 BeadChip (Illumina, Inc., San Diego, CA, USA). All arrays were successfully scanned using the Illumina iScanTMSystem (more detailed description in: . To identify suitable non-neoplastic samples for eleven T-LGLL samples, we included 24 T cell samples sorted from PB of six healthy blood donors. These included: CD4+ naïve T cells (CD4nai, n = 5), CD4+ central memory T cells (CD4 + cm, n = 5), CD4+ effector memory T cells (CD4 + em, n = 2), CD8+ naïve T cells (CD8-positive nai, n = 4), CD8+ central memory T cells (CD8-positive cm, n = 6), and CD8+ effector memory T cells (CD8-positive em, n = 2). After normalization and filtering of poor-quality probes, multi-mappers, probes in sex chromosomes, and single nucleotide polymorphisms, beta-values of 401,778 CpG sites were subjected to differential analysis between CD8-positive memory and T-LGLL samples, resulting in 11,375 differentially methylated loci (adjusted p value < 0.005). For statistical analyses of the DNA methylation data and Functional enrichment analysis, see Additional file 11.
Annotation of chromatin segmentation
All CpG sites that passed the filter for analysis (401,778) were annotated according to the chromatin state categories defined for effector memory CD8-positive, alpha–beta T cell data from the BLUEPRINT project (sample code C003UQ; Venous blood, male) (http://ftp.ebi.ac.uk/pub/databases/blueprint/). For that purpose, genome segmentation by chromatin states was performed in this sample. Briefly, chromatin immunoprecipitation (ChIP)-seq data for H3K4me3, H3K4me1, H3K36me3, H3K27Ac, H3K27me3, and H3K9me3 from samples were used to segment the genome of the respective cell types into defined chromatin states. Segmentation was performed with the ChromHMM package (v1.10; ) using an 11 state combinatorial chromatin model (Additional file 1: Table S7) (defined by the BLUEPRINT consortium for 20,150,128 release:
ftp://ftp.ebi.ac.uk/pub/databases/blueprint/paper_data_sets/chromatin_states_carrillo_build37). Each CpG position was located within a region with a specific chromatin state assigned. Significant differentially methylated CpGs were compared to all background CpGs in the different chromatin states. To calculate enrichment of the significant CpG dataset (hyper- or hypo-) for each chromatin state, we performed a hypergeometric test using the base package in R, considering significant enrichments with p value < 0.01.
Promoter capture Hi-C sequencing data
CD8-positive T cell data from promoter capture Hi-C experiments were obtained from Javierre et al. , details on data processing and analysis can be accessed in the material and methods section. Only those interactions with a threshold above five from their analysis were considered, following the authors’ recommendation. GARDEN-NET was used to visualize BCL11B long-range interactions for total CD8-positive T cells .
Availability of data and materials
The data are available from the Gene Expression Omnibus (TBD) in a MIAME compliant format.
Differentially methylated positions
Differentially methylated regions
Long intergenic non-protein coding
Principal component analysis
Signal transducer and activator of transcription 3
T-cell large granular lymphocytic leukemia
Transcription start site
Bigouret V, Hoffmann T, Arlettaz L, Villard J, Colonna M, Ticheli A, et al. Monoclonal T-cell expansions in asymptomatic individuals and in patients with large granular leukemia consist of cytotoxic effector T cells expressing the activating CD94:NKG2C/E and NKD2D killer cell receptors. Blood. 2003;101:3198–204.
Lamy L. Large granular lymphocyte leukemia. Cancer Control. 1998;5:25–33.
Viny AD, Lichtin A, Pohlman B, Loughran T, Maciejewski J. Chronic B-cell dyscrasias are an important clinical feature of T-LGL leukemia. Leuk Lymphoma. 2008;49:932–8.
Sun H, Wei S, Yang L. Dysfunction of immune system in the development of large granular lymphocyte leukemia. Hematology. 2019;24:139–47.
Lamy T, Loughran TP. How I treat LGL leukemia. Blood. 2011;117:2764–74.
Cheon H, Dziewulska KH, Moosic KB, Olson KC, Gru AA, Feith DJ, Loughran TP. Advances in the diagnosis and treatment of large granular lymphocytic leukemia. Curr Hematol Malig Rep. 2020;15:103–12.
Loughran TP, Zickl L, Olson TL, Wang V, Zhang D, Rajala HLM, et al. Immunosuppressive therapy of LGL leukemia: prospective multicenter phase II study by the eastern cooperative oncology group (E5998). Leukemia. 2015;29:886–94.
Shah MV, Hook CC, Call TG, Go RS. A population-based study of large granular lymphocyte leukemia. Blood Cancer J. 2016;6:e455–e455.
Steinway SN, LeBlanc F, Loughran TP. The pathogenesis and treatment of large granular lymphocyte leukemia. Blood Rev. 2014;28:87–94.
Wong KF, Chan JCW, Liu HSY, Man C, Kwong YL. Chromosomal abnormalities in T-cell large granular lymphocyte leukaemia: report of two cases and review of the literature. Br J Haematol. 2002;116:598–600.
Johansson P, Bergmann A, Rahmann S, Wohlers I, Scholtysik R, Przekopowitz M, et al. Recurrent alterations of TNFAIP3 (A20) in T-cell large granular lymphocytic leukemia. Int J Cancer. 2016;138:121–4.
Jerez A, Clemente MJ, Makishima H, Koskela H, LeBlanc F, Peng Ng K, et al. STAT3 mutations unify the pathogenesis of chronic lymphoproliferative disorders of NK cells and T-cell large granular lymphocyte leukemia. Blood. 2012;120:3048–57.
Rajala HLM, Eldfors S, Kuusanmäki H, van Adrichem AJ, Olson T, Lagström S, et al. Discovery of somatic STAT5b mutations in large granular lymphocytic leukemia. Blood. 2013;121:4541–50.
Andersson EI, Rajala HLM, Eldfors S, Ellonen P, Olson T, Jerez A, et al. Novel somatic mutations in large granular lymphocytic leukemia affecting the STAT-pathway and T-cell activation. Blood Cancer J. 2013;3:e168–e168.
Martens JHA, Stunnenberg HG. BLUEPRINT: mapping human blood cell epigenomes. Haematologica. 2013;98:1487–9.
Baron U, Türbachova I, Hellwag A, Eckhardt F, Berlin K, Hoffmuller U, et al. DNA methylation analysis as a tool for cell typing. Epigenetics. 2006;1:55–60.
Martín-Subero JI, Kreuz M, Bibikova M, Bentink S, Ammerpohl O, Wickham-Garcia E, et al. New insights into the biology and origin of mature aggressive B-cell lymphomas by combined epigenomic, genomic, and transcriptional profiling. Blood. 2009;113:2488–97.
Wlodarski MW, Nearman Z, Jankowska A, Babel N, Powers J, Leahy P, et al. Phenotypic differences between healthy effector CTL and leukemic LGL cells support the notion of antigen-triggered clonal transformation in T-LGL leukemia. J Leukoc Biol. 2008;83:589–601.
Rakyan VK, Down TA, Balding DJ, Beck S. Epigenome-wide association studies for common human diseases. Nat Rev Genet. 2011;12:529–41.
Irizarry RA, Ladd-Acosta C, Wen B, Wu Z, Montano C, Onyango P, et al. The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores. Nat Genet. 2009;41:178–86.
Teramo A, Gattazzo C, Passeri F, Lico A, Tasca G, Cabrelle A, et al. Intrinsic and extrinsic mechanisms contribute to maintain the JAK/STAT pathway aberrantly activated in T-type large granular lymphocyte leukemia. Blood. 2013;121(3843–54):S1.
Ha VL, Luong A, Li F, Casero D, Malvar J, Kim YM, et al. The T-ALL related gene BCL11B regulates the initial stages of human T-cell differentiation. Leukemia. 2017;31:2503–14.
Wiles ET, Lui-Sargent B, Bell R, Lessnick SL. BCL11B is up-regulated by EWS/FLI and contributes to the transformed phenotype in Ewing sarcoma. PLoS ONE. 2013;8: e59369.
Abbas S, Sanders MA, Zeilemaker A, Geertsma-Kleinekoort WMC, Koenders JE, Kavelaars FG, et al. Integrated genome-wide genotyping and gene expression profiling reveals BCL11B as a putative oncogene in acute myeloid leukemia with 14q32 aberrations. Haematologica. 2014;99:848–57.
Bartram I, Gökbuget N, Schlee C, Heesch S, Fransecky L, Schwartz S, et al. Low expression of T-cell transcription factor BCL11b predicts inferior survival in adult standard risk T-cell acute lymphoblastic leukemia patients. J Hematol Oncol. 2014;15(7):51.
Choi S, Cornall R, Lesourne R, Love PE. THEMIS: two models, different thresholds. Trends Immunol. 2017;38:622–32.
Brzostek J, Gautam N, Zhao X, Chen EW, Mehta M, Tung DWH, et al. T cell receptor and cytokine signal integration in CD8+ T cells is mediated by the protein Themis. Nat Immunol. 2020;21:186–98.
Watanabe T, Yamashita S, Ureshino H, Kamachi K, Kurahashi Y, Fukuda-Kurahashi Y, et al. Targeting aberrant DNA hypermethylation as a driver of ATL leukemogenesis by using the new oral demethylating agent OR-2100. Blood. 2020;136:871–84.
Carey AJ, Tan CK, Ulett GC. Infection-induced IL-10 and JAK-STAT: a review of the molecular circuitry controlling immune hyperactivity in response to pathogenic microbes. JAKSTAT. 2012;1:159–67.
Cacalano NA, Sanden D, Johnston JA. Tyrosine-phosphorylated SOCS-3 inhibits STAT activation but binds to p120 RasGAP and activates Ras. Nature Cell Biol. 2001;3:460–5.
Cassatella MA, Gasperini S, Bovolenta C, Calzetti F, Vollebregt M, Scapini P, et al. Interleukin-10 (IL-10) selectively enhances CIS3/SOCS3 mRNA expression in human neutrophils: evidence for an IL-10-induced pathway that is independent of STAT protein activation. Blood. 1999;94:2880–9.
Niwa Y, Kanda H, Shikauchi Y, Saiura A, Matsubara K, Kitagawa T, et al. Methylation silencing of SOCS-3 promotes cell growth and migration by enhancing JAK/STAT and FAK signalings in human hepatocellular carcinoma. Oncogene. 2005;24:6406–17.
Isomoto H, Mott JL, Kobayashi S, Werneburg NW, Bronk SF, Haan S, Gores GJ. Sustained IL-6/STAT-3 signaling in cholangiocarcinoma cells due to SOCS-3 epigenetic silencing. Gastroenterology. 2007;132:384–96.
Omilusik KD, Best JA, Yu B, Goossens S, Weidemann A, Nguyen JV, et al. Transcriptional repressor ZEB2 promotes terminal differentiation of CD8+ effector and memory T cell populations during infection. J Exp Med. 2015;212:2027–39.
van Vlierberghe P, Ferrando A. The molecular basis of T cell acute lymphoblastic leukemia. J Clin Invest. 2012;122:3398–406.
Durinck K, Goossens S, Peirs S, Wallaert A, van Loocke W, Matthijssens F, et al. Novel biological insights in T-cell acute lymphoblastic leukemia. Exp Hematol. 2015;43:625–39.
Cai Y, Zhang Y, Loh YP, Tng JQ, Lim MC, Cao Z, et al. H3K27me3-rich genomic regions can function as silencers to repress gene expression via chromatin interactions. Nat Commun. 2021;12:719.
Fujikawa D, Nakagawa S, Hori M, Kurokawa N, Soejima A, Nakano K, et al. Polycomb-dependent epigenetic landscape in adult T-cell leukemia. Blood. 2016;127:1790–802.
Yamagishi M, Hori M, Fujikawa D, Ohsugi T, Honma D, Adachi N, et al. Targeting excessive EZH1 and EZH2 activities for abnormal histone methylation and transcription network in malignant lymphomas. Cell Rep. 2019;29:2321-2337.e7.
Yang W-J, Sun Y-F, Jin A-L, Lv L-H, Zhu J, Wang B-L, et al. BCL11B suppresses tumor progression and stem cell traits in hepatocellular carcinoma by restoring p53 signaling activity. Cell Death Dis. 2020;11:895.
Li L, Zhang JA, Dose M, Kueh HY, Mosadeghi R, Gounari F, Rothenberg EV. A far downstream enhancer for murine Bcl11b controls its T-cell specific expression. Blood. 2013;122:902–11.
Shih J-H, Chen H-Y, Lin S-C, Yeh Y-C, Shen R, Lang Y-D, et al. Integrative analyses of noncoding RNAs reveal the potential mechanisms augmenting tumor malignancy in lung adenocarcinoma. Nucleic Acids Res. 2020;48:1175–91.
Chen J, Li P, Chen Z, Wang S, Tang S, Chen X, et al. Elevated LINC01550 induces the apoptosis and cell cycle arrest of melanoma. Med Oncol. 2021;38:1–9.
Bergmann AK, Castellano G, Alten J, Ammerpohl O, Kolarova J, Nordlund J, et al. DNA methylation profiling of pediatric B-cell lymphoblastic leukemia with KMT2A rearrangement identifies hypomethylation at enhancer sites. Pediatr Blood Cancer. 2017;64(3):e26251.
Ernst J, Kellis M. ChromHMM: automating chromatin-state discovery and characterization. Nat Methods. 2012;9:215–6.
Javierre BM, Burren OS, Wilder SP, Kreuzhuber R, Hill SM, Sewitz S, et al. Lineage-specific genome architecture links enhancers and non-coding disease variants to target gene promoters. Cell. 2016;167:1369-1384.e19.
Madrid-Mencía M, Raineri E, Cao TBN, Pancaldi V. Using GARDEN-NET and ChAseR to explore human haematopoietic 3D chromatin interaction networks. Nucleic Acids Res. 2020;48:4066–80.
We are grateful to Michael Möllmann. Parts of this work were performed at the Imaging Center Essen, a Service Core Facility of the Faculty of Medicine of the University Duisburg-Essen, Germany. We thank Ludger Klein-Hitpass and René Scholtysik (Genomics & Transcriptomics Facility (GTF), Faculty of Medicine of the University Duisburg-Essen, Germany). Data generated in the context of the BLUEPRINT Consortium (European Union’s Seventh Framework Program) were included; we thank the members of the Consortium for generating these and making them publicly available. The authors thank all patients and healthy donors who participated in this study.
Open Access funding enabled and organized by Projekt DEAL. PJ was funded by IFORES and the Dr. Werner Jackstädt Stiftung. TL and ECdSP received funding from the Spanish Ministry of Science (Plan Nacional I + D+i PID2019-110183RB-C21); RK was supported by the DFG (KU1315/9-2).
Ethics approval and consent to participate
The study was approved by the institutional review board of the University of Duisburg-Essen (reference number 10-4573) and the review board of the University of Kiel (reference number D531/17).
Consent for publication
The authors declare no competing financial interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised: The duplicated image in Figure 5 has been corrected.
T-LGL patient data. Table S2. Benign samples. Table S3. Sample overview. Table S4. Primer sequences and CpGs used in pyrosequencing. Table S5. CpGs validated by pyrosequencing. Table S6. Pearson correlation of results of bisulfite sequencing and methylation array data. Table S7. Chromatin states—description. Table S8. Taqman probes used in qPCR. Table S9. Differentially methylated probes and genomic features. Table S10. Differentially hypermethylated genes. Table S11. Differentially hypomethylated genes. Table S12. Differentially methylated regions. Table S13. Gene ontology analysis for TSS200 or 5’UTR of hypermethylated genes identified in Table S10. Table S14. Gene ontology analysis for TSS200 or 5’UTR of hypomethylated genes identified in Table S11. Table S15. Gene ontology analysis for enhancers of hypermethylated genes identified in Table S10. Table S16. Gene ontology analysis for enhancers of hypomethylated genes identified in Table S11. Table S17. Chromatin states. Table S18. Pyrosequencing—raw data. Table S19. Sequential Samples—raw data.
Differential methylation of CpG loci in the SOCS3 promoter in T-LGL samples. Comparison of CpG methylation (beta-value) for CpGs in the SOCS3 promoter between CD8+ memory cells (CD8mem) and T-LGLL samples (LGL). On top, adjusted p val of differential methylation analysis (dmpFDR, top left) and adjusted p value of differential variability analysis (dmVar, top right).
Differential methylation of CpG loci in the SOCS3 promoter in T-LGL samples. Comparison of CpG methylation (beta-value) for CpGs in the SOCS3 promoter between CD8+ memory cells (CD8mem) and T-LGLL samples (LGL). On top, adjusted p val of differential methylation analysis (dmpFDR, top left) and adjusted p value of differential variability analysis (dmVar, top right).
Gene Ontology analysis of genes hyper- (A) and hypomethylated (B) of T-LGL patients. Significant Biological processes (GO database) enriched in genes associated with significantly differentially methylated CpG loci in T-LGL. Enrichment represented as odds ratio. Point size represents the gene count of each pathway. Enrichment p value obtained by overrepresentation analysis , represented by point color. A Gene Ontology analysis of hypermethylated genes in T-LGL.
Gene Ontology analysis of genes hyper- (A) and hypomethylated (B) of T-LGL patients. Significant Biological processes (GO database) enriched in genes associated with significantly differentially methylated CpG loci in T-LGL. Enrichment represented as odds ratio. Point size represents the gene count of each pathway. Enrichment p value obtained by overrepresentation analysis , represented by point color. B Gene Ontology analysis of hypomethylated genes in T-LGL.
Differential gene expression of IL6 between T-LGL and healthy donor-derived CD8+ memory T cells. Differential gene expression for IL6 was measured by qPCR. Bulk CD8+ cells from two healthy donors were used for comparison. In line with previous publications, the T-LGLL cohort analyzed exhibits a higher IL6 expression compared to healthy donor-derived C8+ cells. HD Healthy donor.
(A and B): Location of differentially methylated CpG loci in the genes BCL11B and C14orf64 (LINC01550). A Significant differentially methylated CpGs in BCL11B (T-LGLL compared to CD8+. memory T cells) were located in the gene body and assigned as enhancers by ENCODE, which match as enhancers in CD8-positive memory cells.
(A and B): Location of differentially methylated CpG loci in the genes BCL11B and C14orf64 (LINC01550). B Significant differentially methylated CpGs in C14orf64 (LINC01550) (T-LGL compared to CD8 pos. memory T cells).
Correlation of bisulfite Pyrosequencing (BPS) and methylation Array DNA methylation levels. The correlation matrix shows the Pearson correlation coefficient (r: 1 (red) to − 1 (blue) among all CpG loci analyzed by BPS. The candidate genes LINC01550 and BCL11B contained multiple CpG sites. Columns and rows represent one CpG loci of the listed candidate gene.
Expression correlation between BCL11B & C14ORF64 (LINC01550). Expression correlation between BCL11B & C14ORF64 (LINC01550) in 426 human datasets with 42563 samples from R2: Genomics analysis and visualization platform (https://hgserver1.amc.nl/cgi-bin/r2/main.cgi).
Material and Methods.
About this article
Cite this article
Johansson, P., Laguna, T., Ossowski, J. et al. Epigenome-wide analysis of T-cell large granular lymphocytic leukemia identifies BCL11B as a potential biomarker. Clin Epigenet 14, 148 (2022). https://doi.org/10.1186/s13148-022-01362-z
- Large granular lymphocytic leukemia
- DNA methylation