- Open Access
Differential methylation at MHC in CD4+ T cells is associated with multiple sclerosis independently of HLA-DRB1
Clinical Epigeneticsvolume 9, Article number: 71 (2017)
Although many genetic variants have been associated with multiple sclerosis (MS) risk, they do not explain all the disease risk and there remains uncertainty as to how these variants contribute to disease. DNA methylation is an epigenetic mechanism that can influence gene expression and has the potential to mediate the effects of environmental factors on MS. In a previous study, we found a differentially methylation region (DMR) at MHC HLA-DRB1 that was associated within relapsing-remitting MS (RRMS) patients in CD4+ T cells. This study aimed to confirm this earlier finding in an independent RRMS cohort of treatment-naïve female patients.
Total genomic DNA was extracted from CD4+ T cells of 28 female RRMS and 22 age-matched healthy controls subjects. DNA was bisulfite-converted and hybridised to Illumina 450K arrays. Beta values for all CpGs were analysed using the DMPFinder function in the MINFI program, and a follow-up prioritisation process was applied to identify the most robust MS-associated DMRs.
This study confirmed our previous findings of a hypomethylated DMR at HLA-DRB1 and a hypermethylated DMR at HLA-DRB5 in this RRMS patient cohort. In addition, we identified a large independent DMR at MHC, whereby 11 CpGs in RNF39 were hypermethylated in MS cases compared to controls (max. ∆beta = 0.19, P = 2.1 × 10−4). We did not find evidence that SNP genotype was influencing the DMR in this cohort. A smaller MHC DMR was also identified at HCG4B, and two non-MHC DMRs at PM20D1 on chr1 and ERICH1 on chr8 were also identified.
The findings from this study confirm our previous results of a DMR at HLA-DRB1 and also suggest hypermethylation in an independent MHC locus, RNF39, is associated with MS. Taken together, our results highlight the importance of epigenetic factors at the MHC locus in MS independent of treatment, age and sex. Prospective studies are now required to discern whether methylation at MHC is involved in influencing risk of disease onset or whether the disease itself has altered the methylation profile.
Multiple sclerosis (MS) is an autoimmune disease characterised by lymphocyte-mediated inflammation causing demyelination and axonal degeneration. The underlying pathogenesis of MS remains unclear, but the risk of developing MS is influenced by a combination of genetic predisposition and environmental exposures. Several large genome-wide association studies (GWAS) have clearly identified major histocompatibility complex (MHC) region on chromosome 6p21, as the most important effect size, with the majority of the single-nucleotide polymorphisms (SNPs) that reached statistical significance falling within this region [1, 2]. The largest effect single-nucleotide polymorphism is the well-established human leukocyte antigen (HLA) class II region (HLA-DRB1*1501 in particular). Despite the significant contribution of the MHC region to MS risk and the large-scale GWAS study, there remains a large proportion of unexplained heritability in terms of MS risk [1, 3].
The main environmental exposures presumed to modify MS risk are smoking, sunlight exposure and Epstein-Barr virus (EBV) (reviewed in ). Epigenetics can influence the genome without changes to the DNA sequence. Environmental exposures such as smoking and sunlight exposure have been shown to be mediated by epigenetic mechanisms, providing a plausible link between environmental factors and disease [5, 6]. One such epigenetic mechanism is DNA methylation, which is the addition of a methyl group to CpG dinucleotides. We, and others, have used genome-wide DNA methylation technologies to assess differentially methylated regions (DMRs) of CD4+ and CD8+ T cells in relapsing-remitting multiple sclerosis (RRMS) patients compared to healthy controls [7–10]. These studies have found inconsistent and/or conflicting results [7, 8, 10]. Both groups found significant differences between CD4+ and CD8+ T cells [7, 8, 10]; however, our studies found a striking methylation signal located on chromosome 6p21 with a peak signal at HLA-DRB1, present in relapsing-remitting patients compared to healthy controls . We found this signal to be specific to CD4+ T cells, but Bos and colleagues did not see this in either CD4+ or CD8+ T cells [8, 10].
The differences between the studies could be explained in several ways. In our previous study, the majority of patients were on some type of immunomodulatory treatment as opposed to the Bos et al. study, who used treatment naïve patients. Also, there were differences in analysis methods in regard to filtering of probes contained within the SNP heavy region in which the methylation signal is contained in. In an effort to determine if we could replicate our initial results under a more tightly controlled study design, we performed a genome-wide DNA methylation study of CD4+ T cells in a new cohort of RRMS patients and healthy controls. This cohort has a more accurately age- and gender-matched control cohort and is comprised entirely of treatment-naïve patients or patients who have been free from immunomodulatory therapy for at least 3 months. Identifying epigenetic loci associated with MS, independent of treatment or SNP effects, could reveal potentially modifiable targets for environmental exposures or new drug design as well as identify potential markers for blood-based biomarkers of MS risk and treatment response.
Whole blood was initially collected from 28 female RRMS patients and 28 age-matched female healthy donors (Table 1). We chose to focus only on females to reduce potential sex effects and because female patients are more at risk of RRMS compared to males. All patients were diagnosed with MS according to the McDonald criteria  and were treatment naïve (19 patients) or had not taken immunomodulatory or steroid treatment for a minimum of 3 months (9 patients). The purpose of this design feature was to control for treatment effects as much as possible. Healthy control samples were collected from volunteers off the Hunter Medical Institute Research Register.
Blood sample processing and DNA methylation arrays
Peripheral blood mononucleocytes (PBMCs) were isolated from whole blood by density gradient using Lymphoprep (Stemcell Technologies, Canada) following standard laboratory procedures. Total CD4+ T cells were extracted from the PBMC population using EasySep negative magnetic separation according to the manufacturers’ instructions (Stemcell Technologies, Canada). After isolation, cell purity was assessed by flow cytometry. Cells were stained with a FITC-conjugated CD4 antibody (60016F1 StemCell Technologies, Canada) and collected on a BD FACSCanto II flow cytometer, then analysed using FACSDiva software (BD Biosciences). All samples met the minimum purity cutoff of 90%. DNA was extracted with the Qiagen microDNA extraction kit (QIAGEN, USA) and biosulphite converted using the MethylEasy Xceed kit according to the manufacturers’ instructions. Converted DNA was then applied to the Illumina Infinium Human450K Beadchip methylation arrays (service provided by Diamantina).
An in-house data analysis pipeline that used a combination of R/Bioconductor and custom scripts was designed. Illumina 450k raw intensity data (idat files) were parsed into the Bioconductor MINFI package . Methylation data was background-corrected and quantile-normalised according to MINFI routines. Data was cleaned by removing (failed) CpG probes for which the intensity of both the methylated and unmethylated probes was <1000 units across all samples. A threshold of 1000 units was selected based on the profile of the available negative control probes. Y chromosome probes were filtered out. All probe sequences were mapped to the human genome (buildHg19) using BOWTIE  to identify potential hybridisation anomalies. In total, 33,457 CpG probes were identified to align to the human genome multiple times and were filtered out of subsequent analysis. We chose to retain probes containing SNPs and filter these out post hoc where appropriate.
Measures of methylation level (β values) were produced for each CpG probe and ranged from 0 [completely unmethylated] to 1 [completely methylated]. To identify differentially methylated positions (DMPs) associated with MS subtypes in this cohort, the DMPFinder function was implemented. This calculates an F statistic for each CpG by comparing means between the case and control group. Subtracting control mean from case mean produces a Δbeta score—a measure of differential methylation ranging from −1 to 1. The Δbeta score can be broadly interpreted as percentage up or down methylation in cases compared to controls (or effect size).
Given the relatively modest sample size, this study was underpowered to detect significant DMPs at the methylome-wide level and thus we used a series of prioritisation steps to identify the most robust loci. Specifically, a DMP was defined as containing CpGs (i) that yielded a P < 0.05, i.e. nominally associated with MS, and (ii) that yielded a ∆beta of ±0.1, i.e. a relatively large differential methylation. Subsequently, a differentially methylated region (DMR) was defined as a DMP (iii) that had ≥2 adjacent CpGs within 1000-bp physical distance and (iv) whereby adjacent CpGs yielded a ∆beta in the same direction, i.e. all three CpGs in the DMR were consistently hypo- or hyper-methylated.
Gene set (or pathways) analysis was using the over-representation analysis (ORA) routing of the WebGestalt server (www.webgestalt.org/). Specifically, we entered our gene list into the gene ontology—biological processes database query—and used the default ORA parameters to explore whether our gene lists tended to favour a particular pathway(s) at a false discovery rate (FDR) of 5%.
Identification of DMPs and DMRs associated with MS
We analysed CpG methylation data specifically for CD4+ T cells obtained from 28 female RRMS patients—all treatment naïve—and 22 healthy age- and sex-matched control subjects. NB: Six control subjects were dropped due to poor quality methylation data. Clinical characteristics of the patient group are shown in Table 1.
Following the QC steps, methylation data for 445,787 CpGs were analysed using DMPFinder to identify DMPs associated with MS. A CpG prioritisation process was used to select the most robustly associated DMPs. First, all CpGs yielding a P < 0.05 were selected as being nominally associated with MS. Secondly, of the resultant list, only CpGs yielding a ∆beta of ±0.1 were selected. These steps resulted in 275 DMPs localising to 139 different genes and 136 unannotated genomic locations (see Additional file 1). Of the 275 DMPs, 134 (49%) were hypermethylated and 141 (51%) were hypomethylated. The top hit was for cg10568066 in the ring finger protein 39 (RNF39) gene whereby the CpG was hypermethylated in the case group compared to controls (∆beta = 0.19, P = 2.1 × 10−4). A gene ontology (pathways) analysis of the 134-gene set revealed that the biological process involving “regulation of GTPase activity” was over-represented by the gene list—GO:0043087, FDR = 0.015 (Additional file 2). GTPases are involved in signal transduction and cell differentiation and have been the target of many studies of MS.
Of the 275 DMPs, 14% are located within the MHC region. Importantly, nine of these were identified in our previous study and five are located in the main DMR we previously reported to be associated with MS in CD4+ T cells, i.e. HLA-DRB1 . In addition to the previously identified sites, we also identified three new CpGs within the HLA-DRB1 DMR. All except one are located within the same 400-bp region. We also observed hypermethylated DMPs at HLA-DRB5 and HLA-DQB1 consistent with our previous study (Additional file 1) . At the HLA-DRB5 region, we identified one previously identified probe plus an additional two sites. We did not see any change in methylation at the CpG sites in HLA-DRB6 previously identified, but did find two new hypermethylated sites (Additional file 1).
To identify DMRs, we focused only on those DMPs (i) that had ≥2 adjacent CpGs located within 1000-bp physical distance and (ii) whereby adjacent CpGs yielded a ∆meth in a consistent direction, i.e. all CpGs in the DMR were either hypo- or hyper-methylated. Table 2 shows the 33 DMPs representing six DMRs identified in this study. The largest DMRs were located within the MHC region on Chr 6. In addition to the DMR at HLA-DRB1, there was another large DMR in MHC region at RNF39. For this DMR, we observed 11 CpGs that were hypermethylated in the MS case group compared to the control (average ∆beta = −0.13). These CpGs were tightly clustered within a 346-bp region of the gene body. This region spans the boundary between intron 3 and exon 4 and spreads into exon 4 of RNF39.
To investigate whether or not SNP genotype might be influencing methylation signal at the RNF39 DMR, we first examined tracks on UCSC Genome Browser and determined that there were no common SNPs in the immediate vicinity of the CpGs in the DMR. Furthermore, we examined the methylation distribution for the largest effect DMP (i.e. cg10568066), which showed a relatively even spread of beta values within the MS and control groups indicating that SNP genotype is not confounding the signal (see Additional file 3).
Also within the MHC region, we identified a smaller DMR of three CpGs within a 45-bp span at the transcription start site (TSS) of the HLA complex group 4B-non-protein coding (HCG4P6) (Table 1). Non-MHC DMRs were also identified at PM20D1 on Chr 1 and ERICH1 on Chr 8. Interestingly, none of the annotated MS genes from the 110 non-HLA loci identified in the GWAS by the IMSGC  were represented in our extended list of 134 genes that showed CpG methylation changes as determined in this study. This may suggest that the underlying genetic and epigenetic architecture of MS is quite different outside of the MHC region.
In this study, we report the results of an epigenome-wide association study of methylation levels in CD4+ T cells of treatment-naïve female RRMS patients compared to age-matched healthy controls. In our previous study, which included patients on treatment, we found a DMR consisting of eight hypomethylated CpGs in HLA-DRB1 . In the current study, we were able to confirm five of these CpG sites, plus an additional three CpG sites clustered within the same 358-bp region. Although there is a lower effect size in the current study, the differential methylation occurs in the same direction (primarily hypomethylation). These results confirm the signal we saw in our original study and suggest this is not due to treatment effects. Another study by Bos et al.  did not find any changes at the HLA-DRB1 locus. However, this study filtered all probes that had a SNP in the probe sequence on the assumption that SNPs may affect the signal at these probes. This removed all probes at HLA-DRB1 that we find to have altered methylation status in this study. We did not remove these probes using our filters, and since the methylation signal is absent in CD8+ T cells derived from the same cohort, it is unlikely that SNP genotype underpins this DMR .
Interestingly, this study also identified an independent DMR in the MHC region in CD4+ T cells as being associated with MS, specifically, a large hypermethylated DMR in RNF39. The biological relevance of this locus can only be speculated at this stage; however, it resides within the gene body and spans an intron/exon boundary, so it is plausible that hypermethylation is involved in aberrant expression of alternately spliced transcripts or a regulatory element for nearby genes. RNF39 is a poorly characterised gene. In rats, RNF39 encodes a protein that plays a role in the early phase of synaptic plasticity .
Interestingly, one of the sites identified in the RRMS cohort (cg10568066) was also identified in a recent study that investigated the role of CpG methylation sites and ageing . RNF39 is also associated with other autoimmune conditions, such as Becet’s disease, a chronic relapsing inflammatory disease . This study found a SNP near RNF39 associated with the disease, although the functional consequences of this are not yet known. In addition, hypermethylation of 11 CpGs sites within RNF39 was seen in naïve CD4+ T cells of patients with systemic lupus erythematosus (SLE) who had a history of discoid rash . Furthermore, eight of these sites are the same sites as those we have identified in this study (cg10568066, cg12633154, cg13401893, cg10930308, cg03343571, cg13185413, cg09279736, cg00947782) and have similar Δbeta values to our RRMS cohort (meanRRMS = 0.13; meanSLE = 0.16) .
Although this study controlled for sex, age and treatment effects (as much as possible), there is a possibility that methylation at RNF39 may be under the influence of other factors that were not measured in this study, e.g., exposure to Epstein-Barr virus (EBV) and smoking. A search of the literature at the time of writing did not reveal any studies reporting an association of RNF39 methylation with known environmental risk factors of MS. However, one study did reveal an association with hepatitis B vaccination response  which suggests that future, larger scale studies of RNF39 and MS should attempt to include factors such as EBV as covariates in the analysis.
In conclusion, the findings from this study confirm our previous results at the MS risk locus HLA-DRB1 and also suggest hypermethylation in an independent MHC locus, RNF39, is also associated with MS. Taken together, our results highlight the importance of epigenetic factors at the MHC locus in MS independent of treatment, age and sex. Prospective studies are now required to discern whether methylation at MHC is involved in influencing risk of disease onset or whether the disease itself has altered the methylation profile.
Differentially methylation position
Differentially methylated region
Genome-wide association study
Human leukocyte antigen
Major histocompatibility complex
Peripheral blood mononucleocytes
Relapsing-remitting multiple sclerosis
International Multiple Sclerosis Genetics C, Beecham AH, Patsopoulos NA, Xifara DK, Davis MF, Kemppinen A, et al. Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis. Nat Genet. 2013;45(11):1353–60.
Patsopoulos NA, Bayer Pharma MSGWG, Steering Committees of Studies Evaluating I-b, a CCRA, Consortium AN, GeneMsa, et al. Genome-wide meta-analysis identifies novel multiple sclerosis susceptibility loci. Ann Neurol. 2011;70(6):897–912.
International Multiple Sclerosis Genetics C. IL12A, MPHOSPH9/CDK2AP1 and RGS1 are novel multiple sclerosis susceptibility loci. Genes Immun. 2010;11(5):397–405.
Loken-Amsrud KI, Lossius A, Torkildsen O, Holmoy T. Impact of the environment on multiple sclerosis. Tidsskr Nor Laegeforen. 2015;135(9):856–60.
Zhu H, Wang X, Shi H, Su S, Harshfield GA, Gutin B, et al. A genome-wide methylation study of severe vitamin D deficiency in African American adolescents. J Pediatr. 2013;162(5):1004–9. e1
Wan ES, Qiu W, Baccarelli A, Carey VJ, Bacherman H, Rennard SI, et al. Cigarette smoking behaviors and time since quitting are associated with differential DNA methylation across the human genome. Hum Mol Genet. 2012;21(13):3073–82.
Graves M, Benton M, Lea R, Boyle M, Tajouri L, Macartney-Coxson D, et al. Methylation differences at the HLA-DRB1 locus in CD4+ T-cells are associated with multiple sclerosis. Mult Scler. 2013;20(8):1033–41.
Bos SD, Page CM, Andreassen BK, Elboudwarej E, Gustavsen MW, Briggs F, et al. Genome-wide DNA methylation profiles indicate CD8+ T cell hypermethylation in multiple sclerosis. PLoS One. 2015;10(3):e0117403.
Baranzini SE, Mudge J, van Velkinburgh JC, Khankhanian P, Khrebtukova I, Miller NA, et al. Genome, epigenome and RNA sequences of monozygotic twins discordant for multiple sclerosis. Nature. 2010;464(7293):1351–6.
Maltby VE, Graves MC, Lea RA, Benton MC, Sanders KA, Tajouri L, et al. Genome-wide DNA methylation profiling of CD8+ T cells shows a distinct epigenetic signature to CD4+ T cells in multiple sclerosis patients. Clin Epigenetics. 2015;7:118.
Polman CH, Reingold SC, Banwell B, Clanet M, Cohen JA, Filippi M, et al. Diagnostic criteria for multiple sclerosis: 2010 revisions to the McDonald criteria. Ann Neurol. 2011;69(2):292–302.
Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta, C, Feinberg AP, Hansen KD, Irizary RA. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014;30(10):1363–69.
Liu Y, Schmidt B. Long read alignment based on maximal exact match seeds. Bioinformatics. 2012;28(18):i318–i24.
Matsuo R, Asada A, Fujitani K, Inokuchi K. LIRF, a gene induced during hippocampal long-term potentiation as an immediate-early gene, encodes a novel RING finger protein. Biochem Biophys Res Commun. 2001;289(2):479–84.
Marttila S, Kananen L, Hayrynen S, Jylhava J, Nevalainen T, Hervonen A, et al. Ageing-associated changes in the human DNA methylome: genomic locations and effects on gene expression. BMC Genomics. 2015;16:179.
Kurata R, Nakaoka H, Tajima A, Hosomichi K, Shiina T, Meguro A, et al. TRIM39 and RNF39 are associated with Behcet's disease independently of HLA-B *51 and -A *26. Biochem Biophys Res Commun. 2010;401(4):533–7.
Renauer P, Coit P, Jeffries MA, Merrill JT, McCune WJ, Maksimowicz-McKinnon K, et al. DNA methylation patterns in naive CD4+ T cells identify epigenetic susceptibility loci for malar rash and discoid rash in systemic lupus erythematosus. Lupus Sci Med. 2015;2(1):e000101.
Lu Y, Cheng Y, Yan W, Nardini C. Exploring the molecular causes of hepatitis B virus vaccination response: an approach with epigenomic and transcriptomic data. BMC Med Genet. 2014;7:12.
We would like to thank the MS patients and clinical team at the John Hunter Hospital MS clinic who participated in this study. We also acknowledge the Analytical Biomolecular Research Facility at the University of Newcastle for flow cytometry support and Diamantina Institute for performing the bisulfite conversions and hybridisations to the Illumina 450K arrays.
This study was supported by Multiple Sclerosis Research Australia (MSRA) and the Hunter Medical Research Institute. RAL and VEM are supported by fellowships from MSRA. VEM is supported by a fellowship from the Canadian Institutes for Health Research. KAS is supported by a student scholarship from the Trish foundation and MSRA.
Availability of data and materials
The datasets during and/or analysed during the current study are available from the corresponding author on reasonable request.
Ethics approval and consent to participate
The Hunter New England Health Research Ethics Committee and University of Newcastle Ethics committee approved this study (05/04/13.09 and H-505-0607, respectively), and methods were carried out in accordance with the institutional guidelines on human subject experiments. Written and informed consent was obtained from all patient and control subjects.
Consent for publication
Dr. Lechner-Scott’s institution receives non-directed funding as well as honoraria for presentations and membership on advisory boards from Genzyme, Biogen, Merck, Teva, Roche and Novartis Australia.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
DMPs and MS. List of DMPs associated with MS in CD4s. (CSV 44.8 kb)
Gene set analysis of top CpG hits. Output of results from ORA analysis using Webgestalt. (XLSX 46.8 kb)
Individual methylation (Beta value) distribution for top hit. Plots showing the distribution of beta values for case and control group for the top CpG in the RNF39 gene illustrating the DMP is not due to SNP genotype. An example of genotype influenced methylation spread is also shown for comparison. (PDF 113 kb)