- Open Access
Epigenome-wide and transcriptome-wide analyses reveal gestational diabetes is associated with alterations in the human leukocyte antigen complex
Clinical Epigeneticsvolume 7, Article number: 79 (2015)
Gestational diabetes mellitus (GDM) affects approximately 10 % of pregnancies in the United States and increases the risk of adverse health outcomes in the offspring. These adult disease propensities may be set by anatomical and molecular alterations in the placenta associated with GDM.
To assess the mechanistic aspects of fetal programming, we measured genome-wide methylation (Infinium HumanMethylation450 BeadChips) and expression (Affymetrix transcriptome microarrays) in placental tissue of 41 GDM cases and 41 matched pregnancies without maternal complications from the Harvard Epigenetic Birth Cohort. Specific transcriptional and epigenetic perturbations associated with GDM status included alterations in the major histocompatibility complex (MHC) region, which were validated in an independent cohort, the Rhode Island Child Health Study. Gene ontology enrichment among gene regulation influenced by GDM revealed an over-representation of immune response pathways among differential expression, reflecting these coordinated changes in the MHC region. This differential methylation and expression may be capturing shifts in cellular composition, reflecting physiological changes in the placenta associated with GDM.
Our study represents the largest investigation of transcriptomic and methylomic differences associated with GDM, providing comprehensive insight into how GDM shapes the intrauterine environment, which may have implications for fetal (re)programming.
Gestational diabetes mellitus (GDM) is the most common hyperglycemic disorder in pregnancy. Characterized by glucose intolerance that becomes clinically apparent near the end of the second trimester, the prevalence of GDM has doubled over the past 20 years, affecting approximately 10 % of pregnancies in the United States [1, 2]. In addition to shared clinical features, women with GDM have a substantially elevated risk of subsequent type 2 diabetes [3, 4], particularly in the presence of obesity [4, 5]. Exposure to GDM in utero may also adversely impact the health of the offspring, increasing the risk of macrosomia, and conferring a predisposition for obesity, metabolic syndrome, cardiovascular complications, and diabetes [1, 6]. While the clinical manifestations and potential implications of GDM for the mother and fetus have been well characterized, the molecular basis of GDM pathogenesis is largely unknown.
As the primary interface of nutrient transfer between mother and fetus, altered placental physiology has been suspected to be a key component of GDM pathogenesis and its impact on disease susceptibility in the offspring. Anatomical and molecular alterations in the placenta associated with GDM include significantly lower fetal-to-placental weight ratios , aberrant vascularization , and oxidative stress, potentially due to enhanced free radical production and/or defects in the antioxidant defenses . Inadequate response to oxidative stress may contribute to enhanced inflammatory conditions, exacerbated by obesity . Since fetal gluconeogenesis is minimal , the majority of the glucose essential to fetal growth and metabolism is transferred from the maternal circulation via the placenta. Transplacental glucose flux follows this maternal-to-fetal concentration gradient by facilitated diffusion, which can be characterized as flow-limited even at pathological glucose concentrations [12, 13]. Consequently, GDM exposes the fetus to abnormally high glucose levels, resulting in fetal hyperinsulinemia  and an increase in fetal fat mass . GDM may additionally affect amino acid transport  and lipid concentrations, contributing to accelerated low-density lipoprotein oxidation .
The impact of this adverse intrauterine environment is in line with an abundance of epidemiologic evidence purporting the Developmental Origins of Health and Disease (DOHaD) [18, 19]. Due to their plasticity, epigenetic modifications provide one mechanism facilitating developmental adaptation to these conditions . GDM has previously been associated with changes in placental gene expression across the genome [21–24], potentially reflecting changes in DNA methylation [25, 26]. In this study, we sought to understand the impact of GDM on the regulation of the placental transcriptome on the maternal (decidua basalis) side and how this is associated with alterations in the epigenome. This study represents the largest and most comprehensive integration of multi-omic data in this context, facilitating the identification of complex regulatory changes associated with GDM that may impact fetal (re)programming.
Results and discussion
Genome-wide methylation and expression was assessed in the placentas (maternal-side) of 41 clinically confirmed cases of GDM and 41 matched pregnancies without maternal complications from the Harvard Epigenetic Birth Cohort (HEBC) at Brigham and Women’s Hospital, Boston, MA, USA. Samples were matched based on maternal age, pre-pregnancy BMI, method of conception, ethnicity, smoking status, and infant sex. Approximately 50 % of these women were normal weight (18.5 ≤ BMI < 25) prior to pregnancy. Aside from two previously underweight mothers, the remaining women were either overweight (20 %; 25 ≤ BMI < 30) or obese (29 %; 30 ≤ BMI). Birth weight and gestational age were not associated with GDM in this study population (Additional file 1: Table S1). Additional characteristics of the population are presented in Additional file 1: Table S1.
Among these 82 placenta samples, we assessed methylation changes across the genome associated with GDM using the 450K Infinium Methylation BeadChip. To identify site-specific differences, methylation level at each locus was modeled as a function of GDM, controlling for potential confounding variables (Additional file 1: Table S2; 20 most significant loci). Each model was adjusted for chart-abstracted maternal age, pre-pregnancy BMI, infant sex, maternal smoking, and independent surrogate variables associated with putative sources of internal bias measured with error, specifically self-reported maternal ethnicity and our indicators for batch (chip, row, and column). Due to their more intuitive biological interpretation, β values (range: 0–1; low to high methylation) were utilized to identify differentially methylated loci, using robust standard errors to account for possible heteroscedasticity associated with the impact of the methylation level on intrinsic variability. GDM was most significantly associated with a 0.00726-lower methylation level at one CpG locus within the intron of CAPN1 (Additional file 1: Table S2). Given the limits of differential methylation detection above the technical variation of pyrosequencing, regions for validation were selected among the greatest absolute shifts in methylation associated with GDM, with a p value <0.001. In addition to these replication considerations, genomic context was factored into candidate selection, with priority given to regional changes detected via “bump hunting” (Additional file 1: Table S3).
Based on this approach, we identified four candidate regions for verification in our samples and validation in an independent cohort (Fig. 1 and Table 1). These included one locus within an enhancer and 5′UTR of CCDC181, which was associated with a 0.137 (95 % CI: 0.068, 0.207) increase in placental methylation among GDM mothers in our adjusted model (cg25464921). A second locus, within the introns of HLA-H and HLA-J, also exhibited significantly higher methylation with GDM (cg23681866; 0.108 (95 % CI: 0.049, 0.168)). These two sites were additionally within the two largest differentially methylated regions associated with GDM (Additional file 1: Table S3). The remaining two loci chosen showed lower average methylation among the placentas of GDM mothers in our adjusted models. For the locus 285-bp upstream of the TSS of HLA-DOA, the average difference was 0.117 (cg08147094; 95 % CI: −0.177, −0.058), with a smaller difference of 0.089 (cg18506672; 95 % CI: −0.133, −0.044) at a locus associated with the promoter of SNRPN/SNURF. Validation of these associations was performed by pyrosequencing of placenta samples from GDM mothers and matched normal pregnancies selected from the Rhode Island Child Health Study (RICHS), consisting of mothers recruited following delivery at the Women and Infants Hospital of Rhode Island (Additional file 1: Table S4). Besides a regional change trending towards significance upstream of HLA-DOA, none of the associations estimated on the array replicated in the independent cohort (Table 1). In contrast, pyrosequencing of these regions in the HEBC identified significant differential methylation of similar magnitude to the changes estimated on the methylation array proximal to our four candidates and among surrounding loci (with a trend towards significance for the locus within HLA-H/HLA-J), thus verifying the array findings. A failure to validate may reflect a difference in the distribution of effect modifiers between the two study populations. Compared to the mothers from the HEBC, the RICHS mothers were younger, with a higher pre-pregnancy BMI, and a greater proportion self-reporting white ethnicity (Additional file 2: Figure S1). Accordingly, we investigated modification of the association between methylation and GDM by maternal age and BMI, dichotomized by the median of each in HEBC, as well as self-reported ethnicity (Additional file 1: Table S5). Among the RICHS placenta samples, we identified significant effect modification of the association between GDM and methylation in the 5′UTR of CCDC181 by maternal age (p value for interaction p = 0.0122). Similar to the change observed in the HEBC cohort, GDM was associated with a trend towards higher methylation in older mothers, but among the younger mothers in RICHS, placentas had significantly lower average methylation at this context (−5.807 % (95 % CI: −10.890 and −0.724)), respectively. Within the region upstream of HLA-DOA, the impact of GDM differed by maternal pre-pregnancy BMI category in the validation cohort (p value for interaction p = 0.0387); in the higher BMI category, GDM was associated with lower methylation (−5.123 % (95 % CI: −8.825, −1.421)), as observed in HEBC. Significantly lower methylation of this HLA-DOA-associated region was also identified in the maternal blood of the HEBC samples among the women with a higher pre-pregnancy BMI (p value for interaction p = 0.0342; −6.039 % (95 % CI: −11.721, −0.357), Additional file 1: Tables S6–S7).
In separate experiments, we assessed differential expression associated with GDM by interrogating expression levels across the genome in a subsample of 55 placentas assessed for genome-wide methylation changes using the Affymetrix Human Transcriptome Array 2.0. Similar statistical models were utilized to estimate associations between expression levels and GDM, adjusting for the same covariates used in our methylation models and our data-driven estimated surrogates for batch and ethnicity (Additional file 1: Table S8; top 20 unique coding genes, sorted by significance). Several of the most significant changes in coding genes were within the extremely polymorphic major histocompatibility complex (MHC) region on chromosome 6, represented by multiple haplotypes on the array. Playing a critical role in the immune response, this complex encodes genes involved in antigen presentation to T cells. Polymorphisms in the region have been previously implicated in genetic susceptibility to type 1 diabetes. GDM was associated with decreased expression of both MHC class I (e.g., HLA-A, HLA-B, and HLA-C) and MHC class II genes (e.g., HLA-DQA2) (Additional file 1: Table S8; Additional file 2: Figure S2). Given the observed differences in methylation associated with GDM in this same region, we investigated the correlation structure between methylation and gene expression in this region, as well as the correlation in expression among these genes (Fig. 2a, b). The verified locus upstream of HLA-DOA showed a weak positive correlation with HLA-DOA (ρ = 0.29), and a moderate positive correlation with several other MHC genes (ρ = 0.27–0.475), whereas the CpG locus within the intron of HLA-H/HLA-J was not correlated with any MHC genes (Additional file 1: Table S9). However, none of these comparisons reached regional significance (Additional file 1: Table S9). GDM was a stronger predictor of regional variation in expression across the MHC region than methylation (Fig. 2; Additional file 2: Figure S2). When averaged across large blocks of CpG loci in this region, GDM was not associated with shifts in methylation level, suggesting more site-specific impacts. In contrast, the enrichment for coordinated down-regulation of expression by GDM status was unique to the MHC region. Generally, coordinated changes across this region did not appear to be dependent on direct proximity, suggesting higher order regulation of these processes (Fig. 2). Three of the most significant associations with GDM in our adjusted models were verified by real-time PCR (qRT-PCR), chosen based on consistent results between the gene-level association with GDM and the exon-level associations for which primers could be designed. A similar trend towards decreased expression of HLA-C with GDM was observed, but the associations with HLA-B and GPR174 were not replicated using a distinct technology (Additional file 1: Table S10). Validation of these genes by qRT-PCR was also attempted in the RICHS, but the results were not significant (Additional file 1: Table S10).
Pathway-level variation associated with GDM was assessed by gene ontology (GO) enrichment among genes associated with methylation and expression changes. Compared to expression variation, methylation profiles are more stable and may be more strongly associated with early developmental patterns. Conversely, differential expression may be more reflective of maternal-fetal interface during later development. Therefore, each form of genome-wide data may be reflecting unique and meaningful molecular variation that may impact future growth patterns. Biological process GO enrichment was assessed among the genes in proximity to the 648 CpG loci associated with GDM in our adjusted models at α level =0.001. The most significantly enriched processes among methylation changes were associated with cellular metabolism and response to external stimuli. However, this enrichment was not significant after correcting for multiple testing (Additional file 1: Table S11; Fig. 3). Using a less stringent cutoff to identify ontologies enriched among differential expression patterns, the 171 genes associated with GDM in our adjusted expression models at α level =0.01 were strongly enriched for the immune response (Additional file 1: Table S12; Fig. 3). This enrichment reflected the strong association between GDM and expression in the MHC region (Fig. 3).
This study represents the largest investigation of genome-wide variation in placental transcriptional regulation associated with GDM. Prior studies have assessed the impact of GDM on genome-wide expression patterns in the placenta, in both the fetal and maternal side, with numbers of diagnosed cases ranging from 2 to 19 [21–24]. While the genes with the largest change in expression associated with GDM varied between study populations, a majority of these studies identified enrichment for inflammatory pathways among observed differential expression. In one smaller study assessing variation across the fetal-side placental methylome, a distinct enrichment for metabolic disease pathways was observed . Mirroring one of our findings, this prior study found GDM was strongly associated with a change in methylation in proximity to VIPR1, a gene suggested to play a role in controlling inflammation. Another study of 27 GDM-exposed South Asian mothers did not identify fetal-side methylation shifts that overlapped with either our study or the findings of Ruchet et al., suggesting possible race/ethnicity-specific changes . Both prior studies of placenta methylation additionally investigated variation in cord blood methylation and found minimal overlap between site-specific shifts [25, 26]. Our study represents the first integration of genome-wide methylation and expression data to identify distinct and shared regulation. Through this multi-omic profiling, we identified markers of physiological changes in the placenta associated with intrauterine exposure to GDM. Given the influence of the placenta on fetal development, these regulatory patterns may contribute to the fetal (re)programming that has been associated with GDM exposure.
GDM shares several clinical features, lifestyle risk factors, and genetic susceptibility genes with type 2 diabetes [27, 28], including an increased incidence of chronic systemic inflammation [29–33]. In contrast to early insulin sensitivity, the later part of normal gestation is characterized by maternal hyperinsulinemia and insulin resistance, resulting in increased circulation of lipids and glucose to meet the energy requirements of fetal growth . With amplified metabolic stress, such as with GDM and pre-existing obesity, peripheral insulin resistance is more pronounced and associated with increased concentrations of circulating fatty acids and lipids [15, 35]. The changes in insulin signaling may be induced by pro-inflammatory cytokines and adipokines secreted from both the adipose tissue and placenta of mothers with GDM [23, 36, 37], which have been previously implicated in insulin resistance among non-pregnant populations [38, 39]. Our findings further support the premise that a diabetic environment in utero instigates the overexpression of inflammation-related genes in the placenta and suggest the increased release of inflammatory molecules. These shifts in expression may be more generally reflective of the shifts in placenta cell composition associated with the systemic inflammation characteristic of GDM.
In addition to this general immunomodulation, we identified coordinated regulation in the major histocompatibility complex (MHC) region associated with GDM, suggesting a potential autoimmune component in pathogenesis that is more characteristic of type 1 diabetes. Contrary to the insulin resistance of type 2 diabetes, type 1 diabetes is characterized by the progressive destruction on pancreatic beta cells as a result of T-cell-mediated autoimmunity, leading to insulin deficiency and hyperglycemia. Polymorphisms within class II human leukocyte antigen (HLA) genes coded in the MHC region have been implicated in predispositions for type 1 diabetes [40–42]. These class II molecules play a role in the presentation of antigenic peptides to helper T cells, whereas class I molecules present to cytotoxic T cells. Compared to the number of studies focused on the clinical features shared with type 2 diabetes, there is a relative paucity of studies assessing the similarities between GDM and type 1 diabetes. Approximately 10 % of GDM is suspected to represent an autoimmune form, characterized by the presence of beta-call cell autoantibodies , an increased frequency of type-1-diabetes-related haplotypes in HLA class II genes [44, 45], and a higher risk of late-onset type 1 diabetes [46–48]. More generally, GDM has been associated with increased anti-HLA-class-II antibodies in maternal circulation, suggesting reduced tolerance towards alloantigens , which may play a role in GDM pathogenesis through inflammatory activation. Our findings highlight a significant role of the MHC region in the presentation of GDM, with a general down-regulation of HLA genes among GDM-exposed placentas. While GDM has not been associated with an increased risk of type 1 diabetes in the offspring, shifts in autoimmune cells in the placenta may impair the intrauterine environment, shaping fetal development. It is possible that the adverse changes associated with GDM may be exacerbated by a higher pre-pregnancy BMI or higher maternal age, as evidenced by our methylation stratification results. Decreased placental methylation upstream of HLA-DOA among GDM cases was observed in both the HEBC and RICHS cohorts, particularly among the women with a high pre-pregnancy BMI. In maternal blood samples, we saw a similar decrease in methylation in this region among these women. These results highlight the heterogeneity of GDM but also suggest that additional investigation into the regulation of the MHC region associated with GDM is warranted.
Differential methylation and expression associated with GDM likely also captures shifts in cellular composition, reflecting physiological changes in the placenta. GDM has previously been associated with gross anatomical changes, including increased placental weight, diameter, and thickness among poorly controlled diabetic women . Our samples were excised from the maternal (decidua basalis) side, which primarily contains cytotrophoblasts, syncytiotrophoblasts, and extravillous trophoblasts. Previously observed histological changes associated with GDM include villous edema, fibrin deposits in the syncytiotrophoblast, and cytotrophoblast hyperplasia . Changes in the regulation of immunomodulatory genes in the placenta identified in this and prior studies may reflect the infiltration of inflammatory cells into the placenta of diabetic mothers. Unfortunately, purified placenta cell types have not previously been analyzed on the Illumina Infinium array to estimate and adjust for the impact of cellular composition on our results . However, these shifts in composition are potentially on the causal pathway. Removing this component of the variation in methylation and expression would have likely precluded identification of the autoimmune component that has not been thoroughly explored by prior studies. This is a biological variation of interest given its role in shaping the intrauterine environment.
Our study exhibits several strengths, including a large population size, the integration of genome-wide methylation and expression, the employment of technical verification, and the use of an independent cohort for validation. Beyond the general assessment of reproducibility, validation in an independent cohort helped to identify maternal characteristics that modified observed epigenetic changes, further elucidating condition heterogeneity. While our candidate methylation changes were verified in our study population by pyrosequencing, none of these shifts were significantly reproduced in our validation cohort. For two of these regions, a failure to validate reflected a difference in the distribution of measured effect modifiers between the two study populations. Additional effect modification by unmeasured factors, or additional unmeasured confounding, may account for the inability to validate certain regions. Due to our sample size, we could also not explore the possibility of more complex models of interaction that may influence the association. The failure to validate the expression microarray results by qRT-PCR results may partially reflect technical differences between the platforms. For instance, the qRT-PCR probes span larger lengths of the transcriptome than what is represented with the exon levels analyzed by the microarray. In addition, the greater temporal variability of expression may make these changes generally more difficult to validate. Future larger, more directed studies may be able to further refine the specific populations susceptible to molecular changes associated with GDM.
While methylation may be capturing regulatory patterns established early in gestation or earlier shifts in cellular composition, the identification of differential expression is restricted to genes expressed perinatally. By interrogating the methylome and transcriptome, we are able to characterize the influence of GDM on placental gene regulation across gestation. This integration provides comprehensive insight into the molecular basis of GDM pathogenesis with possible implications for fetal (re)programming.
Availability of supporting data
The data sets supporting the results of this article have been deposited in NCBI’s Gene Expression Omnibus and are accessible through GEO SuperSeries accession number GSE70494 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE70494).
Our study population consisted of women enrolled in the Harvard Epigenetic Birth Cohort (HEBC) at the Brigham and Women’s Hospital (BWH) in Boston, MA. The HEBC was initiated to study prenatal determinants of epigenetic marks in cord blood and placenta. Data and biospecimens for the HEBC were collected from June 2007 to June 2009 and include 1941 mother-child dyads . Mother-infant dyads for the proposed study have been selected as follows: “case” placenta samples were selected among mothers with a clinical diagnosis of gestational diabetes, excluding women with pre-existing hypertension or pre-existing diabetes mellitus. “Controls” were identified among samples with no gestational diabetes, no pregnancy-induced hypertension, no preeclampsia, and no previous hypertension or diabetes. Each case was individually matched to a control with a maternal age within 5 years of the case, with the same method of conception, ethnicity, smoking status, and infant sex, and with the closest pre-pregnancy BMI among mothers in the HEBC. Matching criteria was relaxed when the difference in pre-pregnancy BMI between the case and closest control match was >5 kg/m2. Characteristics for the case and control samples are summarized in Additional file 1: Table S1; continuous variables are summarized by the mean (SD), and categorical variables are reported as counts (%).
The study protocol was approved by the Institutional Review Board of the Brigham and Women’s Hospital. Completing the pregnancy questionnaire was considered implied consent.
Placenta and maternal blood samples were collected immediately after delivery. Tissue samples collected for DNA extraction were snap-frozen and stored in liquid nitrogen, with samples for RNA extraction stored in RNAlater (Ambion, Carlsbad, CA) at −20 °C until further processing. All placenta samples used in this study were taken from the maternal (decidua basalis) side near the umbilical cord.
DNA was isolated from placenta tissue and maternal blood samples using the QIAmp DNA Mini Kit (Qiagen) according to the manufacturer’s instructions.
Illumina 450K methylation microarrays
Genomic DNA was bisulfite treated using the Zymo EZ-96 DNA methylation kit (Zymo Research). For the assessment of genome-wide DNA methylation, the Illumina 450K Infinium Methylation BeadChip was used. The distribution of samples across chips was blinded by randomly sorting de-identified matched samples to reduce the likelihood of systematic technical bias. This array covers approximately 99 % of RefSeq genes, with approximately 17.2 probes per gene region, and 96 % of CpG islands, including the more tissue-specific CpG island shores and shelves . The 450K methylation microarrays were performed at The University of Southern California’s (USC, CA, USA) USC Epigenome Center (http://epigenome.usc.edu/).
Validation and verification of DNA methylation using pyrosequencing
Four differentially methylated regions (Additional file 1: Table S13) associated with GDM in our adjusted models were verified in the HEBC and validated in a separate birth cohort, RICHS. RICHS is an ongoing, population-based birth cohort established at Women and Infants’ Hospital of Rhode Island and independently funded (R01 MH094609) [53, 54]. A total of 500 ng of genomic DNA was bisulfite converted using the Zymo EZ-96 DNA methylation kit (Zymo Research).
For each region of interest, PCR was performed using a primer set without CG dinucleotides in their sequences specific for the converted DNA. These primers surrounded at least three CG dinucleotides and enabled us to amplify both methylated and unmethylated templates. Optimized primers were ordered from EpigenDx (EpigenDx, Hopkinton, MA; design described in Additional file 1: Table S13). Amplification products were sequenced on our PyroMark Q24 pyrosequencer (Qiagen). Another bisulfite conversion check was performed by adding cytosine site in the sequence to assess incomplete conversion of the DNA strands. Percent methylation was estimated by the proportion C/(C + T) at each CG site within the amplified region.
Using the mirVANA RNA Isolation Kit (Ambion Inc., Austin, TX), RNA was isolated according to the manufacturer’s protocol.
Affymetrix gene expression arrays
Gene expression was assessed using the Human Transcriptome Array 2.0 (Affymetrix). The array is designed with a median of approximately 21 unique probes per transcript, enabling analysis of expression at the gene level and investigation of alternative splicing variants. The probes on the array cover a total of >30,000 coding transcripts, including alternative splice variants, and >11,000 lincRNA. Transcript coverage and gene count correspond to RefSeq from February, 2012. A total of 50 ng of high-quality RNA (260/280 = 1.8–2.0; RIN > 6.0) was used to generate sense-strand cDNA with the Ambion® WT Expression Kit (Life Technologies). The Affymetrix gene expression arrays were performed at the Center for Personalized Genetic Medicine, a research core facility of the Partners Health Care System in Cambridge, MA. Expression arrays were performed on 55 case-control samples plus quality controls.
The expression levels of three genes that differed significantly among GDM cases and controls were chosen for verification in the HEBC and validation in RICHS. cDNA was synthesized using 500 ng of RNA with the High Capacity cDNA Reverse Transcription Kit (Applied Biosystems, Foster City, USA). Predesigned PrimeTime qPCR Assays from Integrated DNA Technologies (IDT) were used for HLA-C, HLA-B, and GPR174. The qRT-PCR reaction was performed with TaqMan Gene Expression Master Mix (Applied Biosystems) according to the manufacturer’s instructions on a Life Technologies 7900HT qPCR machine at the Harvard Medical School ICCB Screening Facility with reverse transcription controls. The qRT-PCR cycling conditions were as follows: 50 °C for 2 min, 95 °C for 10 min, 40 cycles of 95 °C for 15 s, and 60 °C for 60 s. All qRT-PCR data was normalized using the PrimeTime GAPDH Assay (IDT). Delta Ct (ΔCt) was defined as the expression difference between the target gene and GAPDH: ΔCt = CtGene of Interest – CtGAPDH. All samples were analyzed in triplicate. Additional file 1: Table S14 lists primer information.
Methylation data was processed prior to analysis in accordance with best practices to reduce the influence of technical artifacts . Color bias adjustment and quantile normalization (QN) was performed on signal intensities to decrease technical variation, specifically array position effects . The fluorescence intensities from methylated (M) and unmethylated (U) alleles were then converted to methylation level, ranging from 0 to 1, given by β = M/(M + U + 100). Adjustment for probe-type bias was performed on the β values using beta-mixture quantile normalization (BMIQ) [57, 58]. We restricted further analysis to autosomal loci that did not have a SNP at the target locus or have a probe that cross-hybridized to the X chromosome, which could induce gender bias due to X inactivation . Even after between-array normalization, batch effects may contribute bias in site-specific analyses . Independent surrogate variable analysis (ISVA) was utilized to estimate this technical confounding based on our surrogates for batch (chip, chip row, chip column), as well as biological confounders assumed to be measured with error (self-reported ethnicity) . Estimated independent surrogate variables (ISVs) associated with these confounders were incorporated into subsequent multivariable models. Site-specific methylation was modeled as a function of GDM status, adjusting for maternal age (years), pre-pregnancy BMI (kg/m2), infant sex, maternal smoking (yes/no), and our ISVs. To account for possible heteroscedasticity due to the intrinsic association between methylation level and variability, we used robust standard errors in Wald tests for our indicator of GDM status. Regional changes were identified using a “bump hunting” approach, using the T statistic from our adjusted models to identify differentially methylated regions among contiguous loci within 500 bp that exceed the 99th percentile of genome-wide changes, summarizing these regions based on the effect size . Candidates were selected after removing probes with a SNP or non-specific binding to other autosomal sites, which, while potentially true signals, would increase the difficulty of primer design. Associations with GDM across pyrosequenced loci were modeled using linear mixed models with a random intercept for sample, adjusting for maternal age, pre-pregnancy BMI, infant sex, maternal smoking, and self-reported ethnicity. Site-specific analyses were also performed for the locus in closest proximity to the candidate on the array. Possible effect modification by maternal ethnicity (white/non-white), maternal age (dichotomized by median age in HEBC), and pre-pregnancy BMI (dichotomized by median BMI in HEBC) was assessed among the pyrosequenced regions by a Wald test of the interaction term.
CEL files containing the measured expression intensities were processed using the Affymetrix Expression Console Software (Affymetrix). Background correction was performed using the Robust Multichip Analysis (RMA) algorithm, to minimize the variance seen across arrays . These probe values were then quantile normalized and summarized into one gene-level expression measure using median polish [63, 64]. ISVs significantly associated with our surrogates for batch (plate row, plate column) and maternal ethnicity in the expression matrix were incorporated into subsequent models. Associations between GDM and expression were then adjusted for the covariates included in our methylation models, using robust SEs in our Wald tests. For ease of interpretation, only associations with coding RefSeq genes were reported. To identify coordinated regulation between methylation and expression, as well as co-expression networks in the MHC region (chr6: 28477797–33448354), we estimated the Spearman correlation between each CpG within 10 kb of the MHC region and the expression of each gene in this region, as well as between the expression of each gene in this region. The significance of each pairwise comparison was estimated by 10,000 permutations, imputing 1e−6 for p = 0. We applied the Benjamini-Hochberg (false discovery rate) method to correct the two-sided p values for multiple testing across the MHC region.
Biological process gene ontology (GO) enrichment was assessed among changes in methylation and expression associated with GDM. We restricted analysis to ontologies associated with between 20 and 1500 genes and used the information content of these terms to identify potential redundancy. The Jiang and Conrath method was used to calculate the pairwise similarity between terms, using a scaled (0–1) cutoff of 0.7 to define a clustering of “highly similar terms.” GO analysis was then restricted to the GO associated with the largest number of genes among highly similar terms, resulting in a final subset of 402 biological processes. Enrichment of these terms was assessed among the genes in proximity to the 648 CpG loci associated with GDM in our adjusted models at α level =0.001. The non-random design of the methylation array precluded standard significance testing of this enrichment. Certain genes are represented by more probes on the array, potentially due to CpG island proximity or scientific interest, increasing the likelihood of detecting enrichment among terms associated with these genes. To account for this design bias, a resampling approach was used. For each GO analyzed, 648 CpG loci were randomly selected from the array and assigned to the closest gene. This random sampling was repeated 10,000 times to approximate the null odds ratio distribution of each term on the array and calculate the significance of observed methylation enrichment. Biological process enrichment among expression changes associated with GDM (p < 0.01; 171 genes) was assessed using hypergeometric tests. The significance of methylation and expression GO enrichment was then corrected for the false discovery rate, estimating associated q values .
Reece EA, Leguizamón G, Wiznitzer A. Gestational diabetes: the need for a common ground. Lancet. 2009;373:1789–97.
Ferrara A. Increasing prevalence of gestational diabetes mellitus: a public health perspective. Diabetes Care. 2007;30 Suppl 2:S141–146.
Catalano PM, Tyzbir ED, Wolfe RR, Roman NM, Amini SB, Sims EA. Longitudinal changes in basal hepatic glucose production and suppression during insulin infusion in normal pregnant women. Am J Obstet Gynecol. 1992;167(4 Pt 1):913–9.
Friedman JE, Ishizuka T, Shao J, Huston L, Highman T, Catalano P. Impaired glucose transport and insulin receptor tyrosine phosphorylation in skeletal muscle from obese women with gestational diabetes. Diabetes. 1999;48:1807–14.
O’Sullivan JB. Body weight and subsequent diabetes mellitus. JAMA. 1982;248:949–52.
Moore TR. Fetal exposure to gestational diabetes contributes to subsequent adult metabolic syndrome. Am J Obstet Gynecol. 2010;202:643–9.
Taricco E, Radaelli T, Nobile de Santis MS, Cetin I. Foetal and placental weights in relation to maternal characteristics in gestational diabetes. Placenta. 2003;24:343–7.
Gauster M, Desoye G, Tötsch M, Hiden U. The placenta and gestational diabetes mellitus. Curr Diab Rep. 2012;12:16–23.
Lappas M, Hiden U, Desoye G, Froehlich J, Hauguel-de Mouzon S, Jawerbaum A. The role of oxidative stress in the pathophysiology of gestational diabetes mellitus. Antioxid Redox Signal. 2011;15:3061–100.
Colomiere M, Permezel M, Riley C, Desoye G, Lappas M. Defective insulin signaling in placenta from pregnancies complicated by gestational diabetes mellitus. Eur J Endocrinol Eur Fed Endocr Soc. 2009;160:567–78.
Gaither K, Quraishi AN, Illsley NP. Diabetes alters the expression and activity of the human placental GLUT1 glucose transporter. J Clin Endocrinol Metab. 1999;84:695–701.
Desoye G, Gauster M, Wadsack C. Placental transport in pregnancy pathologies. Am J Clin Nutr. 2011;94(6 Suppl):1896S–902S.
Day PE, Cleal JK, Lofthouse EM, Hanson MA, Lewis RM. What factors determine placental glucose transfer kinetics? Placenta. 2013;34:953–8.
Weintrob N, Karp M, Hod M. Short- and long-range complications in offspring of diabetic mothers. J Diabetes Complications. 1996;10:294–301.
Catalano PM, Thomas A, Huston-Presley L, Amini SB. Increased fetal adiposity: a very sensitive marker of abnormal in utero development. Am J Obstet Gynecol. 2003;189:1698–704.
Cetin I, de Santis MSN, Taricco E, Radaelli T, Teng C, Ronzoni S, et al. Maternal and fetal amino acid concentrations in normal pregnancies and in pregnancies with gestational diabetes mellitus. Am J Obstet Gynecol. 2005;192:610–7.
Sánchez-Vera I, Bonet B, Viana M, Quintanar A, Martín MD, Blanco P, et al. Changes in plasma lipids and increased low-density lipoprotein susceptibility to oxidation in pregnancies complicated by gestational diabetes: consequences of obesity. Metabolism. 2007;56:1527–33.
Barker DJ. In utero programming of chronic disease. Clin Sci Lond Engl 1979. 1998;95:115–28.
Gluckman PD, Hanson MA, Cooper C, Thornburg KL. Effect of in utero and early-life conditions on adult health and disease. N Engl J Med. 2008;359:61–73.
Hochberg Z, Feil R, Constancia M, Fraga M, Junien C, Carel J-C, et al. Child health, developmental plasticity, and epigenetic programming. Endocr Rev. 2011;32:159–224.
Enquobahrie DA, Williams MA, Qiu C, Meller M, Sorensen TK. Global placental gene expression in gestational diabetes mellitus. Am J Obstet Gynecol. 2009;200:206. e1–13.
Radaelli T, Lepercq J, Varastehpour A, Basu S, Catalano PM, Hauguel-De Mouzon S. Differential regulation of genes for fetoplacental lipid pathways in pregnancy with gestational and type 1 diabetes mellitus. Am J Obstet Gynecol. 2009;201:209. e1–209.e10.
Radaelli T, Varastehpour A, Catalano P, Mouzon SH. Gestational diabetes induces placental genes for chronic stress and inflammatory pathways. Diabetes. 2003;52:2951–8.
Zhao Y-H, Wang D-P, Zhang L-L, Zhang F, Wang D-M, Zhang W-Y. Genomic expression profiles of blood and placenta reveal significant immune-related pathways and categories in Chinese women with gestational diabetes mellitus. Diabet Med. 2011;28:237–46.
Ruchat S-M, Houde A-A, Voisin G, St-Pierre J, Perron P, Baillargeon J-P, et al. Gestational diabetes mellitus epigenetically affects genes predominantly involved in metabolic diseases. Epigenetics. 2013;8:935–43.
Finer S, Mathews C, Lowe R, Smart M, Hillman S, Foo L, et al. Maternal gestational diabetes is associated with genome-wide DNA methylation variation in placenta and cord blood of exposed offspring. Hum Mol Genet. 2015;24:3021–9.
Cho YM, Kim TH, Lim S, Choi SH, Shin HD, Lee HK, et al. Type 2 diabetes-associated genetic variants discovered in the recent genome-wide association studies are related to gestational diabetes mellitus in the Korean population. Diabetologia. 2009;52:253–61.
Lauenborg J, Grarup N, Damm P, Borch-Johnsen K, Jørgensen T, Pedersen O, et al. Common type 2 diabetes risk gene variants associate with gestational diabetes. J Clin Endocrinol Metab. 2009;94:145–50.
Thorand B, Löwel H, Schneider A, Kolb H, Meisinger C, Fröhlich M, et al. C-reactive protein as a predictor for incident diabetes mellitus among middle-aged men: results from the MONICA Augsburg cohort study, 1984–1998. Arch Intern Med. 2003;163:93–9.
Taniguchi A, Nagasaka S, Fukushima M, Sakai M, Okumura T, Yoshii S, et al. C-reactive protein and insulin resistance in non-obese Japanese type 2 diabetic patients. Metabolism. 2002;51:1578–81.
Pradhan AD, Manson JE, Rifai N, Buring JE, Ridker PM. C-reactive protein, interleukin 6, and risk of developing type 2 diabetes mellitus. JAMA. 2001;286:327–34.
Freeman DJ, Norrie J, Caslake MJ, Gaw A, Ford I, Lowe GDO, et al. C-reactive protein is an independent predictor of risk for the development of diabetes in the West of Scotland Coronary Prevention Study. Diabetes. 2002;51:1596–600.
Qiu C, Sorensen TK, Luthy DA, Williams MA. A prospective study of maternal serum C-reactive protein (CRP) concentrations and risk of gestational diabetes mellitus. Paediatr Perinat Epidemiol. 2004;18:377–84.
Lain KY, Catalano PM. Metabolic changes in pregnancy. Clin Obstet Gynecol. 2007;50:938–48.
Colomiere M, Permezel M, Lappas M. Diabetes and obesity during pregnancy alter insulin signalling and glucose transporter expression in maternal skeletal muscle and subcutaneous adipose tissue. J Mol Endocrinol. 2010;44:213–23.
Desoye G, Hauguel-de Mouzon S. The human placenta in gestational diabetes mellitus. The insulin and cytokine network. Diabetes Care. 2007;30 Suppl 2:S120–126.
Lappas M, Mitton A, Mittion A, Permezel M. In response to oxidative stress, the expression of inflammatory cytokines and antioxidant enzymes are impaired in placenta, but not adipose tissue, of women with gestational diabetes. J Endocrinol. 2010;204:75–84.
Hivert M-F, Sullivan LM, Fox CS, Nathan DM, D’Agostino Sr RB, Wilson PWF, et al. Associations of adiponectin, resistin, and tumor necrosis factor-alpha with insulin resistance. J Clin Endocrinol Metab. 2008;93:3165–72.
Ouchi N, Parker JL, Lugus JJ, Walsh K. Adipokines in inflammation and metabolic disease. Nat Rev Immunol. 2011;11:85–97.
Stankov K, Benc D, Draskovic D. Genetic and epigenetic factors in etiology of diabetes mellitus type 1. Pediatrics. 2013;132:1112–22.
Nejentsev S, Howson JMM, Walker NM, Szeszko J, Field SF, Stevens HE, et al. Localization of type 1 diabetes susceptibility to the MHC class I genes HLA-B and HLA-A. Nature. 2007;450:887–92.
Erlich H, Valdes AM, Noble J, Carlson JA, Varney M, Concannon P, et al. HLA DR-DQ haplotypes and genotypes and type 1 diabetes risk: analysis of the type 1 diabetes genetics consortium families. Diabetes. 2008;57:1084–92.
Mauricio D, de Leiva A. Autoimmune gestational diabetes mellitus: a distinct clinical entity? Diabetes Metab Res Rev. 2001;17:422–8.
Törn C, Gupta M, Sanjeevi CB, Aberg A, Frid A, Landin-Olsson M. Different HLA-DR-DQ and MHC class I chain-related gene A (MICA) genotypes in autoimmune and nonautoimmune gestational diabetes in a Swedish population. Hum Immunol. 2004;65:1443–50.
Papadopoulou A, Lynch KF, Shaat N, Nilsson A, Lernmark B, Berntorp K, et al. The type 1 diabetes protective HLA DQB1*0602 allele is less frequent in gestational diabetes mellitus. Diabetologia. 2009;52:1339–42.
Damm P, Kühl C, Buschard K, Jakobsen BK, Svejgaard A, Sodoyez-Goffaux F, et al. Prevalence and predictive value of islet cell antibodies and insulin autoantibodies in women with gestational diabetes. Diabet Med J Br Diabet Assoc. 1994;11:558–63.
Füchtenbusch M, Ferber K, Standl E, Ziegler AG. Prediction of type 1 diabetes postpartum in patients with gestational diabetes mellitus by combined islet cell autoantibody screening: a prospective multicenter study. Diabetes. 1997;46:1459–67.
Nilsson C, Ursing D, Törn C, Aberg A, Landin-Olsson M. Presence of GAD antibodies during gestational diabetes mellitus predicts type 1 diabetes. Diabetes Care. 2007;30:1968–71.
Steinborn A, Saran G, Schneider A, Fersis N, Sohn C, Schmitt E. The presence of gestational diabetes is associated with increased detection of anti-HLA-class II antibodies in the maternal circulation. Am J Reprod Immunol N Y N 1989. 2006;56:124–34.
Houseman EA, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinformatics. 2012;13:86.
Michels KB, Harris HR, Barault L. Birthweight, maternal weight trajectories and global DNA methylation of LINE-1 repetitive elements. PLoS One. 2011;6, e25254.
Bibikova M, Barnes B, Tsan C, Ho V, Klotzle B, Le JM, et al. High density DNA methylation array with single CpG site resolution. Genomics. 2011;98:288–95.
Marsit CJ, Lambertini L, Maccani MA, Koestler DC, Houseman EA, Padbury JF, et al. Placenta-imprinted gene expression association of infant neurobehavior. J Pediatr. 2012;160:854–60. e2.
Marsit CJ, Maccani MA, Padbury JF, Lester BM. Placental 11-beta hydroxysteroid dehydrogenase methylation is associated with newborn growth and a measure of neurobehavioral outcome. PLoS One. 2012;7, e33794.
Marabita F, Almgren M, Lindholm ME, Ruhrmann S, Fagerström-Billai F, Jagodic M, et al. An evaluation of analysis pipelines for DNA methylation profiling using the Illumina HumanMethylation450 BeadChip platform. Epigenetics. 2013;8:333–46.
Du P, Kibbe WA, Lin SM. lumi: a pipeline for processing Illumina microarray. Bioinforma Oxf Engl. 2008;24:1547–8.
Pidsley R, Y Wong CC, Volta M, Lunnon K, Mill J, Schalkwyk LC. A data-driven approach to preprocessing Illumina 450K methylation array data. BMC Genomics. 2013;14:293.
Teschendorff AE, Marabita F, Lechner M, Bartlett T, Tegner J, Gomez-Cabrero D, et al. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450 k DNA methylation data. Bioinforma Oxf Engl. 2013;29:189–96.
Price ME, Cotton AM, Lam LL, Farré P, Emberly E, Brown CJ, et al. Additional annotation enhances potential for biologically-relevant analysis of the Illumina Infinium HumanMethylation450 BeadChip array. Epigenetics Chromatin. 2013;6:4.
Leek JT, Scharpf RB, Bravo HC, Simcha D, Langmead B, Johnson WE, et al. Tackling the widespread and critical impact of batch effects in high-throughput data. Nat Rev Genet. 2010;11:733–9.
Teschendorff AE, Zhuang J, Widschwendter M. Independent surrogate variable analysis to deconvolve confounding factors in large-scale microarray profiling studies. Bioinforma Oxf Engl. 2011;27:1496–505.
Jaffe AE, Murakami P, Lee H, Leek JT, Fallin MD, Feinberg AP, et al. Bump hunting to identify differentially methylated regions in epigenetic epidemiology studies. Int J Epidemiol. 2012;41:200–9.
Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostat Oxf Engl. 2003;4:249–64.
Bolstad BM, Irizarry RA, Astrand M, Speed TP. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinforma Oxf Engl. 2003;19:185–93.
Jiang J, Conrath D: Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy. In International Conference Research on Computational Linguistics. Taiwan; 1997.
Storey JD, Tibshirani R. Statistical significance for genomewide studies. Proc Natl Acad Sci. 2003;100:9440–5.
We would like to thank the participants of the Harvard Epigenetics Birth Cohort and all staff who helped collect, store, and process the samples at the Brigham and Women’s Hospital. We would also like to express our gratitude to Dr. Rafael Irizarry for reviewing this manuscript and his insightful suggestions. The Epigenetic Birth Cohort was funded by research grant R21CA128382 from the National Cancer Institute, National Institutes of Health. A.M. Binder was supported by Training Grant T32HD060454 in Reproductive, Perinatal and Pediatric Epidemiology from the National Institute of Child Health and Human Development, National Institutes of Health. Dr. LaRocca was supported by the Harvard University Center for the Environment Ziff Environmental Fellowship. Drs. Marsit, Lesseur, and the RICHS cohort are supported by research grants R01MH094609 from the National Institute of Mental Health, R01ES022223, and P01ES022832 from the National Institute of Environmental Health Sciences and RD83544201 from the US EPA.
The authors declare that they have no competing interests.
AB designed the study, performed the data analysis, and wrote the manuscript. JL processed the samples and performed the qRT-PCR and pyrosequencing. CL prepared the validation samples. CM participated in the coordination of the validation cohort. KM participated in the design and coordination of this study. All authors read and approved the final manuscript.
Characteristics of placenta samples selected from the HEBC cohort that were assessed on the Illumina Infinium array. Table S2. Most significant changes in methylation associated with GDM in adjusted model. Table S3. Largest regional changes in methylation associated with GDM. Table S4. Characteristics of placenta samples selected from the RICHS cohort that were used to validate methylation changes observed in HEBC. Table S5. Assessing effect modification of the association between GDM and placenta methylation level estimated by pyrosequencing by maternal age, ethnicity, and pre-pregnancy BMI in our validation cohort (RICHS) and verification set (HEBC). Table S6. Among maternal blood HEBC samples, associations between GDM and methylation of candidate regions assayed by pyrosequencing. Table S7. Assessing effect modification of the association between GDM and maternal blood methylation level estimated by pyrosequencing by maternal age, ethnicity, and pre-pregnancy BMI in HEBC. Table S8. Most significant changes in gene-level expression associated with GDM in adjusted model, restricting to top 20 unique coding RefSeq genes. Table S9. Spearman correlation between methylation of loci chosen for validation and expression of genes in MHC region (chr6: 28477797–33448354), restricting to pairwise comparisons with p < 0.05. Table S10. Association between GDM and expression of genes estimated by qRT-PCR in adjusted models in our validation cohort (RICHS) and verification set (HEBC). Table S11. Biological pathway enrichment among genes in proximity to methylation changes associated with GDM. Table S12. Biological pathway enrichment among genes with differential expression associated with GDM. Table S13. Pyrosequencing assay information. Table S14. qRT-PCR primer information.
Distribution of maternal characteristics in HEBC and validation cohort (RICHS). Figure S2. Plotting the first two principle components for methylation and expression in the MHC region (chr6: 28477797–33448354) for GDM cases (blue) and matched controls (red). Figure S3. The subset of 20 genes driving the first two principle component loadings for expression (|rotation| > 0.1) in the MHC region (chr6: 28477797–33448354) for GDM cases (blue) and matched controls (red).