Skip to main content

DNA methylation as a potential mediator of the association between indoor air pollution and neurodevelopmental delay in a South African birth cohort

Abstract

Background

Exposure to indoor air pollution during pregnancy has been linked to neurodevelopmental delay in toddlers. Epigenetic modification, particularly DNA methylation (DNAm), may explain this link. In this study, we employed three high-dimensional mediation analysis methods (HIMA, DACT, and gHMA) followed by causal mediation analysis to identify differentially methylated CpG sites and genes that mediate the association between indoor air pollution and neurodevelopmental delay. Analyses were performed using data from 142 mother to child pairs from a South African birth cohort, the Drakenstein Child Health Study. DNAm from cord blood was measured using the Infinium MethylationEPIC and HumanMethylation450 arrays. Neurodevelopment was assessed at age 2 years using the Bayley Scores of Infant and Toddler Development, 3rd edition across four domains (cognitive development, general adaptive behavior, language, and motor function). Particulate matter with an aerodynamic diameter of 10 μm or less (PM10) was measured inside participants’ homes during the second trimester of pregnancy.

Results

A total of 29 CpG sites and 4 genes (GOPC, RP11-74K11.1, DYRK1A, RNMT) were identified as significant mediators of the association between PM10 and cognitive neurodevelopment. The estimated proportion mediated (95%-confidence interval) ranged from 0.29 [0.01, 0.86] for cg00694520 to 0.54 [0.11, 1.56] for cg05023582.

Conclusions

Our findings suggest that DNAm may mediate the association between prenatal PM10 exposure and cognitive neurodevelopment. DYRK1A and several genes that our CpG sites mapped to, including CNKSR1, IPO13, IFNGR1, LONP2, and CDH1, are associated with biological pathways implicated in cognitive neurodevelopment and three of our identified CpG sites (cg23560546 [DAPL1], cg22572779 [C6orf218], cg15000966 [NT5C]) have been previously associated with fetal brain development. These findings are novel and add to the limited literature investigating the relationship between indoor air pollution, DNAm, and neurodevelopment, particularly in low- and middle-income country settings and non-white populations.

Introduction

The detrimental effects of air pollution on pregnancy outcomes such as low birth weight and respiratory disease in infants are well-known and have been confirmed by many studies over the last several decades [1, 2]. However, there is limited literature on the impact of prenatal air pollution exposure on neurodevelopmental outcomes, and even less work on the biological mechanisms underpinning these associations. A handful of studies have reported significant associations between prenatal air pollution exposure and neurological conditions such as autism spectrum disorder (ASD), attention deficit hyperactivity disorder (ADHD), and more general neurodevelopmental delays [3,4,5,6,7]. However, the bulk of these studies have been conducted in high income country (HIC) contexts and have focused on the effects of outdoor air pollution; therefore, findings may not be wholly generalizable to other settings [8, 9]. It is equally important to address the impact of indoor air pollution, particularly in low- and middle-income country (LMIC) settings where burning fuels such as coal, paraffin, or wood for cooking or heating indoors is common. In such settings, fuel burning can greatly increase indoor air pollution concentration and its impact on neurodevelopment and other health outcomes [5, 10, 11].

Epigenetic modification has long been considered as a key missing link to understanding how gene-environment interactions affect neurodevelopment [12, 13]. As such, careful dissection of the relationship between air pollution, epigenetic modification, and neurological outcomes may allow us to better understand the complex mechanisms underlying the impact of environmental risk factors on neuropsychiatric disorders and neurodevelopment. With the rise of high-throughput genomics, the field of epigenetics has undergone rapid development. Epigenetic modification, specifically DNA methylation (DNAm), has been linked to a number of neuropsychiatric outcomes such as severe neurodevelopmental delay, schizophrenia, ASD, and ADHD [14,15,16,17,18,19,20]. DNAm is known to play a key role in embryonic development and has been hypothesized to impact neural stem cell differentiation and maintenance [21], thereby affecting neuropsychiatric outcomes throughout the life course. DNAm is potentially reversible and identification of differentially methylated CpG sites may be useful in multiple contexts, including clinical therapy design and biomarker identification [22].

DNAm levels are altered by a number of environmental exposures such as drugs, nutrition, stress, and air pollution [8, 9, 23, 24]. Data analyzed as part of the Pregnancy and Child Epigenetics (PACE) consortium showed the effects of prenatal exposure to nitrogen dioxide (NO2), airborne particulate matter with a diameter of 10 microns or less (PM10), and airborne particulate matter with a diameter of 2.5 microns or less (PM2.5) on newborn and childhood DNAm [8, 9]. Prenatal exposure to each of these pollutants has been associated with differential DNAm in neonates, highlighting the need for additional research to understand how environment-driven epigenetic changes impact fetal development and downstream health outcomes [8, 9]. While there is evidence of an association between air pollution and DNAm as well as between DNAm and neurodevelopment, few studies have examined the interconnections between them.

To the best of our knowledge, only one study has examined the relationship between prenatal indoor air pollution exposure (PM10), DNAm, and neurodevelopment in a mediation analysis [5]. The study was conducted in the Drakenstein Child Health study (DCHS) and investigated deviations of epigenetic gestational age from chronological gestational age (ΔGA) as a potential mediator of the association between indoor air pollution and adverse neurodevelopment. However, this previous study did not find evidence of mediation by ΔGA, leading us to take a more granular approach to understand the role of DNAm as a potential mediator of the association between prenatal indoor air pollution and neurodevelopment [5].

In this study, we aimed to identify any differentially methylated CpG sites and gene regions that mediate the association between prenatal exposure to indoor PM10 and neurodevelopment measured at 2 years of age in the DCHS using a combination of high-dimensional mediation analysis methods (HIMA, DACT, and gHMA) and traditional causal mediation analysis [25,26,27].

Materials and methods

Study population

The DCHS is a South African, population-based birth cohort that enrolled 1137 pregnant women with 1143 livebirths from two primary health care clinics in peri-urban communities: Mbekweni and TC Newman. Mothers were recruited during the second trimester of pregnancy and followed throughout the pregnancy. These clinics serve two demographically distinct populations, specifically a majority Black African ancestry community and a majority mixed ancestry community [28]. The DCHS has followed infants from birth until at least 5 years of age [28]. The current study population is composed of 142 mother–child pairs with measures available for cord blood DNA methylation, genotype data, and Bayley Scores of Infant and Toddler Development, 3rd edition (BSID-III) in at least one of the following domains: cognitive development, general adaptive behavior, language, and motor outcomes. Inclusion was also limited to mother–child pairs with measures available for relevant covariates which included principal components (PCs) from genome-wide genotype data, maternal age, maternal smoking, maternal alcohol use, birth weight, sex, and socioeconomic status (SES) (Table 1). Smoking status was determined by maternal urine cotinine levels collected prenatally, while alcohol use was measured via the Alcohol, Smoking and Substance Involvement Screening Test (ASSIST), a tool introduced by the World Health Organization (WHO) and which has shown good validity in LMIC settings [29]. Socioeconomic status was captured as a validated score comprising of four socioeconomic indicators: maternal educational attainment, employment status, household income and assets. [29].

Table 1 Study characteristics of the DCHS participants

The DCHS staff obtain written consent from mothers on an annual basis and the study was approved by the Ethics Committee of the Faculty of Health Sciences, University of Cape Town, by Stellenbosch University and the Western Cape Provincial Research committee [28].

DNA methylation measurements

As described previously [14], DNA was measured from cord blood collected at time of delivery [30]. DNA methylation measures were obtained with both the Illumina Infinium HumanMethylation450 BeadChips (n = 156) and the MethylationEPIC BeadChips (n = 160). Pre-processing and statistics were done using R 3.5.1 and raw iDat files were imported into Rstudio where intensity values were converted into beta values. The 450 K and EPIC datasets were merged using the minfi R package [31]. Background subtraction, color correction and normalization were performed using the preprocessFunnorm function [32]. Following sample and probe filtering, 273 samples and 409,033 probes remained for downstream analysis. Of these samples, 142 had genotype data, at least one BSID-III score measured at 2 years of age, and data available for all relevant covariates (Table 1). Batch effects were removed using ComBat from the R package sva [33]. Cord blood cell type composition was predicted using the most recent cord blood reference data set [34].

Neurodevelopment measurements

The BSID-III is a widely used tool for assessing neurodevelopment in children up to 42 months of age. We included scores from across four distinct domains: cognitive development, language skills, motor function, and adaptive behavior [29]. The BSID-III has been validated in LMIC settings and previous research reports its use in the DCHS specifically [29]. As described previously [29], the DCHS assessed neurodevelopment using BSID-III at 2 years of age. The DCHS BSID-III assessment was conducted by a trained professional and incorporates direct observation of the child as well as caregiver input. Composite scores for cognitive, motor, language, and general adaptive behavior domains were scaled to have a mean of 100 and standard deviation of 15 as per standard use of the tool.

Assessment of indoor air pollution exposure

As described previously [35,36,37], PM10 was measured using a personal air sampling pump (AirChek 52; SKC, Eighty Four, PA, USA), connected to a styrene filter cassette (37 mm cassette blank; SKC) with a gravimetrically pre-weighted filter (PVC filter 37 mm × 5 μm with support pad; SKC) left in the home for 24 h during the 2nd trimester of pregnancy [35, 36]. Filters were weighed after sampling and the National Institute for Occupational Safety and Health method 0600 was used to calculate an average PM10 concentration over 24 h [37, 38]. These 24-h average PM10 measurements were used for our analyses.

Statistical analysis

We used different high-dimensional mediation analysis approaches to investigate whether differential DNAm mediates the association between prenatal indoor air pollution exposure and neurodevelopmental delay. Previous research using this subsample of the DCHS has found a significant total effect of PM10 on neurodevelopment in the cognitive domain, but not in the BSID-III general adaptive behavior, language, or motor function domains [5]. Therefore, we treated the BSID-III cognitive domain as our primary outcome and the remaining BSID-III neurodevelopment domains were used as secondary outcomes to evaluate consistency of results across domains.

Mediation analyses rely on the following three assumptions: (1) no exposure (PM10)—mediator (DNAm) confounding, (2) no mediator (DNAm)—outcome (BSID-III Score) confounding and (3) no exposure (PM10) -outcome (BSID-III Score) confounding [39]. To fulfill these assumptions to the best of our knowledge, we constructed three directed acyclic graphs (DAGs) to visualize each of these paths (Fig. 1). Confounders were selected based on existing literature [40, 41] and a minimal sufficient adjustment set was identified for each path. Exposure-mediator models were adjusted for SES score, genetic ancestry, and maternal smoking and mediator-outcome models were adjusted for maternal alcohol use, maternal age, SES score, child sex, genetic ancestry, and maternal smoking. We adjusted for genetic ancestry by including the first five genotype PCs to account for population stratification [14]. Models were also adjusted for the first three cell type principal components (PCs), which explained > 90% of cell type heterogeneity [14, 42, 43]. Birth weight as a proxy for gestational age was recognized as another possible mediator of the exposure-mediator association and is a possible mechanism through which prenatal indoor air pollution exposure could impact DNAm; therefore, we did not control for birth weight in our analyses.

Fig. 1
figure 1

Directed acyclic graphs (DAGs) for exposure-mediator (A), mediator-outcome (B), and exposure-outcome (C) associations. Above are DAGs for each causal pathway relevant to our mediation analysis

To assess the role of DNAm as a potential mediator of the association between prenatal exposure to PM10 and neurodevelopment at age 2 years, we employed three well-documented methods for high-dimensional mediation analysis: HIMA (high-dimensional mediation analysis), DACT (divide-aggregate composite null), and gHMA (gene-based high-dimensional mediation analysis) [25,26,27]. As these methods are novel, there is no consensus as to which is the gold standard [44]. Therefore, we have incorporated an analysis pipeline based on a combination of these methods with traditional causal mediation analysis in order to assess the robustness of results (Additional file 1: Figure S1). It should be noted that all models, used both for high-dimensional mediation analysis and causal mediation analysis, were adjusted for the appropriate confounders as defined above.

HIMA, a high-dimensional mediation analysis method introduced by Zhang et al. [25], employs a dimension reduction technique followed by minimax concave penalty (MCP)-penalized estimation of mediation effects and joint significance testing for mediation effects in order to identify significant mediators. Dimension reduction is performed using the sure independence screening (SIS) method which is built on a correlation learning framework that filters out features that are weakly correlated with the response variable [45]. The HIMA joint significance test rejects the null hypothesis of no mediation only when both the exposure-mediator (\(\alpha\)) and mediator-outcome effects (\(\beta\)) are significant [25].

DACT leverages epigenome-wide multiple testing to estimate the proportions of the composite null hypothesis to improve power [26, 44]. A preliminary step for the DACT method is to create two linear models pertaining to each CpG site: the exposure-mediator association (\(\alpha\)) and the mediator-outcome association (\(\beta\)). While HIMA uses a screening technique to reduce dimensionality, DACT does not involve a screening step by default. We performed a pre-screen of CpG sites based on their association with the exposure (PM10) and the outcome (BSID-III scores of neurodevelopment); only CpG sites with p < 0.05 for both associations were included in the downstream analyses (Additional file 1: Figure S1). Given previous findings indicating a negative association between prenatal indoor air pollution exposure and neurodevelopment in the DCHS cohort [5], we chose to additionally filter our sites by only allowing a negative natural indirect effect (NIE) defined by \(\alpha *\beta\) (acting in the same direction as the association between indoor air pollution exposure and neurodevelopment). As we chose to pre-screen our CpG sites, we used the Efron correction feature of the DACT package to estimate the proportions of the composite null (Additional file 1: Figure S2) as opposed to Jin and Cai correction which is recommended if performing epigenome-wide mediation effect testing with DACT [26]. Of note, neither of these methods can completely account for correlation between mediators. As such, we calculated Pearson correlation coefficients for all sets of identified CpG sites to better understand if and how these CpG sites were correlated with one another.

The gHMA method was developed by Fang et al. (2020) [27] and focuses on genes as functional units as opposed to individual CpG sites. gHMA is composed of three primary components: (1) linear mediation analysis, (2) nonlinear mediation analysis and (3) an omnibus test of mediation effects. Significance testing results for both the linear and nonlinear mediation analysis steps are combined using the gHMA omnibus (gHMA-O) test. As the true relationship between mediators and outcomes are often not well understood in practice, gHMA-O transforms and combines p-values from the linear and nonlinear analyses in order to construct the gHMA-O test statistic, which is used to assess mediation effects at the gene level [27]. CpG sites were annotated by closest gene using the Bioconductor package hiAnnotator (https://bioconductor.org/packages/release/bioc/html/hiAnnotator.html) and the Ensembl gene predictions (ensGene, version of Apr-06-2014; http://hgdownload.soe.ucsc.edu/goldenPath/.

hg19/database/ensGene.txt.gz) as previously described elsewhere [46].

p-values for CpG sites which passed the screening step and were tested using DACT and HIMA were corrected for multiple testing using the Benjamini–Hochberg false discovery rate correction (BH FDR) [47]. Due to the fact that gHMA assesses differentially methylated gene regions as opposed to individual CpG sites, gHMA p-values were FDR corrected for the total number of gene regions tested, rather than the number of distinct CpG sites. HIMA and DACT CpG sites that remained significant at a false discovery rate of 0.05 were then validated via traditional causal mediation analysis using the function mediate from the R package mediation to obtain estimates of natural indirect effect (NIE), direct effect (DE), total effect (TE), and proportion mediated (PM) [48].

A recent DCHS study identified three CpG sites (cg26971411 [SPTBN4], cg00490349 [intergenic], cg15660740 [intergenic]) associated with neurodevelopment measured by BSID-III [14]. We also examined these CpG sites in our causal mediation analysis step to investigate whether they mediate the association between prenatal PM10 exposure and neurodevelopment.

Maternal HIV has been previously linked to neurodevelopment; however, the association between maternal HIV and DNAm is not well understood [49, 50]. Therefore, we also conducted a sensitivity analysis to determine the effect of including maternal HIV as a potential confounder of the mediator-outcome association.

Results

Description of study participants

The study sample consisted of 142 mother–child pairs with data available for genotype, cord blood methylation, PM10 concentration, scores for one or more BSID-III domains, and all relevant covariates (Table 1). In total, 48.6% of infants were of self-reported Black African ancestry and 51.4% were of self-reported mixed ancestry; 40.1% of infants were female. The mean PM10 concentration was 64.5 μg/m3 (SD = 96.8 μg/m3). Mean composite BSID-III scores were 85.14 (SD = 8.65) for the cognitive domain, 84.31 for the language domain (SD = 12.22), 94.04 for the motor function domain (SD = 13.74), and 83.72 for the general adaptive behavior domain (SD = 13.29). The prevalence of maternal alcohol use was 19.7% and the prevalence of maternal smoking was high, with 40.1% of mothers classed as passive smokers and 33.1% classed as active smokers based on urine cotinine levels. The prevalence of maternal HIV infection was also high with 21.1% of mothers with a confirmed HIV diagnosis.

CpG-based high-dimensional mediation analysis

After BH FDR adjustment for multiple testing, DACT identified a total of 123 distinct CpG sites across the cognitive (35 CpG sites, primary outcome), language (45 CpG sites), motor function (13 CpG sites), and general adaptive behavior (39 CpG sites) domains as significant mediators of the association between PM10 and neurodevelopment (Additional file 2: Tables S1–S4). A total of 9 CpG sites were shared between at least two domains (Additional file 2: Table S5) and one CpG site (cg26858414 [CDSN]) was shared across the language, general adaptive behavior, and motor function domains. These 123 CpG sites were further examined via causal mediation analysis. Results for our primary outcome (cognitive development) are presented here (Table 2; Fig. 2) and results for our secondary outcomes, for which we did not find a total effect of PM10 are presented in Additional file 2: Tables S6–S9. The number of probes (DACT and HIMA) and genes (gHMA) present at each step in the analysis pipeline can be found in Additional file 2: Tables S27–S30.

Table 2 High-dimensional mediation analysis for the association between PM10 (exposure), DNAm (mediator) and neurodevelopment (outcome) using DACT for CpG sites identified via formal causal mediation analysis with a significant indirect effect and total effect
Fig. 2
figure 2

Causal mediation analysis for the association between PM10 (exposure) and cognitive neurodevelopment (outcome) using CpG sites identified with high-dimensional mediation analysis methods. This figure presents estimates for indirect effect (IDE), direct effect (DE), total effect (TE), and proportion mediated (PM) for CpG sites with both significant IDE and TE. Significant total effects were found only for the cognitive neurodevelopmental domain (no significant total effects were identified for general adaptive behavior, language, or motor neurodevelopmental domains). IDE, DE, and TE effect estimates have been multiplied by the PM10 interquartile range (IQR) (58.78 µg/m3) observed in this cohort and thus these effect estimates represent the estimated effects on BSID-III cognitive score per one IQR increase in prenatal PM10 exposure

Of the 35 CpG sites identified with DACT for the cognitive domain, 29 demonstrated significant natural indirect effects (NIE), significant estimates for proportion mediated, and significant estimates for total effect (TE). Two of these CpG sites (cg13690126 [CNKSR1] and cg03234186 [ZNF154]) were also identified via DACT for the language domain (Table 2; Fig. 2). All effect estimates were multiplied by the interquartile range (IQR) of PM10 observed in this cohort (58.78 μg/m3) and therefore represent estimated effects per one IQR increase in PM10. Estimated proportion mediated (95%-confidence interval) ranged from 0.29 [0.014, 0.87] for cg00694520 [KCNE4] to 0.54 [0.11, 1.56] for cg05023582 [PRRG2]. Cg05023582 also showed the largest NIE estimate (95%-confidence interval) of − 0.49 [− 0.959, − 0.146] per one IQR increase in PM10 (Fig. 2).

After correction for multiple testing, HIMA did not identify any CpG sites that significantly mediated the effects of prenatal PM10 exposure on neurodevelopment in any domain (Additional file 2: Tables S10–S13). However, prior to multiple testing correction, one CpG site was identified as a significant mediator (cg05796992 [LRRK1]); this site was also identified with DACT and demonstrated a significant NIE estimate (95%-confidence interval) of − 0.438 (− 0.942, − 0.0426) per one IQR increase in PM10 and an estimated proportion mediated (95%-confidence interval) of 0.484 (0.0468, 1.58) in our causal mediation analysis (Fig. 2, Additional file 2: Tables S6, S14–S15).

We also conducted a sensitivity analysis to examine the effects of adjusting for maternal HIV status. In this sensitivity analysis, 27 of our 29 CpG sites remained significant at a threshold of 0.05 after adjusting for maternal HIV status with only cg16975959 [intergenic] and cg15074838 [HLA-DRA] losing significance after adjustment (Additional file 2: Tables S16–S19).

Additionally, we tested the mediation effects of three CpG sites that were identified in a recent DCHS study in association with severe neurodevelopmental delay in the cognitive (cg26971411 [SPTBN4], cg00490349 [intergenic], and cg15660740 [intergenic]), language (cg26971411 [SPTBN4] & cg00490349 [intergenic]), and motor (cg26971411 [SPTBN4] & cg00490349 [intergenic]) domains using causal mediation analysis [14]. We did not observe significant evidence of mediation in any corresponding BSID-III domain for these CpG sites (Additional file 2: Table S25). This discrepancy may be explained by a lack of association between these three CpG sites and prenatal exposure to PM10.

Gene-based high-dimensional mediation analysis

Differential methylation in four gene regions (GOPC, RP11-74K11.1, RNMT, and DYRK1A) was identified as a significant mediator of the association between PM10 and cognitive development (Table 3). Each of these differentially methylated gene regions were also identified for the other domains (Table 3). No CpG sites mapping to any of these four genes were identified for the cognitive domain with either of the CpG site-based methods (DACT or HIMA) (Additional file 2: Tables S20-S23).

Table 3 Gene-based high-dimensional mediation analysis for the association between PM10 (exposure), DNAm (mediator) and neurodevelopment (outcome) using gHMA

Additionally, we identified five differentially methylated genes (DCAF13, TNN, TAL1, AC011648.1, SPINK2) as significant mediators for the secondary outcome “motor domain”; however, none of these were found to be significant for the other domains (Table 3).

Lastly, we conducted a sensitivity analysis to examine the effects of adjusting for maternal HIV status on our results. After adjusting for maternal HIV status in our gHMA models, we did not identify any differentially methylated genes as significant mediators using a BH FDR threshold of 0.05 (Additional file 2: Table S24).

Discussion

This study of 142 mother–child pairs from a low SES population in South Africa found a total of 29 distinct, differentially methylated DNAm probes to significantly mediate the effect of prenatal exposure to PM10 on neurodevelopment at age 2 years measured by BSID-III scores. Additionally, we found four differentially methylated gene regions which significantly mediate the effect of prenatal PM10 exposure on neurodevelopment using a gene-based high-dimensional mediation analysis technique. To our knowledge, this study is the first to examine differential DNAm at individual probes as potential mediators of the association between prenatal PM10 exposure and neurodevelopment.

A number of prior studies have examined the association between prenatal PM10 exposure and DNAm as well as the association between DNAm and neurodevelopment [8, 9, 14,15,16,17,18,19,20]. While many epigenome-wide association studies (EWAS) have reported differentially methylated CpG sites associated with prenatal air pollution exposure [8, 9, 51,52,53], we did not identify any overlap between our findings and existing findings. It should be noted that our findings are not entirely comparable as, per the underlying assumptions of mediation analysis, probes must be associated with both PM10 exposure and neurodevelopment. Replication is a common problem in EWAS which often lack robust associations at single CpG sites across cohorts [51]. Further research is needed to validate our findings from high-dimensional mediation analysis.

A recent meta-analysis examining epigenome-wide associations between DNAm at birth and childhood cognitive skills synthesizing data from eight pregnancy cohorts within the Pregnancy and Childhood Epigenetics (PACE) consortium (N = 3300) did not find substantial evidence that differential cord blood DNAm at individual CpG sites is associated with cognitive skills [19]. We compared our findings to those from several EWAS investigating DNAm and cognitive development examined in the PACE study. However, no overlap was identified between our findings and those of previous studies [19, 54]. Potential explanations for these discrepancies are (1) that we focused on mediation effects of DNAm instead of the direct association between DNAm and cognitive development; (2) differences in the adjustment of multiple testing (we corrected for multiple testing using the BH FDR after an initial pre-filtering process, whereas the PACE meta-analysis used the more conservative Bonferroni threshold on an epigenome-wide scale).

CpG-based high-dimensional mediation analysis

We identified 29 differentially methylated CpG sites as significant mediators of the association between prenatal PM10 exposure and neurodevelopment in the cognitive domain. Of the 29 CpG sites, differential DNAm at 21 of these CpG sites has been associated with age in EWAS examining DNAm trajectories occurring over the course of childhood [55, 56]. Differential methylation at three CpG sites (cg23560546 [DAPL1], cg22572779 [C6orf218], cg15000966 [NT5C]) has been associated with fetal brain development [57]. Differential methylation at one CpG site (cg16975959 [intergenic]) has been previously identified as a mediator of the association between maternal smoking and birth weight [58] (Additional file 2: Table S16).

Proportion-mediated estimates for these 29 CpG sites vary, ranging from cg00694520 [KCNE4] with 0.29 (0.014, 0.87) to cg05023582 [PRRG2] with 0.54 (0.11, 1.56). However, such high PM estimates for each CpG site should be interpreted with caution due to the associated wide confidence intervals, our small sample size, and the fact that causal mediation analysis does not account for correlation between mediators, which we found to be present among these CpG sites (Additional file 1: Figure S3). Several of these CpG sites are located within or adjacent to genes known to influence fetal development and/or neurological outcomes. Herein we discuss CpG sites that map to genes that have been previously linked to neuropsychiatric outcomes.

Cg13690126 is located in CNKSR1, a protein-coding gene with low tissue expression specificity. Defects in CNKSR1 have been linked to syndromic autosomal recessive intellectual disability (ID) [59, 60]. Najmabadi et al. [61] speculate its function as a scaffold protein mediating communication between Ras and Rho GTPase signaling pathways which have in turn been shown to play a role in neurodevelopmental disorders [61, 62]. Cg07070893 is located in a promotor region for Importin 13 (IPO13), a gene showing tissue enhanced specificity in the brain and skeletal muscle [59, 63]. IPO13 is associated with agenesis of the corpus collosum and has been implicated in embryonic stem cell survival. IPO13 has been proposed as integral to brain development, particularly for the purposes of neural cell-specific cargo trafficking [59, 64, 65].

Cg23560546 is found in an enhancer region of Death Associated Protein Like 1 (DAPL1), a protein-coding gene thought to be involved in early stages of epithelial differentiation and/or apoptosis [59]. DAPL1 has been identified as a significantly differentially methylated region (DMR) in a 2021 study comparing DNAm in peripheral blood cells of toddlers with Down syndrome to neurotypical toddlers [66]. Cg15007548 is located in the gene body of Tetratricopeptide Repeat, Ankyrin Repeat and Coiled-Coil Domain-Containing 1 (TANC1) [59]. TANC1 is a protein-coding gene with low tissue specificity thought to regulate dendritic spines and excitatory synapses [59, 63, 67]. Dendritic spines are integral to synaptic function and loss of function in dendritic spines has been associated with a number of neurological disorders [68]. Cg26668632 is located in a promoter region for IFNGR1 which belongs to the type II cytokine receptor family and encodes a ligand-binding chain of the gamma interferon receptor (IFN-\(\gamma\)) [59]. Several studies have found that IFN-\(\gamma\) signaling targets play a role in neuronal development and synaptic activity and a recent study [69] suggests that IFN-\(\gamma\) signaling is involved in neurodevelopmental disorder etiology [69,70,71]. Cg23989635 is located in the first exon of Cadherin 1 (CDH1), a protein-coding gene that has been implicated in neuronal differentiation and synaptic development in the central nervous system [59, 63, 72,73,74]. CDH1 downregulation has been proposed to play a role in congenital neurodevelopmental disorders [75].

Cg0060655 occupies the same position as SNP rs147829886 and is found within an intron on the gene FAM207A/SLX9 ribosome biogenesis factor. FAM207A hypermethylation in umbilical cord tissue has been linked to pre-term birth, which in turn is associated with delayed neurodevelopment [59, 76,77,78]. Cg23054321 is located in a promotor-associated region proximal to Leucine Rich Repeat and Ig Domain-Containing 3 (LINGO3). LINGO3 is protein-coding gene with tissue enriched in the brain [59, 63]. The LINGO gene family has been found to show increased expression as an embryo develops whereas only low levels of these genes are found in adult brains with only LINGO1 and LINGO3 being detectable [79]. Epigenetic changes in LINGO3 have been correlated with depression and a paralog to LINGO3, LINGO1, acts as a negative regulator of a number of processes key to cognitive function [80]. The genes associated with the remaining CpG sites do not appear to be as well-represented in neuropathological and neurodevelopmental literature. Additional research is needed to elucidate the roles of each of these differentially methylated sites on neurodevelopment.

Gene-based high-dimensional mediation analysis

gHMA identified four differentially methylated gene regions associated with BSID-III neurodevelopmental scores (GOPC, RP11-74K11.1, DYRK1A, RNMT; Table 3). Golgi-Associated PDZ and Coiled-Coil Motif-Containing Protein (GOPC) is a protein-coding gene that has been linked to regulation of GRID2 gene expression which has been shown to impact neurodegeneration [59, 81]. RP11-74K11.1 is a pseudogene on chromosome 12 which is most highly expressed in the brain, particularly the cerebellum. Few studies have investigated RP11-74K11.1, [59, 63, 82]; and therefore additional research is needed to understand its role on neurodevelopment. Dual Specificity Tyrosine Phosphorylation Regulated Kinase 1A (DYRK1A) is located on chromosome 21 and encodes a protein kinase. DYRK1A has been strongly linked to brain development and function across the life course [63, 83]. Decreased expression of DYRK1A has been found in patients with autism spectrum disorder (ASD), while elevated expression has been linked to Down syndrome (DS) [83]. Lastly, RNA Guanine-7 Methyltransferase (RNMT) is a protein-coding gene found on chromosome 18; it is known to play a role in RNA-binding and mRNA-methyltransferase activity [59]. The role of RNMT in neurodevelopment is not well-documented and further research is needed to better understand if such a link exists.

Although no individual CpG sites located in these gene regions were identified using DACT or HIMA, it is possible that differential methylation on the gene region scale plays a crucial role in neurodevelopment. Probes contained within or proximal to gene regions identified as significant mediators using the gHMA method were eliminated from the DACT analysis pipeline due to lack of significant exposure-mediator and/or mediator-outcome associations (Additional file 2: Tables S16–S19). The discrepancy between differentially methylated gene regions identified with gHMA and gene regions associated with CpG sites identified with DACT may be attributable to interaction effects between proximal, differentially methylated DNAm probes that were not captured via DNAm probe-specific mediation analysis. As both DACT and HIMA do not account for interaction between mediators, these methods may be unable to detect such interacting CpG sites on an individual scale [25, 26].

Our sensitivity analysis, which evaluated the effect of including maternal HIV status as a potential confounder of the mediator-outcome association, showed major differences between gHMA findings with and without adjustment for maternal HIV status. Following adjustment for maternal HIV status, we were unable to identify any differentially methylated gene regions as significant mediators of the association between prenatal PM10 exposure and neurodevelopment in any domain (Additional file 2: Table S24). However, in our CpG-based analyses, adjustment for HIV did not greatly alter our findings as both the magnitude and direction of estimated IDE remained consistent (Additional file 2: Table S24). Although maternal HIV status has been linked to neurodevelopmental delay in several studies [49, 50], existing literature is sparse regarding the role of maternal HIV status in DNAm pathways and additional research is needed to better understand if and how maternal HIV status impacts the association between DNAm and neurodevelopment.

Strengths and limitations

This study has many strengths. It adds to the limited literature dedicated to investigating epigenetic modification and associated outcomes in LMIC settings, particularly related to air pollution exposure. Additionally, the mothers and infants involved in the DCHS are of majority Black African or mixed ancestry, two populations underrepresented in epigenetic and genetic literature at large. To account for population stratification, which may play a role in DNAm variation, our study incorporated genome-wide genotype data which is the preferred approach to account for genetic ancestry. To our knowledge, this is the first study to investigate DNAm as a mediator of the association between prenatal indoor air pollution exposure and neurodevelopment. A unique feature of this study is our use of three different high-dimensional mediation analysis techniques. As no single high-dimensional mediation analysis method has been deemed the gold standard, we employed several methods of high-dimensional mediation analysis in order to compare. Although we did not see good concordance between the methods (Table 2; Additional file 2: Tables S14–S15), we recognize that the probe-specific methods (HIMA and DACT) are differentially powered. A key advantage of the DACT method is that it is better powered than the joint significance test used in HIMA, which tends to be overly conservative [26]. However, additional research is needed to validate these methods and to better understand why findings from these three mediation methods did not demonstrate substantial overlap.

There are several limitations. Our analyses were constrained by a small sample size (N = 142), which may have limited the statistical power to detect mediation effects in neurodevelopment across domains. Our sample size may have also limited statistical power to detect underlying associations (total effects) between prenatal PM10 exposure and neurodevelopment. Secondly, as DNAm signatures are tissue and cell type specific, our findings are limited in that we investigated DNAm in cord blood and not brain tissue. Although brain tissue DNAm data would have been more appropriate for a study of neurodevelopment, cord blood is far more feasible to collect from living study participants.

Conclusion

Differential DNAm was found to significantly mediate the association between prenatal exposure to PM10 and neurodevelopment as measured by BSID-III at 2 years of age in the DCHS. Twenty-nine differentially methylated CpG sites as well as four differentially methylated gene regions were identified as significant mediators of this association in the DCHS cohort. Due to our small sample size and the general lack of consensus on a gold standard high-dimensional mediation analysis tool in the scientific community, this study should be regarded as a preliminary investigation. Nevertheless, these findings are novel and encourage further research to replicate and expand these results so as to better understand how DNAm and other biological pathways help explain the impact of air pollution exposure on neurodevelopment.

Availability of data and materials

Not applicable.

References

  1. Korten I, Ramsey K, Latzin P. Air pollution during pregnancy and lung development in the child. Paediatr Respir Rev. 2017;21:38–46. https://doi.org/10.1016/j.prrv.2016.08.008.

    Article  PubMed  Google Scholar 

  2. Pope DP, Mishra V, Thompson L, Siddiqui AR, Rehfuess EA, Weber M, et al. Risk of low birth weight and stillbirth associated with indoor air pollution from solid fuel use in developing countries. Epidemiol Rev. 2010;32:70–81. https://doi.org/10.1093/epirev/mxq005.

    Article  PubMed  Google Scholar 

  3. Su X, Zhang S, Lin Q, Wu Y, Yang Y, Yu H, et al. Prenatal exposure to air pollution and neurodevelopmental delay in children: a birth cohort study in Foshan, China. Sci Total Environ. 2021. https://doi.org/10.1016/j.scitotenv.2021.151658.

    Article  PubMed  Google Scholar 

  4. Kerin T, Volk H, Li W, Lurmann F, Eckel S, McConnell R, et al. Association between air pollution exposure, cognitive and adaptive function, and ASD severity among children with autism spectrum disorder. J Autism Dev Disord. 2018;48:137–50. https://doi.org/10.1007/s10803-017-3304-0.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Christensen GM, Rowcliffe C, Chen J, Vanker A, Koen N, Jones MJ, Gladish N, Hoffman N, Donald K, Wedderburn CJ, Kobor MS, Zar H, Stein DJHA, Hüls A. In-utero exposure to indoor air pollution or tobacco smoke and cognitive development in a South African birth cohort study. Sci Total Environ. 2022;834:155394. https://doi.org/10.1016/j.scitotenv.2022.155394.

    Article  CAS  PubMed  Google Scholar 

  6. Talbott EO, Arena VC, Rager JR, Clougherty JE, Michanowicz DR, Sharma RK, et al. Fine particulate matter and the risk of autism spectrum disorder. Environ Res. 2015;140:414–20. https://doi.org/10.1016/j.envres.2015.04.021.

    Article  CAS  PubMed  Google Scholar 

  7. Kalkbrenner AE, Windham GC, Serre ML, Akita Y, Wang X, Hoffman K, et al. Particulate matter exposure, prenatal and postnatal windows of susceptibility, and autism spectrum disorders. Epidemiology. 2015;26:30–42. https://doi.org/10.1097/EDE.0000000000000173.

    Article  PubMed  Google Scholar 

  8. Gruzieva O, Xu CJ, Breton CV, Annesi-Maesano I, Antó JM, Auffray C, et al. Epigenome-wide meta-analysis of methylation in children related to prenatal NO2 air pollution exposure. Environ Health Perspect. 2017;125:104–10. https://doi.org/10.1289/EHP36.

    Article  CAS  PubMed  Google Scholar 

  9. Gruzieva O, Xu CJ, Yousefi P, Relton C, Merid SK, Breton CV, et al. Prenatal particulate air pollution and DNA methylation in newborns: an epigenome-wide meta-analysis. Environ Health Perspect. 2019. https://doi.org/10.1289/EHP4522.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Jafta N, Barregard L, Jeena PM, Naidoo RN. Indoor air quality of low and middle income urban households in Durban. South Afr Environ Res. 2017;156:47–56. https://doi.org/10.1016/j.envres.2017.03.008.

    Article  CAS  Google Scholar 

  11. Martin WJ, Glass RI, Araj H, Balbus J, Collins FS, Curtis S, et al. Household air pollution in low- and middle-income countries: health risks and research priorities. PLoS Med. 2013;10:1–8. https://doi.org/10.1371/journal.pmed.1001455.

    Article  Google Scholar 

  12. Tran NQV, Miyake K. Neurodevelopmental disorders and environmental toxicants: epigenetics as an underlying mechanism. Int J Genomics. 2017. https://doi.org/10.1155/2017/7526592.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Jaenisch R, Bird A. Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat Genet. 2003;33:245–54. https://doi.org/10.1038/ng1089.

    Article  CAS  PubMed  Google Scholar 

  14. Hüls A, Wedderburn CJ, Groenewold NA, Gladish N, Jones M, Koen N, et al. Newborn differential DNA methylation and subcortical brain volumes as early signs of severe neurodevelopmental delay in a South African birth cohort study. World J Biol Psychiatry. 2022;0:1–31. https://doi.org/10.1080/15622975.2021.2016955.

    Article  CAS  Google Scholar 

  15. Grayson DR, Guidotti A. The dynamics of DNA methylation in schizophrenia and related psychiatric disorders. Neuropsychopharmacology. 2013;38:138–66. https://doi.org/10.1038/npp.2012.125.

    Article  CAS  PubMed  Google Scholar 

  16. van Mil NH, Steegers-Theunissen RPM, Bouwland-Both MI, Verbiest MMPJ, Rijlaarsdam J, Hofman A, et al. DNA methylation profiles at birth and child ADHD symptoms. J Psychiatr Res. 2014;49:51–9. https://doi.org/10.1016/j.jpsychires.2013.10.017.

    Article  PubMed  Google Scholar 

  17. Walton E, Pingault JB, Cecil CAM, Gaunt TR, Relton CL, Mill J, et al. Epigenetic profiling of ADHD symptoms trajectories: a prospective, methylome-wide study. Mol Psychiatry. 2017;22:250–6. https://doi.org/10.1038/mp.2016.85.

    Article  CAS  PubMed  Google Scholar 

  18. Mordaunt CE, Jianu JM, Laufer BI, Zhu Y, Hwang H, Dunaway KW, et al. Cord blood DNA methylome in newborns later diagnosed with autism spectrum disorder reflects early dysregulation of neurodevelopmental and X-linked genes. Genome Med. 2020;12:1–25. https://doi.org/10.1186/s13073-020-00785-8.

    Article  CAS  Google Scholar 

  19. Caramaschi D, Neumann A, Cardenas A, Tindula G, Alemany S, Zillich L, et al. Meta-analysis of epigenome-wide associations between DNA methylation at birth and childhood cognitive skills. Mol Psychiatry. 2022. https://doi.org/10.1038/s41380-022-01441-w.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Neumann A, Walton E, Alemany S, Cecil C, González JR, Jima DD, et al. Association between DNA methylation and ADHD symptoms from birth to school age: a prospective meta-analysis. Transl Psychiatry. 2020. https://doi.org/10.1038/s41398-020-01058-z.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Podobinska M, Szablowska-Gadomska I, Augustyniak J, Sandvig I, Sandvig A, Buzanska L. Epigenetic modulation of stem cells in neurodevelopment: the role of methylation and acetylation. Front Cell Neurosci. 2017;11:1–16. https://doi.org/10.3389/fncel.2017.00023.

    Article  CAS  Google Scholar 

  22. Laird PW. The power and the promise of DNA methylation markers. Nat Rev Cancer. 2003;3:253–66. https://doi.org/10.1038/nrc1045.

    Article  CAS  PubMed  Google Scholar 

  23. Lam LL, Emberly E, Fraser HB, Neumann SM, Chen E, Miller GE, et al. Factors underlying variable DNA methylation in a human community cohort. Proc Natl Acad Sci U S A. 2012;109:17253–60. https://doi.org/10.1073/pnas.1121249109.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Olstad EW, Nordeng HME, Gervin K. Prenatal medication exposure and epigenetic outcomes: a systematic literature review and recommendations for prenatal pharmacoepigenetic studies. Epigenetics. 2021;00:1–24. https://doi.org/10.1080/15592294.2021.1903376.

    Article  Google Scholar 

  25. Zhang H, Zheng Y, Zhang Z, Gao T, Joyce B, Yoon G, et al. Genetics and population analysis: estimating and testing high-dimensional mediation effects in epigenetic studies. Bioinformatics. 2016;32:3150–4. https://doi.org/10.1093/bioinformatics/btw351.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Liu Z, Shen J, Barfield R, Schwartz J, Baccarelli AA, Lin X. Large-scale hypothesis testing for causal mediation effects with applications in genome-wide epigenetic studies. J Am Stat Assoc. 2021;0:1–39. https://doi.org/10.1080/01621459.2021.1914634.

    Article  CAS  Google Scholar 

  27. Fang R, Yang H, Gao Y, Cao H, Goode EL, Cui Y. Gene-based mediation analysis in epigenetic studies. Brief Bioinform. 2021;22:1–11. https://doi.org/10.1093/bib/bbaa113.

    Article  CAS  Google Scholar 

  28. Zar HJ, Barnett W, Myer L, Stein DJ, Nicol MP. Investigating the early-life determinants of illness in Africa: the Drakenstein child health study. Thorax. 2015;70:592–4. https://doi.org/10.1136/thoraxjnl-2014-206242.

    Article  CAS  PubMed  Google Scholar 

  29. Stein DJ, Koen N, Donald KA, Adnams CM, Koopowitz S, Lund C, Marais A, Myers B, Roos A, Sorsdahl K, Stern M, Tomlinson M, van der Westhuizen C, Vythilingum B, Myer L, Barnett W, Brittain KHZ. Investigating the psychosocial determinants of child health in Africa: the Drakenstein child health study. J Neurosci Methods. 2015. https://doi.org/10.1016/j.jneumeth.2015.03.016.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Morin AM, Gatev E, McEwen LM, Macisaac JL, Lin DTS, Koen N, et al. Maternal blood contamination of collected cord blood can be identified using DNA methylation at three CpGs. Clin Epigenet. 2017;9:1–9. https://doi.org/10.1186/s13148-017-0370-2.

    Article  CAS  Google Scholar 

  31. Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta C, Feinberg AP, Hansen KD, et al. Minfi: a flexible and comprehensive Bioconductor package for the analysis of infinium DNA methylation microarrays. Bioinformatics. 2014;30:1363–9. https://doi.org/10.1093/bioinformatics/btu049.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Fortin J-P, Labbe A, Lemire M, Zanke BW, Hudson TJ, Fertig EJ, Greenwood CMHK. Functional normalization of 450k methylation array data improves replication in large cancer studies. Genome Biol. 2014;15:503. https://doi.org/10.1186/s13059-014-0503-2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The SVA package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28:882–3. https://doi.org/10.1093/bioinformatics/bts034.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Gervin K, Salas LA, Bakulski KM, Van Zelm MC, Koestler DC, Wiencke JK, et al. Systematic evaluation and validation of reference and library selection methods for deconvolution of cord blood DNA methylation data. Clin Epigenet. 2019;11:1–15. https://doi.org/10.1186/s13148-019-0717-y.

    Article  CAS  Google Scholar 

  35. Vanker A, Barnett W, Workman L, Nduru PM, Sly PD, Gie RP, et al. Early-life exposure to indoor air pollution or tobacco smoke and lower respiratory tract illness and wheezing in African infants: a longitudinal birth cohort study. Lancet Planet Heal. 2017;1:e328–36. https://doi.org/10.1016/S2542-5196(17)30134-1.

    Article  Google Scholar 

  36. Vanker A, Barnett W, Nduru PM, Gie RP, Sly PD, Zar HJ. Home environment and indoor air pollution exposure in an African birth cohort study. Sci Total Environ. 2015;536:362–7. https://doi.org/10.1016/j.scitotenv.2015.06.136.

    Article  CAS  PubMed  Google Scholar 

  37. Hüls A, Vanker A, Gray D, Koen N, MacIsaac JL, Lin DTS, et al. Genetic susceptibility to asthma increases the vulnerability to indoor air pollution. Eur Respir J. 2020;55:1–9. https://doi.org/10.1183/13993003.01831-2019.

    Article  CAS  Google Scholar 

  38. Health NI for OS and. Particulates not otherwise regulated, Respirable 1988. www.%0Dcdc.gov/niosh/docs/2003-154/pdfs/0600.pdf.

  39. VanderWeele TJ. Mediation analysis: a practitioner’s guide. Annu Rev Public Health. 2016;37:17–32. https://doi.org/10.1146/annurev-publhealth-032315-021402.

    Article  PubMed  Google Scholar 

  40. Donald KA, Wedderburn CJ, Barnett W, Nhapi RT, Rehman AM, Stadler JAM, et al. Risk and protective factors for child development: an observational South African birth cohort. PLoS Med. 2019;16:1–20. https://doi.org/10.1371/journal.pmed.1002920.

    Article  Google Scholar 

  41. Zar HJ, Pellowski JA, Cohen S, Barnett W, Vanker A, Koen N, et al. Maternal health and birth outcomes in a South African birth cohort study. PLoS ONE. 2019;14:1–16. https://doi.org/10.1371/journal.pone.0222399.

    Article  CAS  Google Scholar 

  42. Pawlowsky-Glahn V, Egozcue JJ. Compositional data and their analysis: an introduction. Geol Soc Spec Publ. 2006;264:1–10. https://doi.org/10.1144/GSL.SP.2006.264.01.01.

    Article  CAS  Google Scholar 

  43. Drzymalla E, Gladish N, Koen N, Epstein MP, Kobor MS, Zar HJ, et al. Association between maternal depression during pregnancy and newborn DNA methylation. Transl Psychiatry. 2021;11:1–8. https://doi.org/10.1038/s41398-021-01697-w.

    Article  CAS  Google Scholar 

  44. Zeng P, Shao Z, Zhou X. Statistical methods for mediation analysis in the era of high-throughput genomics: current successes and future challenges. Comput Struct Biotechnol J. 2021;19:3209–24. https://doi.org/10.1016/j.csbj.2021.05.042.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Fan J, Lv J. Sure independence screening for ultrahigh dimensional feature space. J R Stat Soc Ser B Stat Methodol. 2008;70:849–911. https://doi.org/10.1111/j.1467-9868.2008.00674.x.

    Article  Google Scholar 

  46. Hüls A, Robins C, Conneely KN, Edgar R, De Jager PL, Bennett DA, et al. Brain DNA methylation patterns in CLDN5 associated with cognitive decline. Biol Psychiatry. 2022;91:389–98. https://doi.org/10.1016/j.biopsych.2021.01.015.

    Article  CAS  PubMed  Google Scholar 

  47. Benjamini Y, Hochberg Y. Controlling the false discovery rate : a practical and powerful approach to multiple testing author (s): Yoav Benjamini and Yosef hochberg source, J. R Stat Soc. Ser B (Methodol) 57(1): (1995 ), Publi J R Stat Soc 1995;57:289–300

  48. Tingley D, Yamamoto T, Keele L, Imai K. Mediation: R package for causal mediation analysis. R Package version 4.2.4. 2013;59.

  49. McHenry MS, McAteer CI, Oyungu E, McDonald BC, Bosma CB, Mpofu PB, et al. Neurodevelopment in young children born to HIV-infected mothers: a meta-analysis. Pediatrics. 2018. https://doi.org/10.1542/peds.2017-2888.

    Article  PubMed  Google Scholar 

  50. Wedderburn CJ, Yeung S, Rehman AM, Stadler JAM, Nhapi RT, Barnett W, et al. Neurodevelopment of HIV-exposed uninfected children in South Africa: outcomes from an observational birth cohort study. Lancet Child Adolesc Heal. 2019;3:803–13. https://doi.org/10.1016/S2352-4642(19)30250-0.

    Article  Google Scholar 

  51. Saevska E, Fiano V, Asta F, Stafoggia M, Moirano G, Popovic M, Pizzi C, et al. Prenatal exposure to PM10 and changes in DNA methylation and telomere length in cord blood. Environ Res. 2022. https://doi.org/10.1016/j.envres.2022.112717.

    Article  Google Scholar 

  52. Breton CV, Yao J, Millstein J, Gao L, Siegmund KD, Mack W. Associations with newborn LINE1 and Alu methylation and childhood blood pressure and carotid intima-media thickness in the children ’ s health study. Env Heal Perspect. 2016;124:1905–12.

    Article  CAS  Google Scholar 

  53. Ladd-Acosta C, Feinberg JI, Brown SC, Lurmann FW, Croen LA, Hertz-Picciotto I, et al. Epigenetic marks of prenatal air pollution exposure found in multiple tissues relevant for child health. Environ Int. 2019;126:363–76. https://doi.org/10.1016/j.envint.2019.02.028.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Marioni RE, McRae AF, Bressler J, Colicino E, Hannon E, Li S, et al. Meta-analysis of epigenome-wide association studies of cognitive abilities. Mol Psychiatry. 2018;23:2133–44. https://doi.org/10.1038/s41380-017-0008-y.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Mulder RH, Neumann A, Cecil CAM, Walton E, Houtepen LC, Simpkin AJ, et al. Epigenome-wide change and variation in DNA methylation in childhood: trajectories from birth to late adolescence. Hum Mol Genet. 2021;30:119–34. https://doi.org/10.1093/hmg/ddaa280.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Xu CJ, Bonder MJ, Söderhäll C, Bustamante M, Baïz N, Gehring U, et al. The emerging landscape of dynamic DNA methylation in early childhood. BMC Genomics. 2017;18:1–11. https://doi.org/10.1186/s12864-016-3452-1.

    Article  CAS  Google Scholar 

  57. Spiers H, Hannon E, Schalkwyk LC, Smith R, Wong CCY, O’Donovan MC, et al. Methylomic trajectories across human fetal brain development. Genome Res. 2015;25:338–52. https://doi.org/10.1101/gr.180273.114.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Hannon E, Schendel D, Ladd-Acosta C, Grove J, Hansen CS, Hougaard DM, et al. Variable DNA methylation in neonates mediates the association between prenatal smoking and birth weight. Philos Trans R Soc B Biol Sci. 2019. https://doi.org/10.1098/rstb.2018.0120.

    Article  Google Scholar 

  59. Safran M, Rosen N, Twik M, BarShir R, Iny Stein T, Dahary D, Fishilevich S and LD. GeneCards: the human gene database. GeneCards Suite Chapter 2022;Practical:27–56.

  60. Kazeminasab S, Taskiran II, Fattahi Z, Bazazzadegan N, Hosseini M, Rahimi M, et al. CNKSR1 gene defect can cause syndromic autosomal recessive intellectual disability. Am J Med Genet Part B Neuropsychiatr Genet. 2018;177:691–9. https://doi.org/10.1002/ajmg.b.32648.

    Article  CAS  Google Scholar 

  61. Najmabadi H, Hu H, Garshasbi M, Zemojtel T, Abedini SS, Chen W, et al. Deep sequencing reveals 50 novel genes for recessive cognitive disorders. Nature. 2011;478:57–63. https://doi.org/10.1038/nature10423.

    Article  CAS  PubMed  Google Scholar 

  62. Reichova A, Zatkova M, Bacova Z, Bakos J. Abnormalities in interactions of Rho GTPases with scaffolding proteins contribute to neurodevelopmental disorders. J Neurosci Res. 2018;96:781–8. https://doi.org/10.1002/jnr.24200.

    Article  CAS  PubMed  Google Scholar 

  63. Uhlén M, Fagerberg L, Hallström BM, Cecilia Lindskog PO, Mardinoglu A, Sivertsson Å, Kampf C, Evelina Sjöstedt AA, Olsson I, Edlund K, Emma Lundberg SN, Al-Khalili Szigyarto C, Jacob Odeberg DD, Takanen JO, Hober S, Alm T, Per-Henrik Edqvist HB, Tegel H, Mulder J, Rockberg J, PeterNilsson JMS, et al. Tissue-based map of the human proteome. Science (80-). 2015;347:1260419–1260419. https://doi.org/10.1126/science.1260419.

    Article  CAS  Google Scholar 

  64. Fatima S, Wagstaff KM, Lim SM, Polo JM, Young JC, Jans DA. The nuclear transporter importin 13 is critical for cell survival during embryonic stem cell differentiation. Biochem Biophys Res Commun. 2021;534:141–8. https://doi.org/10.1016/j.bbrc.2020.11.099.

    Article  CAS  PubMed  Google Scholar 

  65. You P, Peng Z, Wang Y, Tao T. Expression and subcellular distribution of imp13 are regulated in brain development. Vitr Cell Dev Biol - Anim. 2013;49:346–53. https://doi.org/10.1007/s11626-013-9599-z.

    Article  CAS  Google Scholar 

  66. Naumova OY. Down syndrome down syndrome 2021.

  67. Han S, Nam J, Li Y, Kim S, Cho SH, Cho YS, et al. Regulation of dendritic spines, spatial memory, and embryonic development by the TANC family of PSD-95-interacting proteins. J Neurosci. 2010;30:15102–12. https://doi.org/10.1523/JNEUROSCI.3128-10.2010.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Nimchinsky EA, Sabatini BL, Svoboda K. Structure and function of dendritic spines. Annu Rev Physiol. 2002;64:313–53. https://doi.org/10.1146/annurev.physiol.64.081501.160008.

    Article  CAS  PubMed  Google Scholar 

  69. Warre-Cornish K, Perfect L, Nagy R, Duarte RRR, Reid MJ, Raval P, et al. Interferon-γ signaling in human iPSC–derived neurons recapitulates neurodevelopmental disorder phenotypes. Sci Adv. 2020;6:1–17. https://doi.org/10.1126/sciadv.aay9506.

    Article  CAS  Google Scholar 

  70. Bilousova T, Dang H, Xu W, Gustafson S, Jin Y, Wickramasinghe L, et al. Major histocompatibility complex class I molecules modulate embryonic neuritogenesis and neuronal polarization. J Neuroimmunol. 2012;247:1–8. https://doi.org/10.1016/j.jneuroim.2012.03.008.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  71. Needleman LA, Liu XB, El-Sabeawy F, Jones EG, McAllister AK. MHC class I molecules are present both pre- and postsynaptically in the visual cortex during postnatal development and in adulthood. Proc Natl Acad Sci U S A. 2010;107:16999–7004. https://doi.org/10.1073/pnas.1006087107.

    Article  PubMed  PubMed Central  Google Scholar 

  72. Almeida A. Regulation of APC/C-Cdh1 and its function in neuronal survival. Mol Neurobiol. 2012;46:547–54. https://doi.org/10.1007/s12035-012-8309-2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  73. Zhou Z, He M, Shah AA, Wan Y. Insights into APC/C: from cellular function to diseases and therapeutics. Cell Div. 2016;11:1–18. https://doi.org/10.1186/s13008-016-0021-6.

    Article  CAS  Google Scholar 

  74. Li M, Shin YH, Hou L, Huang X, Wei Z, Klann E, et al. The adaptor protein of the anaphase promoting complex Cdh1 is essential in maintaining replicative lifespan and in learning and memory. Nat Cell Biol. 2008;10:1083–9. https://doi.org/10.1038/ncb1768.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  75. Delgado-Esteban M, García-Higuera I, Maestre C, Moreno S, Almeida A. APC/C-Cdh1 coordinates neurogenesis and cortical size during development. Nat Commun. 2013. https://doi.org/10.1038/ncomms3879.

    Article  PubMed  Google Scholar 

  76. Wu Y, Lin X, Lim IY, Chen L, Teh AL, MacIsaac JL, et al. Analysis of two birth tissues provides new insights into the epigenetic landscape of neonates born preterm. Clin Epigenet. 2019;11:1–12. https://doi.org/10.1186/s13148-018-0599-4.

    Article  CAS  Google Scholar 

  77. Li C, Cao M, Zhou X. Role of epigenetics in parturition and preterm birth. Biol Rev. 2021. https://doi.org/10.1111/brv.12825.

    Article  PubMed  Google Scholar 

  78. Maxwell JR, Yellowhair TR, Oppong AY, et al. Cognitive development in preterm infants: multifaceted deficits reflect vulnerability of rigorous neurodevelopmental pathways. Minerva Pediatr. 2017;69:298–313. https://doi.org/10.23736/s0026-4946.17.04905-2.

    Article  PubMed  Google Scholar 

  79. Haines BP, Rigby PWJ. Expression of the Lingo/LERN gene family during mouse embryogenesis. Gene Expr Patterns. 2008;8:79–86. https://doi.org/10.1016/j.modgep.2007.10.003.

    Article  CAS  PubMed  Google Scholar 

  80. Andrews JL, Fernandez-Enright F. A decade from discovery to therapy: Lingo-1, the dark horse in neurological and psychiatric disorders. Neurosci Biobehav Rev. 2015;56:97–114. https://doi.org/10.1016/j.neubiorev.2015.06.009.

    Article  CAS  PubMed  Google Scholar 

  81. Kalkan Z, Durasi IM, Sezerman U, Atasever-Arslan B. Potential of GRID2 receptor gene for preventing TNF-induced neurodegeneration in autism. Neurosci Lett. 2016;620:62–9. https://doi.org/10.1016/j.neulet.2016.03.043.

    Article  CAS  PubMed  Google Scholar 

  82. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, et al. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. https://doi.org/10.1101/gr.229102.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  83. Arbones ML, Thomazeau A, Nakano-Kobayashi A, Hagiwara M, Delabar JM. DYRK1A and cognition: a lifelong relationship. Pharmacol Ther. 2019;194:199–221. https://doi.org/10.1016/j.pharmthera.2018.09.010.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

The authors thank the study and clinical staff at Paarl Hospital, Mbekweni and TC Newman clinics, as well as the CEO of Paarl Hospital, and the Western Cape Health Department for their support of the study. The authors thank the families and children who participated in this study.

Funding

The Drakenstein Child Health Study was funded by the Bill & Melinda Gates Foundation (OPP 1017641), with additional support for this work from the South African Medical Research Council, and National Research Foundation South Africa., Additional support for the DNA methylation work was by the Eunice Kennedy Shriver National Institute of Child Health and Human Development of the National Institutes of Health (NICHD) under Award Number R21HD085849, and the Fogarty International Center (FIC). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. AH was supported by the HERCULES Center (NIEHS P30ES019776). GMC is supported by the NIEHS T32 Training Program in Environmental Health and Toxicology (5T32ES12870). DJS and HJZ are supported by the South African Medical Research Council (SAMRC). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of manuscript.

Author information

Authors and Affiliations

Authors

Contributions

DF performed the analyses, contributed to the methodology, visualized findings, prepared the original manuscript, and reviewed and edited the manuscript. SA and AK contributed to the methodology and reviewed and edited the manuscript. AH directed the analyses, contributed to the methodology, and was a major contributor in writing the manuscript. NK, KAD, AV, and CJW performed investigation, data curation, and writing—review and editing. MSK performed writing—review and editing, and supervision—generation of DNA methylation data. HJZ and DJS performed supervision, funding acquisition, and writing—review and editing. All authors read, edited, and approved the final manuscript.

Corresponding author

Correspondence to Anke Hüls.

Ethics declarations

Ethics approval and consent to participate

The DCHS staff obtain written consent from mothers on an annual basis and the study was approved by the Ethics Committee of the Faculty of Health Sciences, University of Cape Town, by Stellenbosch University and the Western Cape Provincial Research committee.

Consent for publication

Not applicable.

Competing interests

All authors declare they have no actual or potential competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

: Figures S1–S3. Figures outlining the analysis pipeline, DACT goodness-of-fit output, and significant mediator correlations.

Additional file 2

: Tables S1–S30. Tables showcasing HDMA results, CMA results, and findings at each step of the analysis pipeline.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Feil, D., Abrishamcar, S., Christensen, G.M. et al. DNA methylation as a potential mediator of the association between indoor air pollution and neurodevelopmental delay in a South African birth cohort. Clin Epigenet 15, 31 (2023). https://doi.org/10.1186/s13148-023-01444-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13148-023-01444-6

Keywords