Prenatal metal exposure, cord blood DNA methylation and persistence in childhood: an epigenome-wide association study of 12 metals

Background Prenatal exposure to essential and non-essential metals impacts birth and child health, including fetal growth and neurodevelopment. DNA methylation (DNAm) may be involved in pathways linking prenatal metal exposure and health. In the Project Viva cohort, we analyzed the extent to which metals (As, Ba, Cd, Cr, Cs, Cu, Hg, Mg, Mn, Pb, Se, and Zn) measured in maternal erythrocytes were associated with differentially methylated positions (DMPs) and regions (DMRs) in cord blood and tested if associations persisted in blood collected in mid-childhood. We measured metal concentrations in first-trimester maternal erythrocytes, and DNAm in cord blood (N = 361) and mid-childhood blood (N = 333, 6–10 years) with the Illumina HumanMethylation450 BeadChip. For each metal individually, we tested for DMPs using linear models (considered significant at FDR < 0.05), and for DMRs using comb-p (Sidak p < 0.05). Covariates included biologically relevant variables and estimated cell-type composition. We also performed sex-stratified analyses. Results Pb was associated with decreased methylation of cg20608990 (CASP8) (FDR = 0.04), and Mn was associated with increased methylation of cg02042823 (A2BP1) in cord blood (FDR = 9.73 × 10–6). Both associations remained significant but attenuated in blood DNAm collected at mid-childhood (p < 0.01). Two and nine Mn-associated DMPs were identified in male and female infants, respectively (FDR < 0.05), with two and six persisting in mid-childhood (p < 0.05). All metals except Ba and Pb were associated with ≥ 1 DMR among all infants (Sidak p < 0.05). Overlapping DMRs annotated to genes in the human leukocyte antigen (HLA) region were identified for Cr, Cs, Cu, Hg, Mg, and Mn. Conclusions Prenatal metal exposure is associated with DNAm, including DMRs annotated to genes involved in neurodevelopment. Future research is needed to determine if DNAm partially explains the relationship between prenatal metal exposures and health outcomes. Supplementary Information The online version contains supplementary material available at 10.1186/s13148-021-01198-z.

Environmental exposure to harmful heavy metals and metalloids, collectively referred to here as "metals, " from natural and anthropogenic sources is also a public health concern. The toxicity of some heavy metals can lead to adverse health outcomes through various biological mechanisms including oxidative stress, disruption of cell membrane potential and homeostasis, and competitive binding to proteins [11,12]. Higher levels of nonessential metals including arsenic (As), cadmium (Cd), and lead (Pb) have been associated with risk of low birth weight [13][14][15]. In addition, As, Cd, mercury (Hg), and Pb are potent neurotoxicants, and prenatal exposure to these metals has been associated with decreased measures of neurodevelopment and cognitive function in children [16][17][18][19]. Essential metals may also have nonlinear associations with child health. For example, maternal Mn has been shown to have an inverse U-shaped relationship with neurodevelopment in infants, with low and high levels associated with poor mental and psychomotor development [8]. Additionally, sex-specific associations for metals and health, including metal uptake and retention, have been reported, highlighting the need to study sex-specific associations [20,21]. Specifically, sex-specific associations have been reported with early-pregnancy maternal metal concentrations and birth outcomes in the Project Viva cohort [22].
Changes in epigenetic markers may be involved in the biological pathways, or serve as biomarkers, relating metal exposure to infant and child health. Embryonic development is a time of rapid epigenetic reprogramming, which may impact health outcomes later in life. As proposed by the developmental origins of health and disease (DOHaD) hypothesis [23], pre-and postnatal exposures that occur during critical developmental periods may affect disease phenotypes through epigenetic mechanisms including DNA methylation (DNAm) reprogramming [24]. Prenatal maternal exposure to a range of environmental contaminants including some metals has been associated with alterations in global and locus-specific DNAm [25].
For selected metals, there is ample evidence of epigenetic dysregulation. For example, multiple epigenomewide association studies (EWAS) of prenatal As exposure have been conducted, including populations with both low [26,27] and high exposure levels [28][29][30]; however, a common epigenetic signature of prenatal As exposure has not been identified, but rather multiple cohort-specific signals. Fewer EWAS of other prenatal metal exposures have been conducted. For Pb, two EWAS have identified varying signatures across populations. An EWAS prenatal Pb exposure and cord blood DNAm conducted in a cohort in Mexico did not identify significant associations between Pb and locus-specific DNAm [31]. However, an EWAS conducted among mother-infant pairs in the United States (US)-based Project Viva cohort (N = 268) identified four CpGs associated with second-trimester maternal red blood cell (RBC) Pb concentrations among infants overall, with 38 CpGs among female and two among male infants in sex-stratified analyses (FDR < 0.05) [32]. Also, within Project Viva, second-trimester maternal RBC Hg concentrations were associated with methylation of one CpG among infants overall and of one CpG among males (N = 321; p Bonferroni < 0.05) [33].
Considering the limited epidemiological research on associations between prenatal metal exposures and epigenetic dysregulation, in this study we investigate the extent to which prenatal metals measured in first-trimester maternal RBCs are associated with DNAm profiles in cord blood and compare associations across metals. Leveraging data and samples from Project Viva, a pre-birth cohort with relatively low levels of metal exposures, we analyzed associations between prenatal exposure to 12 metals [As, barium (Ba), Cd, chromium (Cr), cesium (Cs), Cu, Hg, Mg, Mn, Pb, Se, Zn] and differentially methylated positions (DMPs) and regions (DMRs) in cord blood. The current research expands upon previous analyses conducted in Project Viva by investigating multiple metals measured during the first trimester, reflecting exposure during embryonic, rather than fetal, development. We also investigated sex-specific associations and persistence of observed associations in mid-childhood.

Participant characteristics
DNAm data were available for 361 (Additional file 1: Figure S1) and 333 mother-infant pairs at birth and in mid-childhood, respectively, and a total of 189 motherinfant pairs had both cord blood and mid-childhood DNAm data. Characteristics of mother-infant pairs and maternal blood metal concentrations are provided in Table 1. Among participants with cord blood DNAm data, the median (IQR) maternal age at enrollment was 32.4 (29.7, 35.9) years. The majority of mothers were college graduates and had household incomes > $70,000 per year (70.6% and 60.4%, respectively). Approximately half of infants were female (46.8%), 70.4% were White, and 11.4% were Black. Spearman correlations between metals are presented in Additional file 1: Figure S2. The greatest correlations were observed between Cu and Zn (r = 0.56), As and Hg (r = 0.50), Cu and Mg (r = 0.45), and Mg and Se (r = 0.44) (p < 0.05). There were no significant differences in baseline characteristics or prenatal maternal blood metal concentrations between mother-infant pairs with cord blood or mid-childhood DNAm data (p > 0.10).
In linear models adjusted for infant sex, race, gestational age, maternal age, pre-pregnancy BMI, education, household income, and smoking, prenatal Cu exposure was associated with CD4 + T cell proportions estimated using the Houseman regression calibration method. For every doubling of Cu concentration, there was an increase of 3.25% in the proportion of estimated CD4 + T cells (p = 0.037) (Additional file 1: Table S1). There was a trend toward positive associations between Cd and CD4 + T cells and between Hg and monocytes, although these associations did not achieve statistical significance (B = 0.63, p = 0.05 and B = 0.23, p = 0.05, respectively).

Differentially methylated positions
Manhattan and Q-Q plots for each metal in fully adjusted models for cord blood DNAm are shown in Fig. 1 and Additional file 1: Figure S3, respectively; λ values are presented in Additional file 1: Table S2. Full results of EWAS for each metal are available on the study's GitHub repository (https:// github. com/ anneb ozack/ viva_ DNAm_ metals) and Open Science Framework site (https:// osf. io/ jf5yt/). Among infants overall, prenatal exposure to Mn  Table 2). In linear models testing for effect modification by sex, significant Mn × sex interaction terms were observed for cg00954161, cg11161853, cg23903787, cg19908812, cg26462130, cg08904630, cg22799518, cg01744822, cg15712310, and cg03763518 (p < 0.05). There was not significant Mn × sex interaction for cg02042823 (A2BP1) (p = 0.06), although the association between prenatal Mn and DNAm of cg02042823 was nominally significant among females (B for each doubling of Mn = 1.08%; p = 0.014) but met statistical significance after adjustment for multiple comparisons among males (B for each doubling of Mn = 3.40%; FDR = 1.10 × 10 -4 ). No DMPs were statistically significant in sex-stratified analyses for other metal exposures (FDR > 0.05), or for As and Hg adjusting for maternal fish intake.
Persistence of cord blood DMPs associated with prenatal Mn and Pb exposure was tested in blood collected in mid-childhood (Table 2) Of the nine Mn-associated DMPs among females, seven were nominally significant in mid-childhood (cg00954161, cg23903787, cg26462130, cg08904630, cg22799518, cg01744822, and cg15712310; p < 0.05). Among these CpGs, the direction of association was consistent between the two outcome time points with the exception of cg22799518 (RBMS2; cord blood B = 2.03, p = 2.03 × 10 -7 ; mid-childhood B = − 0.20, p = 0.004). The DMP negatively associated with first-trimester Pb among infants overall (cg20608990; CASP8) remained significantly associated in mid-childhood (B = − 1.56; p = 0.008). Mid-childhood results were consistent when restricting analyses to only children included in the cord blood EWAS (N = 189) except for cg22799518 among females, which failed to achieve nominal significance (p = 0.07).
We compared our results to previous EWAS of second-trimester maternal RBC Pb ) and Hg ) conducted in Project Viva. Among four CpGs associated with second-trimester Pb among infants overall, one was nominally associated with first-trimester Pb in our study (cg22112000: B = − 0.09%, p = 0.046), and among 38 CpGs associated with second-trimester Pb among females, four were nominally associated with first-trimester Pb in our study (cg17971003: annotated to SLN, B = 0.90%, p = 0.044; cg05959994: TFDP1, B = − 0.31%, p = 2.30 × 10 -4 ; cg04571282: KIAA1267, B = − 0.34%, p = 1.11 × 10 -4 ; cg11610754: TMEM59L, B = 0.07%, p = 0.039) (Additional file 1: Table S3). The direction of association between Pb and DNAm was consistent between analyses of first-and second-trimester concentrations. Neither of the two CpGs associated with second-trimester Pb among males replicated in our study. One CpG was previously associated with second-trimester Hg among infants overall (cg13340705, WBP11P1), and one CpG (cg13416866, TOR4A) and one DMR (chr7:94,953,653-94,954,202, PON1) was associated with second-trimester Hg among males ). Although the CpG associated with Hg among infants overall did not replicate in our study, among males, cg13416866 had a nominally significant association with first-trimester Hg and consistent direction of association (B = 0.51%; p = 0.028) (Additional file 1: Table S3). Consistent with second-trimester Hg analyses, all CpGs in the region chr7:94,953,653-94,954,202 had a negative direction of association among males in our study of first-trimester Hg, although only one was nominally associated with Hg

Differentially methylated regions
The number of DMRs identified in cord blood for each metal among infants overall and stratified by sex is listed in Table 3 (Sidak p < 0.05) and is shown in the Manhattan plots in Fig. 1 (noted by bolded individual CpGs within the region). For each DMR, the chromosomal coordinates, gene, probes, probe effect sizes, and p values in linear analyses (limma) are provided in Additional file 2: Spreadsheets 1-12. In summary, at least one DMR was identified among infants overall for all metals with the exception of Ba and Pb. Cs was associated with the Table 2 Cord blood differentially methylated positions associated with Mn and Pb (FDR < 0.05) Differentially methylated positions (DMPs) identified using models of individual log 2 -transformed prenatal metal concentration adjusted for infant sex, race/ethnicity, gestational age (age at blood collection for midchildhood models), nulliparous, maternal age at enrollment, pre-pregnancy BMI, education, smoking, household income, and estimated cord blood cell-type proportions (estimated in cord blood or from leukocytes in mid-childhood). Mean difference in % methylation determined using the same adjusted models for  We assessed the overlap among DMRs identified for each metal (Table 4). Three DMRs were identified as associated with more than one metal in analyses of infants overall for Hg and Mg (chr6: RNF39, Fig. 2a), Mg and Mn (chr6: TNXB, Fig. 2b), and Cs, Cu, and Mg (chr20: GNAS, GNASAS).
Results from Gene Ontology (GO) enrichment analysis of DMRs are summarized in Table 5; complete results and differentially methylated genes within GO terms are included in Additional file 2: Spreadsheet 13. We identified nominally significant GO terms with an overrepresentation of differentially methylated genes associated with Cr (N = 18), Cs (N = 20), Cu (N = 17), Se (N = 41), and Zn (N = 18) (p < 0.05); no terms remained significant after FDR adjustment. The majority of GO terms (87%) belonged to the biological process domain. Three GO terms were identified for multiple metals, although differentially methylated genes within GO terms differed between metals. Common GO terms were: cellular response to DNA damage stimulus [GO:00069741, associated with Cs (differentially methylated: ZBTB38, ZNF385A, In sex-stratified analysis, at least one DMR was identified in all analyses among females with the exception of Hg, Pb, and Zn, and in all analyses among males with the exception of Cr (Table 3, Sidak p < 0.05). DMRs located in the HLA region were also identified among males (for Cs: chr6: intergenic and GTF2H4; for Hg: RNF39; Fig. 2ad). Although the majority of these DMRs were specific to sex-stratified analyses, within metals, we did observe overlap between DMRs identified in analyses of infants overall and stratified by sex (Additional file 1: Table S4). Two sets of overlapping DMRs were associated with multiple metals only in analyses restricted to males: Cs and Hg (chr6: 31,650,735-31,651,361, Fig. 2e) and Mg and Se (chr19: AURKC). To assess persistence of associations within DMRs in mid-childhood, we compared the probe effect sizes and p values from DMP analyses within each region between cord blood and mid-childhood (Additional file 2: Spreadsheets 1-12). Among infants overall, four DMRs contained 100% of probes that were nominally significant in mid-childhood (p < 0.05), associated with Cd (chr17: HOXB7), Mg (chr11: B4GALNT4), and Zn (chr17: FSCN2; chr20, LOC284798). In sensitivity analyses limited to children with cord blood DNAm data, eight Cord blood differentially methylated regions associated with metals within the human leukocyte antigen region. Cord blood differentially methylated regions (DMRs) located within the human leukocyte antigen (HLA) region (chr6: 28,477,797-33,448,3540) were associated with prenatal exposure to a Cr in infants overall and males; b Hg in infants overall and males, and Mg in infants overall, c Cs in infants overall and males; d Cr in females; e Cs and Hg in males; f Hg and Mn in infants overall; g Cs in infants overall; h Cr in infants overall; and i Cs in infants overall. DMRs identified using comb-p (Sidak p < 0.05) adjusted for infant sex, race/ethnicity, gestational age, nulliparous, maternal age at enrollment, pre-pregnancy BMI, education, smoking, household income, and estimated cord blood cell-type proportions

Conclusions
We investigated epigenome-wide associations of prenatal concentrations of 12 metals measured in maternal firsttrimester RBC samples with cord blood DNAm and persistence DNAm changes in mid-childhood in the Project Viva pre-birth cohort. Overall we identified two CpGs at which cord blood DNAm was associated with specific metal levels in the first trimester: cg02042823 (A2BP1), associated with Mn, and cg20608990 (CASP8), associated with Pb, and both DNAm associations seemed to persist in mid-childhood blood. Sex-specific associations with cord blood DNAm at individual CpGs were observed with Mn exposure (9 among females, and 2 CpGs among males that included cg02042823; FDR < 0.05); the two male-specific cord blood DMPs were persistent while six of the female-specific DMPs were persistent in midchildhood blood DNAm. Multiple DMRs were identified with all prenatal metal exposures among infants overall with the exception of Ba and Pb (Sidak p < 0.05). We identified overlapping DNA methylation regions associated with prenatal exposure to Cr, Cs, Cu, Hg, Mg, and Mn in the HLA region. The same cord blood DMR within the GNAS gene was associated with prenatal Cs, Cu, and Mg among all infants.
Although our study did not identify individual DMPs associated with exposure to metals other than Mn and Pb, previous EWAS of selected prenatal exposures have identified DMPs. For example, among one of most widely studied metals, As, prior reports in other prenatal cohorts have documented multiple associated DMPs [26,[28][29][30][34][35][36]. These studies used maternal drinking water, urine, or nails to measure As exposure, which capture toxic inorganic forms of As. Urinary As in these studies was measured as the sum of inorganic species, whereas As is primarily present in inorganic forms in nail samples [37]. In our study, we used maternal RBCs which provide total arsenicals, including less toxic organic forms. In addition, the majority of previous studies have been conducted in populations with moderate-to-high As exposure, whereas exposure was low in our study. These factors potentially explain the lack of significant associations for our study. An EWAS of prenatal maternal blood Cd concentrations and DNAm in cord blood in a Bangladeshi cohort (N = 127) identified sex-specific effects more pronounced in boys, although no DMPs were statistically significant after adjustment for multiple comparisons [38]. This is in line with our study in which there was no evidence of DMPs associated with prenatal blood Cd levels. In our study median cadmium levels were much lower (0.39 ng/g) compared to median levels in the Bangladeshi study (1.3 ug/kg).
Mn is an essential trace element necessary for numerous biological functions including the metabolism of amino acids, proteins, lipids, and carbohydrates; immune function; energy regulation; and protection against free radicals [39]. Mn is also necessary for brain development and cognitive function, although Mn is a neurotoxicant at excessive levels [40], and in epidemiological studies, high levels of exposure to Mn prenatally have been negatively associated with measures of neurodevelopment [8,41]. Sex-specific associations have also been observed between prenatal Mn exposure and adolescent neuromotor [42] and neurobehavioral function [43], with high exposure associated with overall poorer outcomes among females compared with males. Few EWAS have previously investigated associations between prenatal Mn exposure and DNAm in offspring. An EWAS of placental DNAm and infant toenail Mn concentrations in Table 4 Overlapping cord blood differentially methylated regions for metals among infants overall (Sidak p < 0.05) Differentially methylated regions (DMRs) identified using comb-p adjusted for infant sex, race/ethnicity, gestational age, nulliparous, maternal age at enrollment, prepregnancy BMI, education, smoking, household income, and estimated cord blood cell-type proportions a US birth cohort (N = 61) identified differential methylation at five CpGs with tertiles of Mn exposure (p Bonferroni < 0.05) [44]; however, no Bonferroni-significant DMPs nor DMPs at p < 1 × 10 -4 overlapped with the Mnassociated DMPs or DMRs identified in our study. Given the sensitive window of prenatal development, timing of exposure and tissue used for Mn measurement could influence findings which might explain the lack of overlap in study results.
We identified Mn-associated DMPs annotated to genes with roles in neurodevelopment. Among infants overall and in stratified analyses of male infants, we found Mn exposure to be associated with greater methylation of cg02042823, annotated to Ataxin-2-binding protein 1 (A2BP1), also known as RNA binding fox-1 homolog 1 (RBFOX1). A2BP1 is an RNA binding protein involved in tissue-specific pre-mRNA splicing of transcripts involved in neuronal development [45,46]. A2BP1-knockout mice

Table 5 Summary of Gene Ontology (GO) enrichment analyses of cord blood differentially methylated regions (DMRs)
Prenatal exposure to Cr, Cs, Cu, Se, and Zn was associated with 18, 20, 17, 41, and 18 GO terms, respectively (p < 0.05). GO terms were identified for multiple metals are: cellular response to DNA damage stimulus (GO:00069741, associated with Cs and Cu), response to stress (GO:0006950, Cu and Se), and cellular response to stress (GO:0033554; Cs and Cu). Closely related GO terms have been combined; for a complete list of DMR-associated GO terms, see Additional file 2: Spreadsheet 13 have splicing changes in proteins involved in synaptic transmission and increased neuronal excitability [47]. In humans, A2BP1 variants have been associated with biomarkers of Alzheimer's disease [48] and copy number variation of A2BP1 has been associated with autism spectrum disorder [49]. We also found prenatal Mn exposure among females to be associated with methylation of cg19908812, annotated to Neuropeptide Y receptor Y1 (NPY1R), which encodes a transmembrane protein involved in neurotransmitter activity. NPY1R expression has been associated with anxious temperament in monkeys [50] and deletion involving NPY1R has been identified in a case study of autism [51]. Similarly, cg26462130, annotated to Mitotic arrest deficient 1 like 1 (MAD1L1), was associated with Mn in female infants. MAD1L1 is involved in mitotic spindle-assembly and genetic variation of MAD1L1 has been associated with schizophrenia [52] and anxiety disorders [53]. Whether our findings are implicated in neurodevelopmental disruption remains to be determined. Prenatal Pb exposure has been negatively associated with mental and cognitive development and sensory function in infants and children [54][55][56]. In a previous study conducted in the Project Viva cohort, where Pb levels were below the CDC reference of 5 μg/dL, there was a trend in lower neurobehavioral measures with higher prenatal Pb exposure [57]. Animal models have provided evidence that Pb effects global DNAm levels via DNA methyltransferase inhibition [58,59], and prenatal Pb exposure has been negatively associated with global levels DNAm measured in cord blood [60]. There is limited research on epigenome-wide associations between Pb and locus-specific association with DNAm; however, an EWAS conducted in a Mexican pre-birth cohort (N = 420) identified no associations between prenatal exposure to Pb and cord blood DNAm [31]. Overall, we similarly found null results in Pb analyses with the exception of the DMP cg20608990, annotated to Caspase 8 (CASP8). CASP8 is involved in cell apoptosis [61], including neuronal apoptosis [62,63], and cell death may be induced by Pb through a pathway involving CASP8 [64]. DNAm of the CASP8 promoter was not found to be associated with Pb and Cd exposure among children environmentally exposed through e-waste recycling (N = 116), although this study differed from our analyses in age participants (i.e., exposure and DNAm measured at 3-7 years old) [65]. In addition, this study measured DNAm at four loci within the CASP8 promoter using pyrosequencing.
In Project Viva, previous EWAS have been conducted for second-trimester maternal RBC Pb (N = 268) [32] and second-trimester maternal RBC Hg [33] (N = 321). Using a look-up approach of CpGs associated with second-trimester Pb, in our study first-trimester Pb was nominally associated with one CpG among all infants and four CpGs among females (p < 0.05), with a consistent direction of association across EWAS. In the EWAS of second-trimester Hg, DNAm of cg13340705 (WBP11P1) and cg13416866 (TOR4A) was positively associated with exposure among infants overall and males, respectively (p Bonferroni < 0.05) [33]. In the current analyses of first-trimester RBC Hg concentrations, cg13416866 had a nominally significant positive association with Hg among males. In addition, Cardenas et al. identified one DMR associated with second-trimester Hg among male infants: chr7:94,953,653-94,954,202, annotated to Paraoxonase 1 (PON1) and consisting of 9 CpGs with negative associations with Hg (FDR < 0.05). Similarly, all CpGs in this region had a negative direction of association among males in our study of first-trimester Hg, although only one was nominally associated with Hg (cg04871131, p = 0.026). Limited replication of results between EWAS of first-and second-trimester exposures may be due to windows of susceptibility to epigenetic dysregulation or incomplete overlap between mother-infant pairs included in first-trimester and second-trimester EWAS of Hg and Pb in Project Viva. These results highlight the complexity of epigenetic dysregulation for trimesterspecific exposures during fetal development, a period of rapid epigenetic reprogramming.
Our study found multiple DMRs located in the HLA region associated with Cr, Cs, Hg, Mg, and Mn, including common DMRs associated with Hg and Mg (annotated to RNF39) and Mg and Mn (annotated to TNXB). The HLA region spans a ~ 4 Mb segment of the chromosomal band 6p21 and contains an overrepresentation of protein-coding genes [66]. The HLA region is involved in immune regulation, including inflammation and innate and adaptive immunity, as well as reproduction, central nervous system development, and neurological disorders [66]. Ring Finger Protein 29 (RNF39) is a non-HLA expressed gene involved in regulation of synaptic plasticity [67]. RNF39 was found to be differentially expressed in rodent brains exposed to methylmercury [68], and differential methylation of RNF39 has been associated with schizophrenia spectrum disorders [69] and posttraumatic stress disorder [70]. Tenascin XB (TNXB) is also a non-HLA expressed gene, and TNXB methylation has similarly been associated with psychiatric disorders including social anxiety disorder [71] and anorexia nervosa [72]. We also identified multiple DMRs annotated to GNAS associated with Cs, Cu, and Mg among all infants, and with Cu and Mg among females. GNAS complex locus (GNAS) is a protein-coding imprinted gene which includes four alternative exons. Methylation of GNAS has been associated with schizophrenia spectrum disorders [69], and parent-or-origin effects of SNPs have been associated with DNAm of GNAS [73]. In GO enrichment analysis of cord blood DMRs, Cs and Cu were nominally associated with GO terms including GNAS and involved in a broad range of biological processes including growth, development, response to stress, and metabolic processes.
In addition, GO analyses of DMRs identified similar terms nominally associated with multiple prenatal metals. Specifically, terms related to response to stress were enriched for Cs-, Cu-, and Se-associated differentially methylated genes; terms related to apoptosis and cell death were enriched for Cr-and Se-associated differentially methylated genes; and terms related to macromolecule metabolic processes were enriched for Cr-, Cs-, and Cu-associated differentially methylated genes. Interestingly, differentially methylated genes in related pathways differed between metals, suggesting that prenatal metal exposures may affect common biological pathways through epigenetic regulation of multiple genes.
This study was strengthened by the use of maternal blood metal concentrations measured in the first trimester, which served as an unbiased biomarker of prenatal exposure in early life. We were also able to confirm reliability of maternal blood metal concentrations by calculating ICCs using duplicate samples. Overall, metal concentration measurements had good reliability (ICCs > 0.70); however, Cr and Cu had lower ICCs (0.40 and 0.64, respectively), which should be considered in interpreting the results from these EWAS. In addition, our study measured multiple metals in maternal blood samples and DNAm of offspring at birth and in midchildhood. This allowed us to compare EWAS results across prenatal metal exposures in the same population, as well as test for the persistence of differential methylation in childhood. Although our sample size was moderate, we were able to conduct stratified analyses to investigate sex-specific effects of metal exposures. It should be noted, however, that we adjusted for multiple comparisons within each metal EWAS (i.e., 394,460 probes), and we did not adjust for the number of metals tested or multiple testing introduced in sex-stratified analyses, which may increase the number of false-positives. This study was also strengthened by analyzing both locus-specific and regional differential methylation. DMRs were identified using the comb-p algorithm, which has been shown to have higher power and greater ability to identify DMRs with small effect sizes than other methods to detect DMRs [74] but at the expense of a higher Type I error rate [75]. Therefore, our results reflect candidate genomic positions associated with prenatal metal exposure, and the robustness of our findings should be tested in other cohorts and confirmed in meta-analyses.
Many of the DMRs we identified were restricted to sexstratified analyses, suggesting sexual dimorphism in the epigenomic effects of prenatal metal exposures; however, further research is needed to understand sex-specific regional differences in DNAm as most studies limit analyses across sex.
Several limitations of our study should be considered. Maternal blood metal exposure was assessed at a single time, and a single measurement may not capture critical windows of vulnerability during fetal development to epigenetic dysregulation. We also lacked data and samples for gene expression and cannot determine if changes in DNAm were associated with alterations in gene expression. In addition, although we investigated persistence of metal-induced epigenetic changes in mid-childhood, we did not have measures of childhood metal exposure which may also impact epigenetic programming. Most Mn-associated DMPs were nominally significant in midchildhood (p < 0.05 for all Mn-associated DMPs among infants overall and males, and 78% of DMPs among females); however, we observed limited persistence of DMRs in mid-childhood (four DMRs had p < 0.05 in 100% of probes in mid-childhood). We are unable to assess how metal exposure between birth and mid-childhood may have affected DNA methylation levels. Our analyses of persistence of effects in mid-childhood were also limited by the incomplete overlap between motherinfant pairs with cord blood and mid-childhood DNAm data (N = 189 mother-infant pairs with data at both time points). Furthermore, we chose to analyze associations between each metal exposure and DNAm separately to identify independent effects on epigenetic dysregulation. However, prenatal metal exposure may also act as a mixture to affect DMAm profiles. DNA methylation disruption across the genome resulting from prenatal maternal exposures appear non-targeted with multiple studies finding evidence of numerous associations that appear cohort specific. This might indicate the importance of taking into account environmental mixtures in epigenetic research. Statistical methods are needed in order to capture the complexity of environmental mixtures on the epigenome.
Few studies have investigated effects of multiple prenatal metal exposures on epigenome-wide DNAm and its persistence in childhood. In this study of 12 metals, we provide evidence that prenatal metal exposure is associated with cord blood DNAm at individual CpGs for two specific metals (Mn and Pb). In addition, we observed associations between metal exposures and overlapping DMRs annotated to genes related to neurodevelopment and co-located in the HLA region, although analyses should be replicated in other cohorts to determine the generalizability of our findings. Further research may also provide insights to the effect of mixtures of prenatal metal exposures on epigenetic dysregulation as well as the impact of changes in DNAm on gene expression and biological pathways and child health outcomes.

Study design
Project Viva is a longitudinal pre-birth cohort established to examine associations between maternal diet and other environmental factors and maternal and child health. The Project Viva cohort has previously been described in detail [76]. Briefly, we recruited women between 1999 and 2002 from Atrius Harvard Vanguard Medical Associates, a group practice in eastern Massachusetts, US Research staff screened women at their initial obstetric visit (median 9.9 weeks of gestation). Exclusion criteria were multiple gestation, not English-speaking, ≥ 22 weeks gestation, and plan to leave the study area prior to delivery. Initially, we included a total of 2,670 pregnancies (64% of those screened), of which 2,128 live births remained in the cohort at the time of delivery (Additional file 1: Figure S1). For mothers with two children included in the current dataset (N = 2), we excluded the second birth from the current analyses.
At study recruitment (median = 10 weeks gestation), research assistants administered a brief interview and provided mothers with a questionnaire to return by mail. Research assistants also conducted a mid-pregnancy visit, a hospital visit during the birth admission, and visits during infancy and mid-childhood (median 7.66 years). At birth, research assistants collected cord blood samples. Mothers provided written informed consent at enrollment and at the mid-childhood visit. Protocols for biosample collection were designed to minimize discomfort and inconvenience for mothers and children. The Institutional Review Board of Harvard Pilgrim Health Care reviewed and approved all study protocols.

Sample collection, processing, and analysis
Maternal RBC metal analysis: We collected blood samples from pregnant women at recruitment. We centrifuged blood samples (2,000 rpm for 10 min at 4 °C) to separate erythrocytes from plasma and stored aliquots at -70 °C prior to analyses. We performed sample handling in an ISO class 6 clean room with an ISO class 5 laminar flow clean hood. Briefly, 0.5 ml of packed erythrocytes was weighed and digested in 2 ml of ultra-pure concentrated HNO 3 acid for 48 h, and then further digested with 1 ml of 30% ultra-pure hydrogen peroxide for 24 h before diluting to 10 ml with deionized water.
We measured first-trimester concentrations of the metals aluminum (Al), As, Ba, Cd, cobalt (Co); Cr, Cs, Cu, Mg, Mn, molybdenum (Mo); nickel (Ni), Pb, antimony (Sb), Se, tin (Sn), Tl: thallium, vanadium (V), and Zn in erythrocytes using a triple quadrupole inductively coupled plasma mass spectrometry (ICP-MS) (Agilent 8800 ICP-QQQ) on a single run in MS/MS mode using appropriate cell gases and internal standards. We analyzed Hg concentrations separately using a Direct Mercury Analyzer 80 (Milestone Inc.). For Hg, three samples had insufficient quantity for analysis.
We applied the following quality control (QC) measures: analysis of initial and continuous calibration verification, procedural blanks, and repeated analysis of 2% of samples. In addition, Seronorm-Blood L3 was analyzed daily as QC samples with one blinded sample at high and low levels run per batch. Recoveries of QC standards were between 90 and 110% with the exception of Al, Sn, and Sb where measured values were below the limit of detection (LOD) out of the acceptable range 80-120%. Intraday CVs were calculated using three concentrations of in-house QC pools (n = 7). Intraday CVs were < 5% for all analytes with the exception of Se and Sb, which were < 10%. Interday CVs for all elements were < 15% except for concentrations near the LOD. We measured metals concentrations as ng/g red blood cells. In this analysis, we include only metals that met the quality control criteria of percent detected ≥ 80% and intraclass correlation coefficients (ICCs) ≥ 0.70 among duplicates, with the exception of Cr and Cu which had ICCs = 0.40 and 0.64, respectively, but had detectable measures in over 80% of the samples.
Infant and child blood and DNAm analysis: Clinicians collected cord blood at delivery using a syringe and needle from the umbilical vein. Paper or electronic "flags" on participants' charts were used to prompt clinical staff to collect samples, and cord blood samples were collected from approximately 75% of participants who delivered at the study hospitals. We collected fasting blood samples from children at the mid-childhood visit using ethylenediaminetetraacetic acid (EDTA)-containing vacutainer tubes, and put samples on ice. To separate plasma, nucleated cells (including leukocytes and nucleated RBCs in cord blood and leukocytes in child blood), and RBCs, we centrifuged the tubes at 1,700×g for 10 min at 4 °C within 24 h of collection.
Measurement of DNAm has previously been described [32]. Briefly, we extracted genomic DNA from the nucleated cells with PureGene Kits (Fisher, Catalog Nos. A407-4, A416-4; Qiagen, Catalog Nos. 158908, 158912, 158924), and we stored sample aliquots at − 80 °C until analysis. Research staff performed bisulfite conversion using the Zymo DNA Methylation kit (Zymo Research, Irvine, CA). One μg DNA of each sample was randomized across plates and BeadChips to reduce batch effects. DNAm analysis was performed at Illumina, Inc.

Covariates
We collected maternal demographics (including education and household income), smoking status, and prepregnancy height and weight through self-administered questionnaires and interviews during pregnancy. We estimated maternal fish intake as servings per week for the first trimester or period closest to blood draw using a semi-quantitative food frequency questionnaire [77]. We used maternal self-reported height and weight to calculate pre-pregnancy body mass index (BMI; kg/m 2 ). We estimated gestational age from mothers' last menstrual period (LMP) at enrollment; we used gestational age determined by ultrasound if available and differed from LMP by > 10 days [76].

Data processing
DNAm data were preprocessed using the R package minfi [78]. We excluded duplicate samples, samples with low individual call rates (< 0.98), and samples that had a genotype or sex mismatch (N = 22 cord blood DNAm; Additional file 1: Figure S1). Non-CpG probes and probes with detection p values > 0.05 for > 1% of samples were dropped. In addition, we excluded probes located on sex the chromosomes, cross hybridizing probes [79], and probes associated with single-nucleotide polymorphisms (SNPs) that have a minor allele frequency ≥ 0.05 at the single base extension or within the target region. A total of 394,460 high-quality probes remained for analyses. We performed background correction and dye-bias equalization using the normal-exponential out-of-band method (noob) [80], and probe-type normalization using the beta-mixture quantile method (BMIQ) implemented through minfi. We estimated cell-type composition using the Houseman regression calibration method [81] implemented in the minfi package [78] using reference panels derived from cord blood nucleated cells for cord blood cell-type estimates [82] and adult leukocytes for midchildhood cell-type estimates [83].

Data analysis
Metal concentrations < the LOD were replaced with LOD/√2. The LODs and the number of samples < LOD for each metal were: As: 0. We presented descriptive statistics for participant characteristics and metal concentration using medians and interquartile ranges (IQRs) for continuous variables and frequencies and proportions for categorical variables. We assessed differences between mother-infant pairs with cord blood and mid-childhood DNAm data using the Mann-Whitney test for continuous variables and Chi-squared test for categorical variables. We tested associations between prenatal first-trimester metal concentrations and estimated cord blood cell-type proportions [B cells, CD4 + T cells, CD8 + T cells, granulocytes, monocytes, natural killer (NK) cells, and nucleated RBCs] with linear models adjusted for infant sex, race/ethnicity, gestational age, nulliparous, maternal age at enrollment, pre-pregnancy BMI, education (< college graduate or college graduate), household income (≤ $70,000 per year or > $70,000 per year), and maternal smoking (never, former, or smoking during pregnancy).
We conducted EWAS for DMPs (i.e., individual CpGs) using linear regression with empirical Bayes smoothing of standard errors with the robust estimator for prior variances implemented using the R package limma [84]. To better meet models assumptions, we performed analyses using the logit-transformation of Beta-values (i.e., M-value = ln[Beta-value / (1-Beta-value)]) [85]. We conducted EWAS using limma independently for each log 2 -transformed prenatal metal concentration (i.e., 12 EWAS of cord blood DNAm). Overall, we adjusted cord blood models for infant sex, race/ethnicity, gestational age, nulliparous, maternal age at enrollment, prepregnancy BMI, education, household income, maternal smoking, and estimated cell-type distribution (B cells, CD4 + T cells, CD8 + T cells, granulocytes, monocytes, NK cells, and nucleated RBCs). For each analysis, we generated Q-Q plots and calculated the genomic inflation factor (λ) to evaluate potential systematic biases. Within each metal EWAS, we used the Benjamini-Hochberg false discovery rate (FDR) method [86] to adjust for multiple comparisons using the p.adjust function in R. To address potential effect modification by infant sex, we performed limma analyses for infants overall and stratified by sex. For FDR-significant probes in sex-stratified analyses, we further evaluated effect modification using linear models including a metal × sex interaction term. We performed sensitivity analysis for associations with As and Hg including the covariate of maternal fish consumption during pregnancy. We report DMP effect sizes on the Beta-value scale for interpretability.
We analyzed DMRs using the comb-p method [87] implemented in the R package ENmix [88]. Comb-p identities DMRs using all EWAS p values and associated chromosomal coordinates by first adjusting for autocorrelation between nearby probes using the Stouffer-Liptak-Kechris (slk) correction (e.g., p values with adjacent low values will be pulled lower). An FDR correction is applied to slk-adjusted p values, regions are identified among probes meeting a specified FDR threshold, and a single slk-adjusted p value is calculated for each region. Correction for multiple comparisons is performed on regional p values using a Sidak correction based on the total number of possible regions. We performed comb-p using the arguments of a maximum distance to combine DMRs of 1,000 base pairs, the recommended bin size for autocorrelation of 310, and the FDR significance threshold of 0.001. Regions containing only one probe were excluded, and regions with Sidak p < 0.05 were considered significant. GO enrichment analysis of DMRs was performed using goregion implemented in the R package missMethy [89,90]. The goregion function uses genes annotated to DMRs to identify GO terms including an overrepresentation of differentially methylated genes while accounting for coverage of genes on the 450K microarray [91]. goregion was implemented independently for each metal with the input of cord blood DMRs (Sidak p < 0.05) and all CpGs included in analyses. REVIGO was used to aid in interpretation of relationships between identified GO terms [92].
To test the persistence of observed associations between prenatal metal exposure and cord blood DNAm, we performed limma analyses with mid-childhood DNAm. Models included the covariates of infant sex, race/ethnicity, child age at blood draw, nulliparous, maternal age at enrollment, pre-pregnancy BMI, education, household income, maternal smoking, and midchildhood estimated cell-type composition (B cells, CD4 + T cells, CD8 + T cells, granulocytes, monocytes, and NK cells). We performed sensitivity analyses restricting samples to children included in the cord blood DNAm analyses. We used a nominal p value < 0.05 along with consistency of direction of association to evaluate persistence of associations in mid-childhood.
To compare our findings of results of previous EWAS of second-trimester maternal RBC Hg and Pb in Project Viva [32,33], we used a look-up approach of CpGs identified as associated with Hg and Pb by Wu et al. and Cardenas et al., respectively. We used an unadjusted p value < 0.05 and direction of association to determine consistency of results between EWAS of first-and second-trimester metal exposures.
We performed all data analysis using R 4.0.3. [93]. AKB developed the data analysis plan, analyzed and visualized data, and drafted the manuscript; SLR-S was involved in developing study methodology and data management and reviewed and edited the manuscript; BAC was involved in developing study methodology and reviewed and edited the manuscript; AAB reviewed and edited the manuscript; ROW was involved in funding acquisition and reviewed and edited the manuscript; CA was involved in sample processing and reviewed and edited the manuscript; DRG was involved in developing study methodology and reviewed and edited the manuscript; EO was involved in developing study methodology and funding acquisition and reviewed and edited the manuscript; M-FH was involved in developing study methodology and reviewed and edited the manuscript; Andres Cardenas conceptualized the project, developed the data analysis plan, was involved in developing study methodology and funding acquisition, and reviewed and edited the manuscript. All authors read and approved the final manuscript.

Funding
This work was supported by the United States National Institutes of Health Grants R01ES031259, R01HD034568, and UH3OD023286.

Availability of data and materials
Datasets generated and analyzed during the current study are not publicly available because we did not obtain consent for such public release of epigenetic data from participants. However, raw data to generate figures and tables are available from the corresponding author with the appropriate permission from the Project Viva study team and investigators (project_viva@hphc.org) upon reasonable request and institutional review board approval. R code for all analyses is available at the study's GitHub repository (https:// github. com/ anneb ozack/ viva_ DNAm_ metals). Complete cord blood EWAS results for all metals are available at GitHub repository and Open Science Framework (OSF) site (https:// osf. io/ jf5yt/).