- Research
- Open access
- Published:
Exposure-associated DNA methylation among people exposed to multiple industrial pollutants
Clinical Epigenetics volume 16, Article number: 111 (2024)
Abstract
Background
Current research on the epigenetic repercussions of exposure to a combination of pollutants is limited. This study aims to discern DNA methylation probes associated with exposure to multiple pollutants, serving as early effect markers, and single-nucleotide polymorphisms (SNPs) as surrogate indicators for population susceptibility. The investigation involved the analysis of urine exposure biomarkers for 11 heavy metals (vanadium, arsenic, mercury, cadmium, chromium, nickel, lead, manganese, copper, strontium, thallium), polycyclic aromatic hydrocarbon (PAHs) (1-hydroxypyrene), genome-wide DNA methylation sequencing, and SNPs array on all study participants. The data were integrated with metabolomics information and analyzed both at a community level based on proximity to home addresses relative to the complex and at an individual level based on exposure biomarker concentrations.
Results
On a community level, 67 exposure-related CpG probes were identified, while 70 CpG probes were associated with urine arsenic concentration, 2 with mercury, and 46 with vanadium on an individual level. These probes were annotated to genes implicated in cancers and chronic kidney disease. Weighted quantile sum regression analysis revealed that vanadium, mercury, and 1-hydroxypyrene contributed the most to cg08238319 hypomethylation. cg08238319 is annotated to the aryl hydrocarbon receptor repressor (AHRR) gene, and AHRR hypomethylation was correlated with an elevated risk of lung cancer. AHRR was further linked to deregulations in phenylalanine metabolism, alanine, aspartate, and glutamate metabolism, along with heightened oxidative stress. Additionally, three SNPs (rs11085020, rs199442, and rs10947050) corresponding to exposure-related CpG probes exhibited significant interaction effects with multiple heavy metals and PAHs exposure, and have been implicated in cancer progression and respiratory diseases.
Conclusion
Our findings underscore the pivotal role of AHRR methylation in gene-environment interactions and highlight SNPs that could potentially serve as indicators of population susceptibility in regions exposed to multiple heavy metals and PAHs.
Introduction
It is estimated that 70–90% of the human disease burden could be attributed to environmental exposures [1]. Traditionally, studies in environmental health have focused on understanding the toxic effects and biological mechanisms related to exposure to single pollutants, such as heavy metals and polycyclic aromatic hydrocarbons (PAHs) [2, 3]. However, real-world scenarios involve humans being exposed to complex mixtures of pollutants simultaneously. Even when individual pollutants do not surpass toxicity or regulatory limits, their cumulative exposure might still lead to additive effects. Once these pollutants enter the body, they interact with different substrates, including genome, epigenome, and metabolome. These interactions are key in determining how an individual responds to pollutants and subsequent health effects. The emerging field of precision environmental health advocates for a comprehensive approach that considers multi-pollutant and integrates multi-omics analysis. This approach aims to achieve a thorough understanding of exposure effects and how they vary from person to person. By examining how multiple pollutants interact with our biological systems, we can gain a deeper insight into their collective impact on overall disease burden [4, 5]. This could be instrumental in identifying individuals who are more susceptible to health issues due to environmental exposures. Such knowledge is essential for creating and implementing evidence-based targeted prevention and intervention strategies.
Residents living near petrochemical complexes could be exposed to multiple industrial pollutants due to the consortium of high pollution facilities including coal-fired power plants and oil refineries. We have conducted a series of studies near the largest petrochemical complex in Taiwan and applied exposomics approach to identify exposure biomarkers, early health effect biomarkers, and metabolomic changes linking multiple-pollutant exposure with multiple adverse health outcomes, including cancer, chronic kidney disease (CKD), liver injuries, hyperlipidemia, and respiratory diseases [6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23]. The exposome is the sum of all the exposures that an individual has from birth to death and exposomics is the comprehensive evaluation of all exposures and their contribution to disease causation or progression. It is recommended in exposomics studies to employ omics methods to identify links between exposures and health outcomes, understand the mechanisms of disease development and progression, and potentially developing new biomarkers for exposure and early health effects [1, 24, 25]. We showed novel omics tools could help identify the complex relationship between well-characterized multiple exposures and health impacts in residents living near a petrochemical complex, but we still lack genomics information. Clarifying the relationship on a genetic and epigenetic level could provide insight on the affected molecular mechanisms and potential public health implications including individual susceptibility to environmental exposures.
Gene-environment interaction is the interplay between gene functions and environmental stress, which could influence phenotypes such as health outcomes. Epigenome has become a focus in gene-environment interactions due to the modifiable characteristics of epigenetic modulators. This suggests a potential role as a biomarker of previous exposures and early effects [4, 5]. DNA methylation is the most extensively studied epigenetic modulator in response to environmental stimuli. Alterations in DNA methylation involves the addition of a methyl group (-CH2) to the fifth carbon position of the cytosine base, a process facilitated by the DNA methyltransferases (DNMTs) enzymes [26]. This modification has the potential to influence gene expression and subsequent protein expression without altering the primary DNA sequence. It reflects the organism’s immediate adaptation to environmental exposures, possibly triggered by pollutants exposure stimulating the binding of transcription factors to CpG sites. This, in turn, affects DNMT access and therefore influences gene-specific DNA methylation [27].
Epigenetic studies have investigated heritable alterations in global and gene-specific DNA methylation following exposure to heavy metals such as arsenic (As), cadmium (Cd), nickel (Ni), lead (Pb), mercury (Hg), and chromium (Cr), as well as PAHs. These investigations have revealed associations between these alterations and adverse health effects, including oxidative stress, cardiovascular diseases, cancer, and respiratory diseases [28,29,30,31,32,33]. However, most of these studies focus on single pollutant exposure, leaving room for exploring exposure to multiple-pollutant mixtures [32]. The identification of gene-specific DNA methylation alterations induced by multiple-pollutant exposure could be used in finding surrogate biomarkers indicative of early health effects.
Individuals carrying distinct single-nucleotide polymorphisms (SNPs) may exhibit varying sensitivity to the toxicity of heavy metal and PAHs exposures [34,35,36]. SNPs refer to variations in DNA sequences where more than two types of nucleotides can exist at a specific position in DNA, differing among individuals. It is recognized as a useful and widely applicable biomarker to locate genetic distribution and predicting individual responses to specific disease or external stimuli [37,38,39,40]. Analyzing SNPs within populations residing in a highly polluted areas could unveil potential genetic markers that could serve as susceptibility biomarkers for risk assessment.
We systematically collected exposure, genetic, epigenetic, and metabolomic information on our study subjects, integrating and analyzing the data on both community and individual levels (Fig. 1). Our objective was to identify multiple-pollutant exposure-related DNA methylation probes, serving as possible markers for early effects, and SNPs that could act as potential surrogate marker for population susceptibility to the health impact of multiple exposures. We then integrated these findings with metabolomics data to strengthen the link between exposure and early health effects observed in residents. We successfully identified exposure-related CpG probes that annotate to genes associated with cancers and chronic kidney disease. Our findings include the aryl hydrocarbon receptor repressor (AHRR), which has been linked to an elevated risk of lung cancer. We further linked AHRR to deregulations in phenylalanine metabolism, alanine, aspartate, and glutamate metabolism, along with elevated oxidative stress. We also identified three SNPs that corresponded to exposure-related CpG probes and showed significant interaction with multiple heavy metals and PAHs exposure. These SNPs have been implicated in cancer progression and respiratory diseases.
Our study employed a precision environmental health approach in a region heavily influenced by a large petrochemical industry. Our SNPs finding could contribute to the identification of individuals at higher risk, and the epigenetic alterations we found may indicate early effects in residents living closer to the petrochemical complex and have been exposed to multiple industrial pollutants. Furthermore, our study enhances the understanding of the impact on critical biological mechanisms that could be precursors to chronic and acute diseases. This comprehensive information offers valuable insights for risk prediction and developing precision prevention and intervention strategies.
Results
Table 1 shows the basic characteristics, health data, and exposure levels for 159 study subjects. There was no significant difference between high and low exposure groups for age, sex, smoking, drinking, and betel nut chewing history, body mass index (BMI), systolic blood pressure (SBP), alanine transaminase (ALT), aspartate transaminase (AST), high-density lipoprotein cholesterol (HDL-C), and low-density lipoprotein cholesterol (LDL-C). All 12 exposure biomarkers were increased in high exposure group compared to low exposure group. In high exposure group, PAHs exposure biomarker urinary 1-hydroxypyrene (1-OHP) concentrations (0.23 ± 0.56 µmol/mol-creatinine) was significantly elevated compared to low exposure group (0.03 ± 0.01 µmol/mol-creatinine) (p < 0.0001). The same trend was found for urinary vanadium (V) (high exposure group: 1.62 ± 1.15 µg/g-creatinine; low exposure group: 0.24 ± 0.11 µg/g-creatinine; p < 0.0001). 1-OHP and V had the largest contrast between the two exposure groups with 7.67- and 6.75-fold change in urine concentrations, respectively. Hg had 3.02-fold change (p < 0.0001), Cr, manganese (Mn), and Ni with 2.3 (respectively, p = 0.011, < 0.0001, 0.024), strontium (Sr) 2.19 (p < 0.0001), As 1.90 (p < 0.0001), and thallium (Tl) 1.59 (p < 0.0001). Pb and copper (Cu) had 1.24- and 1.17-fold change with borderline statistical significance (p = 0.054 and 0.057, respectively). Cd was 1.39 times higher in high exposure group compared to low exposure group, but with no statistical significance (p = 0.646).
Despite the significant differences between high and low exposure groups, none of the exposure biomarkers we analyzed exceeded the acute toxicological thresholds. In the high exposure group, the pollutants’ concentration range was approximately equal to or lower than general occupational exposure levels. We had also excluded study subjects who reported to have worked at the petrochemical complex. Therefore, the exposure levels discussed in this study should represent local environmental exposure levels.
For community-level analysis, we examined the differences between high and low exposure groups for SNPs and DNA methylation. We did not find any significant association between SNPs and exposure status, indicating there are no distinctive genetic differences and no selection bias between our two exposure groups (Figure S5). We did identify 67 probes with DNA methylation levels significantly different between high and low exposure groups (p < 0.05 and ∆β >|0.1|), with 62 being hypomethylated and five hypermethylated, corresponding to 41 and 4 known human genes, respectively (Fig. 2; Table 2). These 45 genes were put through the Database for Annotation, Visualization, and Integrated Discovery (DAVID) platform’s functional annotation chart and 16 pathways were found (p < 0.05) (Table S2) [41]. These results indicated the multiple-pollutant exposure could impact the epigenomic layer and the probes we identified may potentially serve as surrogate early biomarkers for increased risk of adverse health effects.
The 67 CpG probes we identified correspond to 11 SNPs in our SNPs array. We used both quantitative and qualitative models to analyze whether these 11 SNPs are indeed associated with the methylation alteration of the corresponding probes. We then applied a linear regression model to investigate whether the interaction effect between exposure status and SNPs has significant influence on DNA methylation levels. We found in quantitative model all 11 SNPs were significantly associated with methylation alteration of corresponding CpG probes (p < 0.05). Linear regression analysis showed SNPs rs11085020 in Nuclear Factor I C (NFIC) and rs199442 in N-ethylmaleimide-sensitive factor (NSF) had significant interaction effects with exposure status that influenced DNA methylation level of corresponding CpG probe (p < 0.05) (Figures S6A and B). In qualitative model, all 11 SNPs were significantly associated with the methylation alterations of corresponding CpG probes (p < 0.05), and when the SNP variable was coded based on whether the minor allele was carried, rs10947050 in Ring Finger Protein 39 (RNF39) had significant interaction effects with exposure status that influenced DNA methylation level of corresponding CpG probe (p < 0.05) (Fig. S6C). However, when SNP variable was coded based on whether the major allele was carried, linear regression analysis did not identify any significant interaction effects between SNPs and exposure status. These results identified SNPs that could potentially serve as a surrogate marker for susceptibility.
We found that out of the 67 exposure-related CpG probes, 40 had DNA methylation levels significantly associated with at least one exposure-related metabolite we previously identified (p < 0.01) [11,12,13]. Methylation level of CpG probe cg01625212 was significantly associated with 9 exposure-related metabolites (7 urinary metabolite and 2 lipids), cg10632770 with 6 urinary metabolites, cg08238319 with 3 urinary and 2 serum metabolites (5 in total), cg21499289 with 5 urinary metabolites, cg00303108 with 4 metabolites (1 urinary, 1 lipid, and 2 serum metabolites), and cg02617100 with 4 urinary metabolites (Table 3). For cg01625212, 3 out of the 9 associated metabolites, palmitic acid, myristic acid, and stearic acid, are involved in fatty acid biosynthesis pathway. Cg21499289 (corresponding gene C9orf171) and cg02617100 (corresponding gene LINC00673) were also associated with both palmitic acid and stearic acid. Cg08238319 (corresponding gene AHRR) was associated with phenylalanine, an important exposure-related intermediate biomarker, as well as γ-Aminobutyric acid and oxoglutaric acid, both involved in alanine, aspartate, and glutamate metabolism, an exposure-related pathway we identified in our previous study. These results suggest a link from DNA methylation alterations caused by multiple exposure affected to biological mechanisms and early health effects we previously identified through metabolomics studies.
For individual-level analysis, Pearson’s correlation analysis identified 70 probes with DNA methylation levels associated with As urine concentration, 2 with Hg, and 46 with V (p < 1 × 10−5), corresponding to 62, 0, and 32 known human genes, respectively (Table 2). Interestingly, for As, the associated probes were mostly hypomethylated, while for V it was the opposite. This suggests different pollutants could have varying effects on DNA methylation. When we compare these results with community-level findings, only two genes significantly different between high and low exposure groups were also associated with urinary As concentrations (DFNA5、TSPY4). Gene-Set Enrichment Analysis (GSEA) pathway analysis results showed four pathways related to urinary As and Hg exposure levels each (p < 0.05), while no pathways were identified to be related to V exposure levels (Table S3) [42].
We employed weighted-quantile sum (WQS) regression for multi-pollutant analysis at an individual level. Figure 3 shows the association between the mixture of eight exposure biomarkers with the highest fold-change between high and low exposure groups (V, 1-OHP, Hg, Cr, Mn, Ni, Sr, and As) and the six exposure-related CpG probes that were linked to exposure-related metabolites (cg01625212, cg10632770, cg08238319, cg21499289, cg00303108, and cg02617100), respectively. For all six CpG probes, the association was statistically significant (p < 0.05), and V was the main contributor except for cg08238319. In Fig. 3A, V predominated the mixture index for cg01625212 as the largest (weight = 0.65) contributor to the mixture effect (cutoff weight defined as 0.125, the inverse of the number of variables included in the mixture) (p = 0.006). For cg10632770, V (weight = 0.45), Mn (weight = 0.22), and As (weight = 0.16) were the major contributors to the index (p = 0.011) (Fig. 3B). Hg was the main contributor for cg08238319 (weight = 0.39) with 1-OHP (weight = 0.27) (p < 0.001) (Fig. 3C). In cg21499289, V (weight = 0.38), As (weight = 0.15), 1-OHP (weight = 0.14), Sr (weight = 0.14) reached the cutoff weight for significant contribution (p = 0.010). V (weight = 0.35), 1-OHP (weight = 0.24), and Mn (weight = 0.18) contributions were most significant in the mixture effect for cg00303108 (p = 0.009) (Fig. 3E). Figure 3F shows that for cg02617100, V contributed to over half of the mixture index (weight = 0.59) followed by Hg (weight = 0.17) (p < 0.001).
Our results identified three SNPs that could potentially act as surrogate markers for susceptibility to multiple-pollutant exposure. We also found DNA methylation probes that could be affected by multiple-pollutant exposure, suggesting they could serve as possible surrogate biomarkers for early health effects. Additionally, we discovered DNA methylation alterations that could link multiple-pollutant exposure to metabolic changes and early health effects, including oxidative stress. However, due to the limited sample size, further studies are required to validate these findings.
Discussion
This is the first study to apply a multi-omics approach to investigate DNA methylation and SNPs profiles in industrial area residents chronically exposed to multiple pollutants. Previous studies mainly focused on single toxic exposure or specific SNPs and CpG sites. Through a comprehensive analysis of various omics layers, our study epitomizes the principles of precision environmental health [4]. We have identified and linked potential underlying individual risk factors (SNPs) and exposures (multiple pollutants) that contribute to early health effects with specific molecular endotypes (DNA methylation alterations and exposure-related metabolites). Our findings contribute to the understanding of the health impacts on the local community and population and could guide future prevention and intervention strategies.
Of the 45 differentially methylated genes we identified in community-level analysis, only aryl hydrocarbon receptor repressor (AHRR) had been reported in epigenetic studies for toxic exposure. Smoking-related Cd exposure and occupational PAHs exposure have been significantly associated with differentially methylated CpGs that annotate to AHRR gene [43,44,45]. In Taiwan, Tantoh et al. reported non-smoking adults living in areas with higher PM2.5 level had lower AHRR methylation [46].
Previous studies reported correlation between AHRR methylation and lung cancer risk, and Jacobsen et al. suggested adding AHRR (cg05575921) methylation as an eligibility criterion could enhance the specificity of low-dose computer tomography (LDCT) lung cancer screening [47]. We did not identify the same CpG probe cg05575921 in our study; however, we did find cg08238319 which also annotated to AHRR and was significantly hypomethylated in high exposure group compared to low exposure group (Table 2). There was no significant difference in smoking history between our high and low exposure groups (Table 1). However, due to the small sample size of our study subjects, we could not rule out the possible effects of smoking on AHRR methylation in our study subjects. We did establish connection between combined exposure biomarkers and cg08238319 DNA methylation in our study subjects, with Hg contributing to almost half the mixture index, followed by 1-OHP and V (Fig. 3C). Previous studies had not reported Hg and V exposure to be associated with AHRR methylation. Interestingly, we also did not identify association between Hg and AHRR or V and AHRR through individual-level analysis, which further suggest that it was the combined effect of multi-pollutant exposure that influenced AHRR methylation (Table 2).
We further linked AHRR methylation to five previously identified exposure-related metabolite features: 2-ethylhydracrylic acid, phenylalanine, γ-Aminobutyric acid, carnitine, and oxoglutaric acid (Table 3). We had reported in the same study area that As, Cd, and Ni exposure could affect phenylalanine metabolism pathway, and phenylalanine could be linked to oxidative stress biomarkers 8-OHdG, HNE-MA, 8-isoPGF2α, and 8-NO2Gua [11]. In the same study, we showed PAHs, As, Cu, Cd, Ni, and Hg exposure was associated with γ-Aminobutyric acid and oxoglutaric acid from alanine, aspartate, and glutamate metabolism and could also be linked to increased oxidative stress. Our findings suggest a potential gene-environment interaction from multi-pollutant exposure to AHRR DNA hypomethylation that could be further linked to deregulations in phenylalanine metabolism, alanine, aspartate, and glutamate metabolism, and elevated oxidative stress.
For the other differentially methylated genes we identified in community-level analysis, several have been reported in cancer studies including DFNA5, KIAA1199, and LINC00673 which were all hypomethylated in our high exposure group compared to low exposure group. DFNA5 methylation has been suggested as a biomarker for breast cancer, colorectal cancer, and gastric cancer [48,49,50]. Kuscu et al. reported KIAA1199 expression is upregulated in breast cancer through DNA methylation regulatory mechanisms [51]. LINC00673 is highly expressed in prostate cancer tissues [52]. MAD1L1 was hypermethylated in our study, and in previous studies methylation of MAD1L1 was negatively associated with cancer incidence [53]. For other diseases, DNA methylation differences for PTPRN2 was found in CKD patients [54]. We had previously reported elder and female residents living in the high exposure area had increased carcinogenic exposure and elevated risk of all cancers after the complex had been operating for 10 years [15, 16]. We also found increased risk of CKD associated with proximity to the complex and urine As and 1-OHP levels [18, 19]. These findings suggest the DNA methylation probes we identified could be considered as potential biomarkers for cancer and CKD after multiple-pollutant exposure.
For individual-level analysis, we found DNA methylation of 62 known human genes, including FOXM1, HDAC2, and SATB2, were associated with As urinary concentrations (Table 2). This supports previous studies that showed As exposure led to upregulation of gene transcription for proto-oncogene FOXM1, deregulation of HDAC2 protein levels, and overexpression of SATB2 in cell models [55,56,57]. Hg urine concentrations did not correspond to any known human genes, and although urinary levels of V were associated with DNA methylation of 32 genes, we did not find any previous studies reporting similar findings. This suggests that the DNA methylation probes we found could provide insight in understanding the epigenetic changes induced by heavy metals exposure.
There was little overlap between the exposure-related CpG probes identified through community-level analysis and individual-level analysis (Table 2). However, the CpG probes we identified through community-level analysis were significantly associated with the mixture of exposure biomarkers with major contributions from V, 1-OHP, Hg, and As (Fig. 3). This corresponded with our previous study where Hg and V were most prominent in the mixture effect associated with oxidative stress biomarkers [12, 13]. Our findings showed the difference between using multi-pollutant model and single pollutant models to identify epigenetic changes, reaffirming the importance of acknowledging and considering the combined effect of simultaneous exposure to multiple pollutants in real-world settings.
The three SNPs we identified that corresponded to exposure-related CpG sites with significant interaction effects with exposure status: rs11085020 in NFIC, rs199442 in NSF, and rs10947050 in RNF39 have not been reported in previous studies. However, NFIC is a transcription factor that plays a role in cell proliferation, differentiation, and migration during organ development and has been reported as a tumor suppressor gene in breast carcinomas, osteosarcoma, and T-cell lymphomas [58]. Lee et al. suggest through animal model results that NSF mediates inflammation responses in respiratory disease [59]. These SNPs we found could potentially serve as genetic markers for health-related risk evaluation in populations living in this area, especially based on our previous reports all cancers in elder female residents and respiratory diseases in children and adolescents [15, 22].
Although the exposure levels detected in this study does not exceed acute toxicological thresholds, heavy metals could still accumulate and/or have synergistic effects in the body after exposure. Residents living near petrochemical industrial areas are likely to experience long-term and stable low-dose exposure to heavy metals due to the continuous operation of these complexes. Previous studies we conducted in this area have shown that even at such low exposure levels, significant correlation between the level of exposure and health effects can still be observed, including early health effects such as increased oxidative stress and alterations in metabolite profiles in high exposure group compared to low exposure group [11,12,13].
There are limitations to this study. Firstly, we applied genome-wide DNA methylation sequencing analysis which still has the possibility of inaccurate identification and we could not provide exact quantification of DNA methylation levels. Secondly, to maximize the possibility to identify potential loci, we used a loose p value threshold. However, it allowed us to extend the coverage of more loci. Lastly, our study subjects were selected from a prospective cohort of 3230 participants. Due to sample availability and required sample quality for multi-omics analyses, our sample size was limited. However, our selection criteria minimized bias and ensured representation of the cohort. Additionally, it is a cross-sectional study and therefore we were unable to verify the stability of the quantitative changes we identified and thus large samples size is required to further validate the findings before its applications. It is also possible that other confounding factors such as preexisting physical conditions, dietary habits, and occupational exposures could have influenced exposure levels, DNA methylation, and metabolomics results. We minimized these biases by selecting participants with no prior chronic diseases and no prior work experience at the pollutants’ main emission source, No. 6 Naphtha Cracking Complex, according to their interview-administered questionnaire surveys.
Conclusion
In this study, we applied a precision environmental health approach in a highly polluted industrial community and identified DNA methylation probes that could serve as surrogate markers for early effects of multiple-industrial-pollutant exposure, three SNPs that could potentially be used to identify vulnerable populations more susceptible to multiple-industrial-pollutant exposure, and gene-environment interactions that linked multiple-pollutant exposure with epigenetic changes and biological pathways related to adverse health outcomes such as increased oxidative stress and cancer. Our findings characterized the complexity of exposure and health impacts and can provide information for risk prediction models and the development of precision prevention and intervention strategies in this area.
Materials and methods
Study area and subjects
Our study comprised 159 subjects selected from a prospective cohort of 3,230 participants who had resided in communities surrounding No. 6 Naphtha Cracking Complex for at least five years. All subjects completed interview-administered questionnaire surveys to gather information on age, gender, smoking, alcohol consumption, and betelnut chewing habits. Additionally, they underwent a health examination, including measurements of height, weight, and blood pressure.
Each participant provided a morning spot urine sample for the analysis of exposure biomarkers and a fasting blood sample to measure ALT, AST, HDL-C, and LDL-C.
We categorized the study area based on the proximity of home addresses to the complex, resulting in a high exposure community (three townships closest to the complex) and a low exposure community (seven townships further away) (see Fig. S1).
We used urine exposure biomarkers, specifically V levels and PAHs exposure biomarker 1-OHP, along with home addresses to define the high and low exposure groups. Out of our 159 study subjects, 78 residing in the high exposure community, with urinary 1-OHP and V levels in the top 60% of the cohort, were identified as the high exposure group. The remaining 81 subjects, living in the low exposure community with urine concentrations of 1-OHP and V in the bottom 40% of the cohort, were identified as the low exposure group.
The No. 6 Naphtha Cracking Complex commenced operations in 1999 and is situated in Yunlin County on the west coast of central Taiwan, covering a total area of 2,603 hectares. The complex comprises 53 plants, including a coal-fired power plant with a total capacity of 1.8 million kW, oil refinery plants processing 530,000 barrels of crude oil per day, three naphtha cracking plants producing 2.935 million tons of ethylene annually, and co-generation plants with a capacity of 2.75 million kW [60].
The prospective cohort used in this study was recruited from 2009 to 2011. Approval for this study was granted by the Research Ethics Committee of the National Health Research Institutes (accession number: 201704053RIND), and informed consent was obtained from each participant.
Internal exposure
Urinary levels of V, As, Hg, Cd, Cr, Ni, Pb, Mn, Cu, Sr, and Tl were analyzed using inductively coupled plasma mass spectrometry (ICP-MS) and 1-OHP using high-performance liquid chromatography (HPLC) following previously reported methods [9,10,11]. Standard reference materials were used to confirm accuracy (SERO, Billingstad, Norway). In each experiment batch, we ensured the relative error of ten spiked samples was below 10% for measurement stability. Batches with a recovery rate lower than 85% were reanalyzed.
The method detection limit (MDL) for each exposure biomarker was 0.016 (V), 3.325 (As), 0.440 (Hg), 0.129 (Cd), 0.155 (Cr), 1.204 (Ni), 0.300 (Pb), 0.060 (Mn), 1.444 (Cu), 3.920 (Sr), and 0.041 (Tl) μg/L. 1-OHP analysis had an MDL of 0.01 ng/mL with an 89.6% recovery rate, and the coefficient of variation was 4.0% for repeated measurements. Urine concentration of exposure biomarkers below the MDL was replaced by half of the MDL for data analysis.
To minimize batch variations, we included the analysis of standard tune solution in each batch before sample detection to ensure instrument stability and adjust signal quantification. National Taiwan University Hospital medical diagnosis laboratory analyzed urinary creatinine by enzyme-linked immunosorbent assay, and we used the creatinine concentrations to adjust urinary exposure biomarker levels. All urine samples underwent creatinine analysis, and samples with urinary creatinine concentrations below 30 or above 300 mg/dL were excluded from further data analysis due to potential abnormalities of unknown reasons.
DNA extraction
Two hundred microliters of whole blood was utilized for DNA extraction in each study subject, employing the QIAamp Blood Mini Kit (QIAGEN). The extracted DNA concentration must exceed 100 ng/μL, with an A260/A280 ratio between 1.6 and 2.0, A230/A260 greater than 1.6, A320 near 0, and levels of fragmentation checked via electrophoresis. These criteria ensure compliance with the quality standards required for DNA methylation and SNP analysis.
SNPs analysis
The DNA from each study subject was individually adjusted to a concentration of 15 ng/μL, and 50 μL of each individual's sample was then placed into ABgene 96-well plates. These prepared samples were sent to the National Center for Genome Medicine for analysis using the Affymetrix Axiom genome-wide array TWB 2.0 (Thermo Fisher Scientific), which contains 752,921 probes.
DNA methylation analysis
Five hundred nanograms of DNA samples from each study subject was used for genome-wide DNA methylation sequencing analysis. Samples were prepared using the Illumina Infinium Human MethylationEPIC BeadChip platform following the manufacturer's standard protocol. Subsequently, the prepared samples were scanned with Illumina HiScan (Illumina, Inc.), and DNA methylation levels were analyzed using GenomeStudio software v2011.1. The analysis covered more than 850,000 CpG sites.
Exposure-related metabolite features
Exposure-related urinary metabolite features, serum metabolite features, and serum lipid features were identified in previous studies [11,12,13]. A total of 103 metabolite features were included for data integration and analysis, comprising 76 urinary metabolite features, 9 serum metabolite features, and 18 serum lipid features (Table S1). Of the urinary metabolites, data were available for 49 study subjects, including 21 from the high exposure group and 28 from the low exposure group. Serum metabolite data were available for 43 study subjects (high exposure group N = 23, low exposure group N = 20), and lipid information was available for 44 study subjects, with 20 in the high exposure group and 24 in the low exposure group.
Community-level analysis
We first examined the differences between high and low exposure groups across various parameters, including basic characteristics, urinary exposure biomarkers, SNPs, and DNA methylation. To compare basic characteristics, we used Student’s t-test for continuous variables and Chi-squared test or Fisher’s exact test for discrete variables. For exposure biomarkers, urinary concentrations underwent log transformation and were subsequently compared using ANCOVA, adjusting for age, gender, smoking, alcohol consumption, betel nut chewing, and fish consumption. Post-comparisons were made using Scheffe test. For SNPs array data, a Pearson correlation test was applied to clarify the association between SNPs and exposure. For DNA methylation, all β-values from the microarrays were first transformed into M-values for improved consistency and robustness. Wilcoxon’s rank sum test was performed to identify exposure-related probes, considering the probes with p < 0.05 and ∆β >|0.1| as significant (Figure S2). To further explore the biological relevance, gene annotation was performed to identify known human genes corresponding to the exposure-related probes. The identified genes were then put through pathway analysis using the Database for Annotation, Visualization, and Integrated Discovery (DAVID) platform [41].
Association between SNPs and DNA methylation level of corresponding CpG probes was examined using Fisher’s exact test and Wilcoxon’s rank sum test. Additionally, a linear regression model was applied to investigate whether the interaction effect between SNPs and exposure status has significant influence on DNA methylation levels (Fig. S3). These analyses were conducted under both quantitative and qualitative models. In quantitative model, an SNP variable would be coded as “0”, “1”, or “2” based on the allele pair, while in the qualitative model, the SNP variable were encoded as “0” or “1” depending on whether the corresponding allele was carried (Figure S4). For Fisher’s exact test, study subjects were categorized into two DNA methylation groups using two distinct approaches: hypermethylated (β < 0.3)/unchanged (β ≥ 0.3) or hypomethylated (β > 0.7)/unchanged (β ≤ 0.7) (Figure S4).
To ascertain the association between exposure-related CpG probes and exposure-related metabolites, we conducted Pearson’s correlation analysis to examine the significance of difference between high and low exposure groups (p < 0.01).
Individual-level analysis
Pearson’s correlation analysis was applied to identify DNA methylation probes exhibiting a significant association to urinary As, Hg, and V concentrations, respectively (p < 1 × 10−5). All 865,918 CpG probes from the DNA methylation microarray along with individual urine As, Hg, and V concentrations data were put through Gene-Set Enrichment Analysis (GSEA) for pathway analysis by random walk approach [42].
For multi-pollutant analysis at an individual level, WQS regression analysis was applied to analyze the association between combined exposure biomarkers with CpG probes, using 100 bootstrap samples (p < 0.05).
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Rappaport SM, Smith MT. Epidemiology. Environment and disease risks. Science. 2010;330(6003):460–1.
Tchounwou PB, Yedjou CG, Patlolla AK, Sutton DJ. Heavy Metal Toxicity and the Environment. In: Luch A, editor. Molecular, Clinical and Environmental Toxicology: Volume 3: Environmental Toxicology. Basel: Springer Basel; 2012. p. 133–64.
Henkler F, Stolpmann K, Luch A. Exposure to Polycyclic Aromatic Hydrocarbons: Bulky DNA Adducts and Cellular Responses. In: Luch A, editor. Molecular, Clinical and Environmental Toxicology: Volume 3: Environmental Toxicology. Basel: Springer Basel; 2012. p. 107–31.
Baccarelli A, Dolinoy DC, Walker CL. A precision environmental health approach to prevention of human disease. Nat Commun. 2023;14(1):2449.
Wu H, Eckhardt CM, Baccarelli AA. Molecular mechanisms of environmental exposures and human disease. Nat Rev Genet. 2023;24(5):332–44.
Shie RH, Chan CC. Tracking hazardous air pollutants from a refinery fire by applying on-line and off-line air monitoring and back trajectory modeling. J Hazard Mater. 2013;261:72–82.
Shie RH, Yuan TH, Chan CC. Using pollution roses to assess sulfur dioxide impacts in a township downwind of a petrochemical complex. J Air Waste Manag Assoc. 2013;63(6):702–11.
Chio CP, Yuan TH, Shie RH, Chan CC. Assessing vanadium and arsenic exposure of people living near a petrochemical complex with two-stage dispersion models. J Hazard Mater. 2014;271:98–107.
Yuan TH, Shie RH, Chin YY, Chan CC. Assessment of the levels of urinary 1-hydroxypyrene and air polycyclic aromatic hydrocarbon in PM2.5 for adult exposure to the petrochemical complex emissions. Environ Res. 2015;136:219–26.
Yuan TH, Chio CP, Shie RH, Pien WH, Chan CC. The distance-to-source trend in vanadium and arsenic exposures for residents living near a petrochemical complex. J Expo Sci Environ Epidemiol. 2016;26(3):270–6.
Chen CS, Yuan TH, Shie RH, Wu KY, Chan CC. Linking sources to early effects by profiling urine metabolome of residents living near oil refineries and coal-fired power plants. Environ Int. 2017;102:87–96.
Chen CS, Kuo TC, Kuo HC, Tseng YJ, Kuo CH, Yuan TH, et al. Metabolomics of children and adolescents exposed to industrial carcinogenic pollutants. Environ Sci Technol. 2019;53(9):5454–65.
Chen CS, Kuo TC, Kuo HC, Tseng YJ, Kuo CH, Yuan TH, et al. Lipidomics of children and adolescents exposed to multiple industrial pollutants. Environ Res. 2021;201:111448.
Yuan TH, Chung MK, Lin CY, Chen ST, Wu KY, Chan CC. Metabolites of resident exposed to vanadium and PAHs in the vicinity of a petrochemical complex. Sci Total Environ. 2016;548–549:260–9.
Yuan TH, Shen YC, Shie RH, Hung SH, Chen CF, Chan CC. Increased cancers among residents living in the neighborhood of a petrochemical complex: a 12-year retrospective cohort study. Int J Hyg Environ Health. 2018;221(2):308–14.
Chen CF, Chio CP, Yuan TH, Yeh YP, Chan CC. Increased cancer incidence of Changhua residents living in Taisi Village north to the No. 6 Naphtha cracking complex. J Formos Med Assoc. 2018;117(12):1101–7.
Yuan TH, Chen JL, Shie RH, Yeh YP, Chen YH, Chan CC. Liver fibrosis associated with potential vinyl chloride and ethylene dichloride exposure from the petrochemical industry. Sci Total Environ. 2020;739:139920.
Yuan TH, Jhuang MJ, Yeh YP, Chen YH, Lu S, Chan CC. Relationship between renal function and metal exposure of residents living near the No. 6 Naphtha Cracking Complex: a cross-sectional study. J Formos Med Assoc. 2021;120(10):1845–54.
Yuan TH, Ke DY, Wang JE, Chan CC. Associations between renal functions and exposure of arsenic and polycyclic aromatic hydrocarbon in adults living near a petrochemical complex. Environ Pollut. 2020;256:113457.
Wang CW, Liao KW, Chan CC, Yu ML, Chuang HY, Chiang HC, et al. Association between urinary thiodiglycolic acid level and hepatic function or fibrosis index in school-aged children living near a petrochemical complex. Environ Pollut. 2019;244:648–56.
Shun CH, Yuan TH, Hung SH, Yeh YP, Chen YH, Chan CC. Assessment of the hyperlipidemia risk for residents exposed to potential emitted metals in the vicinity of a petrochemical complex. Environ Sci Pollut Res Int. 2021;28(22):27966–75.
Chiang TY, Yuan TH, Shie RH, Chen CF, Chan CC. Increased incidence of allergic rhinitis, bronchitis and asthma, in children living near a petrochemical complex with SO2 pollution. Environ Int. 2016;96:1–7.
Chuang HC, Shie RH, Chio CP, Yuan TH, Lee JH, Chan CC. Cluster analysis of fine particulate matter (PM2.5) emissions and its bioreactivity in the vicinity of a petrochemical complex. Environ Pollut. 2018;236:591–7.
Wild CP. Environmental exposure measurement in cancer epidemiology. Mutagenesis. 2009;24(2):117–25.
Vineis P, van Veldhoven K, Chadeau-Hyam M, Athersuch TJ. Advancing the application of omics-based biomarkers in environmental epidemiology. Environ Mol Mutagen. 2013;54(7):461–7.
Miller OJ, Schnedl W, Allen J, Erlanger BF. 5-Methylcytosine localised in mammalian constitutive heterochromatin. Nature. 1974;251(5476):636–7.
Sharavanan VJ, Sivaramakrishnan M, Sivarajasekar N, Senthilrani N, Kothandan R, Dhakal N, et al. Pollutants inducing epigenetic changes and diseases. Environ Chem Lett. 2020;18(2):325–43.
Brocato J, Costa M. Basic mechanics of DNA methylation and the unique landscape of the DNA methylome in metal-induced carcinogenesis. Crit Rev Toxicol. 2013;43(6):493–514.
Clancy HA, Sun H, Passantino L, Kluz T, Muñoz A, Zavadil J, et al. Gene expression changes in human lung cells exposed to arsenic, chromium, nickel or vanadium indicate the first steps in cancer. Metallomics. 2012;4(8):784–93.
Das DN, Ravi N. Influences of polycyclic aromatic hydrocarbon on the epigenome toxicity and its applicability in human health risk assessment. Environ Res. 2022;213:113677.
Lin CY, Lee HL, Hwang YT, Huang PC, Wang C, Sung FC, et al. Urinary heavy metals, DNA methylation, and subclinical atherosclerosis. Ecotoxicol Environ Saf. 2020;204:111039.
Martin EM, Fry RC. Environmental Influences on the epigenome: exposure- associated DNA methylation in human populations. Annu Rev Public Health. 2018;39:309–33.
Shiek SS, Mani MS, Kabekkodu SP, Dsouza HS. Health repercussions of environmental exposure to lead: methylation perspective. Toxicology. 2021;461:152927.
Eom SY, Lim JA, Kim YD, Choi BS, Hwang MS, Park JD, et al. Allele frequencies of the single nucleotide polymorphisms related to the body burden of heavy metals in the korean population and their ethnic differences. Toxicol Res. 2016;32(3):195–205.
Kayaaltı Z, Aliyev V, Söylemezoğlu T. The potential effect of metallothionein 2A–5 A/G single nucleotide polymorphism on blood cadmium, lead, zinc and copper levels. Toxicol Appl Pharmacol. 2011;256(1):1–7.
Lee DG, Schuetz JM, Lai AS, Burstyn I, Brooks-Wilson A, Aronson KJ, et al. Interactions between exposure to polycyclic aromatic hydrocarbons and xenobiotic metabolism genes, and risk of breast cancer. Breast Cancer. 2022;29(1):38–49.
Do CB, Hinds DA, Francke U, Eriksson N. Comparison of family history and SNPs for predicting risk of complex disease. Plos Genet. 2012;8(10):e1002973.
Johnson AD. Single-nucleotide polymorphism bioinformatics a comprehensive review of resources. Circ Cardiovascul Genet. 2009;2(5):530-U289.
LaFramboise T. Single nucleotide polymorphism arrays: a decade of biological, computational and technological advances. Nucleic Acids Res. 2009;37(13):4181–93.
Landegren U, Nilsson M, Kwok PY. Reading bits of genetic information: methods for single-nucleotide polymorphism analysis. Genome Res. 1998;8(8):769–76.
da Huang W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci. 2005;102(43):15545–50.
Alhamdow A, Lindh C, Hagberg J, Graff P, Westberg H, Krais AM, et al. DNA methylation of the cancer-related genes F2RL3 and AHRR is associated with occupational exposure to polycyclic aromatic hydrocarbons. Carcinogenesis. 2018;39(7):869–78.
Domingo-Relloso A, Riffo-Campos AL, Haack K, Rentero-Garrido P, Ladd-Acosta C, Fallin DM, et al. Cadmium, smoking, and human blood DNA methylation profiles in adults from the strong heart study. Environ Health Perspect. 2020;128(6):67005.
Lee JE, Kim HR, Lee MH, Kim NH, Wang KM, Lee SH, et al. Smoking-related DNA methylation is differentially associated with cadmium concentration in blood. Biochem Genet. 2020;58(4):617–30.
Tantoh DM, Lee K-J, Nfor ON, Liaw Y-C, Lin C, Chu H-W, et al. Methylation at cg05575921 of a smoking-related gene (AHRR) in non-smoking Taiwanese adults residing in areas with different PM2.5 concentrations. Clin Epigenet. 2019;11(1):69.
Jacobsen KK, Schnohr P, Jensen GB, Bojesen SE. AHRR (cg05575921) methylation safely improves specificity of lung cancer screening eligibility criteria: a cohort study. Cancer Epidemiol Biomarkers Prev. 2022;31(4):758–65.
Akino K, Toyota M, Suzuki H, Imai T, Maruyama R, Kusano M, et al. Identification of DFNA5 as a target of epigenetic inactivation in gastric cancer. Cancer Sci. 2007;98(1):88–95.
Croes L, Beyens M, Fransen E, Ibrahim J, Vanden Berghe W, Suls A, et al. Large-scale analysis of DFNA5 methylation reveals its potential as biomarker for breast cancer. Clin Epigenet. 2018;10:51.
Yokomizo K, Harada Y, Kijima K, Shinmura K, Sakata M, Sakuraba K, et al. Methylation of the DFNA5 gene is frequently detected in colorectal cancer. Anticancer Res. 2012;32(4):1319–22.
Kuscu C, Evensen N, Kim D, Hu YJ, Zucker S, Cao J. Transcriptional and epigenetic regulation of KIAA1199 gene expression in human breast cancer. PLoS ONE. 2012;7(9):e44661.
Jiang Z, Zhang Y, Chen X, Wu P, Chen D. Long non-coding RNA LINC00673 silencing inhibits proliferation and drug resistance of prostate cancer cells via decreasing KLF4 promoter methylation. J Cell Mol Med. 2020;24(2):1878–92.
Joyce BT, Zheng Y, Nannini D, Zhang Z, Liu L, Gao T, et al. DNA methylation of telomere-related genes and cancer risk. Cancer Prev Res. 2018;11(8):511–22.
Smyth LJ, McKay GJ, Maxwell AP, McKnight AJ. DNA hypermethylation and DNA hypomethylation is present at different loci in chronic kidney disease. Epigenetics. 2014;9(3):366–76.
Liu Y, Hock JM, Van Beneden RJ, Li X. Aberrant overexpression of FOXM1 transcription factor plays a critical role in lung carcinogenesis induced by low doses of arsenic. Mol Carcinog. 2014;53(5):380–91.
Pournara A, Kippler M, Holmlund T, Ceder R, Grafström R, Vahter M, et al. Arsenic alters global histone modifications in lymphocytes in vitro and in vivo. Cell Biol Toxicol. 2016;32(4):275–84.
Chen QY, Li J, Sun H, Wu F, Zhu Y, Kluz T, et al. Role of miR-31 and SATB2 in arsenic-induced malignant BEAS-2B cell transformation. Mol Carcinog. 2018;57(8):968–77.
Chen KS, Lim JWC, Richards LJ, Bunt J. The convergent roles of the nuclear factor I transcription factors in development and cancer. Cancer Lett. 2017;410:124–38.
Lee JY, Linge HM, Ochani K, Lin K, Miller EJ. N-Ethylmaleimide Sensitive Factor (NSF) Inhibition Prevents Vascular Instability following Gram-Positive Pulmonary Challenge. PLoS One. 2016;11(6):e0157837.
FPCC. About No.6 Naphtha Cracker Complex—Magnitude & Facilities 2023 [Available from: http://www.fpcc.com.tw/en/about/no6.]
Acknowledgements
Not applicable.
Funding
This study was funded by Taiwan’s National Health Research Institutes (Grant Number: NHRI-EX110-10832PI), National Science and Technology Council (Grant Numbers: MOST 104-2314-B-002-170, MOST 105-2314-B-002-050-MY3, 113-2621-M-002-016-), and Ministry of Education (Grant Number: 109L9003).
Author information
Authors and Affiliations
Contributions
CSC acquired and analyzed metabolomics data, interpreted the overall results, and wrote the manuscript. TY made substantial contribution to the conception and design of the work, sample acquisition, and acquired exposure data. TL oversaw analyzing DNA methylation and SNPs data, overall statistical analysis, and revision of manuscript. HL analyzed DNA methylation and SNPs data. YC drafted the work. LL contributed to the conception and design of the work. MT contributed to the conception and design of the work. EYC made substantial contributions to the conception and design of the work. CC made substantial contributions to the conception and design of the work and substantively revised the manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
This study was approved by the Research Ethics Committee of National Health Research Institutes (accession number: 201704053RIND), with informed consent obtained from each participant.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests. The authors declare they have no actual or potential competing financial interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Chen, CH.S., Yuan, TH., Lu, TP. et al. Exposure-associated DNA methylation among people exposed to multiple industrial pollutants. Clin Epigenet 16, 111 (2024). https://doi.org/10.1186/s13148-024-01705-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13148-024-01705-y