Differential methylation in EGLN1 associates with blood oxygen saturation and plasma protein levels in high-altitude pulmonary edema
Clinical Epigenetics volume 14, Article number: 123 (2022)
High-altitude (HA, 2500 m) hypoxic exposure evokes a multitude of physiological processes. Although the hypoxia-sensing genes influence transcriptional output in disease susceptibility, the exact regulatory mechanisms remain undetermined in high-altitude pulmonary edema (HAPE). Here, we investigated the differential DNA methylation distribution in the two genes encoding the oxygen-sensing HIF-prolyl hydroxylases, prolyl hydroxylase domain protein 2 (PHD2) and factor inhibiting HIF-1α (FIH-1), and the consequent contributions to the HAPE pathophysiology.
Deep sequencing of the sodium bisulfite converted DNA segments of the two genes, Egl nine homolog 1 (EGLN1) and Hypoxia Inducible Factor 1 Subunit Alpha Inhibitor (HIF1AN), was conducted to analyze the differential methylation distribution in three study groups, namely HAPE-patients (HAPE-p), HAPE-free sojourners (HAPE-f) and healthy HA natives (HLs). HAPE-p and HAPE-f were permanent residents of low altitude (< 200 m) of North India who traveled to Leh (3500 m), India, and were recruited through Sonam Norboo Memorial (SNM) hospital, Leh. HLs were permanent residents of altitudes at and above 3500 m. In addition to the high resolution, bisulfite converted DNA sequencing, gene expression of EGLN1 and HIF1AN and their plasma protein levels were estimated.
A significantly lower methylation distribution of CpG sites was observed in EGLN1 and higher in HIF1AN (P < 0.01) in HAPE-p compared to the two control groups, HAPE-f and HLs. Of note, differential methylation distribution of a few CpG sites in EGLN1, 231,556,748, 231,556,804, 231,556,881, 231,557,317 and 231,557,329, was significantly associated with the risk of HAPE (OR = 4.79–10.29; P = 0.048–0.004). Overall, the methylation percentage in EGLN1 correlated with upregulated plasma PHD2 levels (R = − 0.36, P = 0.002) and decreased peripheral blood oxygen saturation (SpO2) levels (R = 0.34, P = 0.004). We also identified a few regulatory SNPs in the DNA methylation region of EGLN1 covering chr1:231,556,683–231,558,443, suggesting a functional role of the differential methylation distribution of these CpG sites in the regulation of the genes and consequently in HIF-1α signaling.
Significantly lower methylation distribution in EGLN1 and the consequent physiological influences annotated its functional epigenetic relevance in the HAPE pathophysiology.
Oxygen is vital for all living organisms as their evolution relies on the body’s homeostasis mechanisms for the efficiency of the energy-generating processes. Oxygen also acts as a developmental morphogen influencing the differentiation of the progenitor cells. These cellular differentiations are primarily driven through oxygen-modulated epigenetic modifying enzymes such as the Ten-Eleven-Translocation family of dioxygenases (TETs) and DNA methyltransferases (DNMTs) [3, 4]. Any deviation in the oxygen supply chain can be detrimental to human beings and manifests as various cardiovascular pathophysiologies. Nonetheless, the human body has a remarkable ability to sense and adapt to changes in oxygen availability. These adaptations are primarily carried out by the hypoxia-inducible factor-1α (HIF-1α), which transactivates several genes in response to oxygen fluctuations. Along with HIF, the two oxygen sensors, Prolyl hydroxylase domain protein 2 (PHD2) and Factor inhibiting HIF-1α (FIH-1), play pivotal roles in the regulation of the hypoxia signaling pathway. They utilize molecular oxygen as a co-substrate to hydroxylate the HIF-1α subunit, leading to its degradation and inactivation under normoxic conditions. The genes encoding PHD2 and FIH-1 are Egl nine homolog 1 (EGLN1) and Hypoxia Inducible Factor 1 Subunit Alpha Inhibitor (HIF1AN), respectively. Being cardinal to human life, the HIF-1α pathway has received considerable attention in efforts to understand the physiological processes under the hypobaric hypoxia conditions prevalent at high altitude (HA), 2500 m above sea level. Despite the interesting insights on hypoxia as a driving force of numerous genetic adaptations and maladaptations, our current knowledge of epigenetic modifications of the oxygen-sensing genes remains inadequate. The physiological response to hypoxia is impacted by both genetic and epigenetic mechanisms [8, 9].
The methylation of the hypoxia response element (HRE) recognized by HIF is poorly understood, but it can potentially impact the HIF transactivation of its target genes [10, 11]. Additionally, the presence of CpG islands in numerous HIF signaling genes emphasizes the overall influence of epigenetics on the hypoxia response of the human body. DNA methylation, one of the major epigenetic modifications regulating gene expression, is the mediator of crosstalk between genes and the environment. It occurs at the 5-carbon position of cytosine in CpG dinucleotide sites located in CpG-rich regions known as CpG islands, which are primarily present in gene promoters. Both EGLN1 and HIF1AN contain CpG islands, implicating a probable role of epigenetics in regulating their respective gene expression and, subsequently, HIF signaling.
PHD2 works by hydroxylating two proline residues in the oxygen-dependent degradation domain of HIF-1α so that it is recognized by the Von Hippel–Lindau protein ubiquitin ligase machinery for its proteasomal degradation. In contrast, FIH-1 hydroxylates the asparagine residue in the transactivation domain of the HIF-1α subunit to block the binding of HIF-1α with the co-activators p300/CREB-binding protein, inhibiting its transcriptional activity. Since oxygen is essential for the activity of both hydroxylases, the low oxygen condition or hypoxia suppresses their catalytic activities, stabilizing the HIF-1α subunit. Our prior work had identified the differential distribution of EGLN1 polymorphisms and altered transcription factors on respective loci, highlighting the genetic role of EGLN1 in pathophysiological regulation in high-altitude pulmonary edema (HAPE) [16, 17]. This acute and severe HA illness occurs in unacclimatized individuals rapidly exposed to HA. Mode of ascent, altitude, speed and individual susceptibility are the most critical determinants for the occurrence of HAPE. In continuation of our pursuit of EGLN1 regulation at HA and its importance in HAPE pathophysiology, the present study investigated the epigenetic roles in differential DNA methylation distributions of EGLN1 and HIF1AN and their associations with the clinical outcome. The study performed targeted deep sequencing of the two genes to evaluate the methylation percentage in their respective CpG islands in the three study groups, namely HAPE-patients (HAPE-p) and the two healthy control groups, HAPE-free sojourners (HAPE-f) and healthy HA natives (HLs). HAPE-p and HAPE-f were permanent residents of low altitude (< 200 m) of North India and were of Indo-Aryan ethnicity; both groups visited HA. HAPE-p were sojourners who suffered the disorder upon exposure to HA.
HAPE-f were the healthy subjects who visited HA under similar conditions and carried out routine strenuous physical activities but did not suffer from the disorder. HLs, of Tibeto-Burman ethnicity, had been permanent residents of altitudes at and above 3500 m for many generations. Subsequently, the methylation percentage was correlated with the respective protein expressions and the peripheral blood oxygen saturation (SpO2) levels to understand the functional consequences of these epigenetic modifications in the disease pathophysiology. The DNA methylation investigation of the two genes provided interesting insights into the engagement of CpG sites in regulating the two genes and consequently in the HIF-1α signaling at HA.
Clinical parameters reveal low blood peripheral oxygen saturation levels in HAPE
Clinical parameters differed significantly in HAPE patients compared to the control groups, i.e., HAPE-f and HLs (Table 1). Of note, in patients, SpO2 decreased significantly as compared to the two control groups (P < 0.0001, Additional file 1: Fig. 1); whereas it remained comparable in the two control groups, i.e., HAPE-f and HLs (P > 0.05).
Gene expression and plasma protein level
The EGLN1 mRNA expression was 1.38-fold higher in HAPE-p compared to HAPE-f (P < 0.0001; Fig. 1ai). However, no significant difference was observed in HIF1AN gene expression (Fig. 1aii). In the case of protein expression, the plasma levels of both PHD2 and FIH-1 were significantly upregulated in HAPE-p compared to HAPE-f (P < 0.001; Fig. 1bi & ii).
Targeted gene methylation profiling reveals differential patterns
After the extensive quality checks of libraries for 32 samples in each group, only 26 HAPE-p, 26 HAPE-f and 24 HLs samples proceeded for deep sequencing and further analysis. Sequential analysis was performed starting with the entire CpG regions of the two genes, EGLN1 and HIF1AN, to shorter regions and finally to specific sites. The dot plot of CpG methylation in EGLN1 in the three study groups revealed 97 CpG sites in EGLN1 CpG island 179 and 46 CpG sites in HIF1AN CpG island 47 (Additional file 1: Figs. 2 and 3). A population-wise distribution of 5mC sites observed in the CpG region for each gene was calculated as the ratio of the total 5mC (methylated) count and the entire site (methylated and unmethylated) count for each study group (Fig. 2a). The differential methylation of CpG sites in EGLN1 in the three groups mostly occurred at the two peripheries of the CpG island, while a substantial part of the central region remained unmethylated. HIF1AN CpG island, on the other hand, presented a rather scattered distribution (Fig. 2bi & ii). The cumulative methylation percentage of CpG sites in EGLN1 was 36.3 ± 10.6 in HAPE-p as compared to 38.1 ± 6.3 in HAPE-f (P ≤ 0.05) and 51.4 ± 20.1 in HLs (Fig. 2a & Additional file 1: Table 1). The cumulative methylation percentage of CpG sites in HIF1AN was 71.6 ± 10.6 in HAPE-p compared to 62.3 ± 22.7 in HAPE-f and 66.9 ± 23 in HLs (Fig. 2a & Additional file 1: Table 1). As shown in the heat map, a detailed analysis of each CpG site in all the samples showed that only 43 CpG sites out of 97 in EGLN1 and 45 CpG sites out of 46 in HIF1AN demonstrated the differential distribution (Fig. 3ai & ii). The cumulative methylation percentage from these selected CpG sites further improved the significance of differential distribution in the three study groups. The methylation percentage of the CpG island of EGLN1 in HAPE-p was significantly decreased compared to the two control groups (P < 0.01; Fig. 3bi). 
However, for HIF1AN, the methylation percentage in HAPE-p was significantly increased compared to the two controls (P < 0.01, Fig. 3bii).
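The ratio described above (methylated count over methylated plus unmethylated count) can be illustrated with a minimal Python sketch. The function names and the read counts are hypothetical and are not taken from the study; the sketch pools counts before taking the ratio, which is only one plausible way to arrive at a cumulative percentage.

```python
def methylation_percentage(methylated: int, unmethylated: int) -> float:
    """Percent methylation = 100 * 5mC calls / (5mC + unmethylated calls)."""
    total = methylated + unmethylated
    if total == 0:
        raise ValueError("no coverage at this site")
    return 100.0 * methylated / total


def group_methylation(site_counts):
    """Pool (methylated, unmethylated) counts over sites, then take the ratio."""
    meth = sum(m for m, _ in site_counts)
    unmeth = sum(u for _, u in site_counts)
    return methylation_percentage(meth, unmeth)


# Hypothetical counts for three CpG sites (illustrative only).
example_sites = [(30, 70), (10, 90), (50, 50)]
print(round(group_methylation(example_sites), 1))  # 30.0
```

Pooling counts weights each site by its coverage; averaging per-site percentages instead would weight all sites equally, and the two can differ when coverage is uneven.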
Altered DNA methylation correlates with plasma PHD2 and blood peripheral oxygen saturation level
The differential methylation percentage of EGLN1 and HIF1AN in each individual of the three groups was examined with their respective SpO2 levels. A linear correlation existed between SpO2 level and the methylation percentage distribution in EGLN1 (R = 0.34, P = 0.004; Fig. 4a). The differential methylation percentage of EGLN1 and HIF1AN in each individual of the three groups was also examined with the respective gene and protein expressions. The increase in the translational expression of EGLN1 in HAPE-p was in line with our observation of significantly lower methylation percentage distribution of EGLN1 in the patients. Likewise, an inverse correlation of the plasma PHD2 levels with methylation percentage distribution was observed (R = − 0.36, P = 0.002; Fig. 4b).
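For readers less familiar with the statistic, the correlation coefficients reported here (for example R = 0.34 between EGLN1 methylation percentage and SpO2) are Pearson coefficients. The sketch below shows how such a coefficient is computed from paired per-subject values; the function name and the numbers are illustrative assumptions, not study data.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical paired values: methylation percentage and SpO2 per subject.
methylation = [30.0, 35.0, 40.0, 50.0, 55.0]
spo2 = [70.0, 82.0, 80.0, 90.0, 88.0]
r = pearson_r(methylation, spo2)
print(f"r = {r:.2f}")
```

A positive r means subjects with higher methylation tend to have higher SpO2; a negative r, as reported for plasma PHD2, means the opposite trend.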
Potential specific CpG sites are susceptible to the disorder
A site-by-site analysis of all CpG sites was performed using multivariate logistic regression, which highlighted several sites of interest. Among these, nine CpG sites in EGLN1 (231,556,748, 231,556,804, 231,556,843, 231,556,858, 231,556,881, 231,557,315, 231,557,317, 231,557,329, 231,557,493) stood apart distinctly with differential distribution in our groups (Fig. 4c). As shown in Table 2, out of these sites, the CpG sites that differed significantly between HAPE-p and HAPE-f were 231,556,748 (OR = 8.33; P = 0.004), 231,556,804 (OR = 10.29; P = 0.011), 231,556,881 (OR = 4.79; P = 0.048), 231,557,317 (OR = 5.62; P = 0.022) and 231,557,329 (OR = 6.42; P = 0.006). Next, we performed a regression analysis to assess the association between these nine CpG sites and the plasma PHD2 levels in the three groups (Table 3). Two CpG sites, 231,556,748 and 231,557,315, were inversely associated with the plasma PHD2 level in HLs (P < 0.05). The in-depth analysis did not yield any noteworthy results for the CpG sites of HIF1AN.
Single nucleotide polymorphisms in and around CpG sites
Given the differentially and significantly distributed individual CpG sites in EGLN1 in the three groups, we hypothesized that single nucleotide polymorphisms (SNPs) in and around the differentially methylated sites might contribute to the regulation of the gene. We performed in silico analysis to determine the presence of SNPs around these differentially methylated regions of EGLN1 covering chr1:231,556,683–231,558,443 (Table 4). The SNP rs186996510, also known as 12C > G or Asp4Glu, lies upstream of the hypermethylated sites, chr1: 231,557,485 and 231,557,493, and bears the binding site for the transcription factor (TF) transforming growth factor-beta-induced factor homeobox 1 (TGIF1) (RegulomeDB Score, 2b). SNP rs12097901, also known as 380G > C or Cys127Ser, and rs61750991, also known as 471G > A or Gln157His, annotated to the two upstream regions of the hypomethylated sites, chr1: 231,556,843 and 231,556,858. While rs12097901 was associated with the TF Forkhead Box B1 (FOXB1) (RegulomeDB Score, 3a), rs61750991 was associated with RNA Polymerase II Subunit A (POLR2A) and Fos Proto-Oncogene, AP-1 Transcription Factor Subunit (FOS); however, the latter association appeared poor, with a RegulomeDB Score of 4.
The present study identified the CpG islands with differential DNA methylation levels in the two oxygen-sensor prolyl hydroxylase genes, EGLN1 and HIF1AN, regulating the HIF signaling pathway in the three study groups, HAPE-p, HAPE-f and HLs. Overall, the percentage of methylation distribution in the CpG island 179 of EGLN1 was significantly lower and that in CpG island 47 of HIF1AN was significantly higher in HAPE-p compared to the two controls, HAPE-f and HLs. Of note, some stretches of sites were scarcely methylated, while others were heavily methylated. Moreover, in EGLN1, methylation percentages at specific sites were significantly lower in patients and were also associated with the risk of HAPE in susceptible individuals. We could identify such distinct methylated CpG sites in EGLN1 potentially adding to the HAPE risk. The elevated EGLN1 expression and plasma PHD2 level in HAPE-p compared to its healthy counterparts suggested a deleterious role in HAPE. Importantly, our results demonstrated an inverse correlation between the plasma PHD2 level and the methylation percentage of CpG sites in EGLN1. The methylation state in the CpG island regulates the normal functioning of the gene. Among the clinical parameters, SpO2 level was significantly decreased in patients, and the positive correlation of the methylation percentage of CpG sites in EGLN1 with SpO2 level further gains significance in the manifestation of HA pathophysiology. These findings portray the functional consequences of the epigenetic modifications at HA. Similarly, HIF1AN expression and plasma FIH-1 level were upregulated in HAPE-p compared to HAPE-f and HLs. However, no such correlations of the methylation levels with the gene or protein expressions were observed for HIF1AN. Perhaps the regulation of HIF1AN is not influenced by its methylation.
Several past and recent reports have demonstrated selection on EGLN1 variants in highland populations around the world [20,21,22]. Bigham et al. showed evidence of positive selection in EGLN1 in both Tibetans and Andeans. Further, our studies affirmed that the EGLN1 risk alleles rs1538664A, rs479200T and rs480902C increased the EGLN1 gene expression and were also associated with decreased SpO2 levels. These alleles were explored for the additional contributions from the associated secondary molecules, especially the transcription factors (TFs) that may regulate the gene through the differential distribution of their variant alleles and the respective TFs in the healthy and susceptible subjects. The study validated the specificity between a TF and allelic variants such as FUS RNA-binding protein (FUS) with rs1538664A, Rho GDP dissociation inhibitor 1 (RhoGDI1) with rs479200T, and hypoxia upregulated protein 1 (HYOU1) with rs480902C. Brutsaert et al. found an increased frequency of an EGLN1 causal variant, rs1769793, that enhances O2 delivery or use during exercise at altitude in the Peruvian Quechua population. Xiang et al. detected a significant association between rs186996510 and hemoglobin levels in Tibetans, suggesting that EGLN1 contributes to the adaptively low hemoglobin level of Tibetans compared with acclimatized lowlanders at high altitudes. Thus, the EGLN1 genetic variants have been influential in affecting the physiological changes relevant to HA.
The present study explored the epigenetic factors, such as DNA methylation, associated with this gene. The position of the CpG island examined in our study for differential DNA methylation does not coincide with the region of evolutionary selection seen in the EGLN1 gene in most of the above studies; however, we still identified potential regulatory SNPs in the EGLN1 methylation region covering chr1:231,556,683–231,558,443. For example, SNP rs12097901, annotated to the upstream regions of the CpG sites 231,556,840, 231,556,843 and 231,556,844, and SNP rs186996510, upstream of the CpG sites 231,557,485 and 231,557,493, are reported to have been selected in the Tibetan population under the HA settings. Further, SNP rs12097901, along with the other coding region SNP rs186996510, may have adaptive benefits as both are associated with the reduced hemoglobin phenotype characteristic of Tibetan adaptation to altitude. The few other regulatory SNPs around the methylated region were also associated with transcription factors whose interactions might be influenced by the differential methylation, affecting the regulation of the gene and body physiology [25, 26]. A recent study outlined four one-carbon metabolism SNPs, methylenetetrahydrofolate dehydrogenase 1 rs2236225, Thymidylate Synthase rs502396, Folate Hydrolase 1 rs202676 and glycine decarboxylase rs10975681, that cumulatively explained 11.29% of the variation in average LINE-1 methylation among the Andean Quechua population. They also found that the number of years lived at HA was negatively associated with EPAS1 methylation and positively associated with LINE-1 methylation. Apart from HA, the epigenetic influence of EGLNs has been demonstrated as a potential marker for lung adenocarcinoma prognosis.
Our study showed relevant results for epigenetic modifications in EGLN1 for HA adaptation and HAPE pathophysiology. It emphasized the differential methylation of EGLN1 and HIF1AN in HAPE-p compared to the two control groups, i.e., HAPE-f and HLs. Based on the methylation frequency, HAPE-p showed a significantly lower methylation distribution in EGLN1 but a higher distribution in HIF1AN. Moreover, specific sites in EGLN1 had significantly lower methylation distribution in HAPE and potentially added to the HAPE risk. We also identified potential regulatory SNPs in the methylation region of EGLN1 that may be influenced by the differential methylation in the HA settings. The correlation of the methylation distribution of the EGLN1 CpG island with plasma PHD2 level and SpO2 level further indicated a role in HA pathophysiology. Overall, our findings point towards a likely epigenetic regulatory role of EGLN1 in the susceptibility to HAPE. Nonetheless, we realize that validation in larger sample sizes and various ethnicities would strengthen the findings. Additional experimental studies determining the effects of each potential CpG dinucleotide site would authenticate its functional consequence. Further, non-matched patient groups and possible changes in cell populations driving some differential methylation patterns are a few other limitations of the study. Validation of our results in an in vitro setup, which we plan for our future studies, may account for the individual confounders influencing epigenetic patterns.
Blood samples were obtained from subjects that were categorized into three well-defined groups: 1) HAPE-patients (HAPE-p) were sojourners who suffered the disorder upon exposure to HA; 2) HAPE-free sojourners (HAPE-f), who visited HA under similar conditions and carried out routine strenuous physical activities but did not suffer from the disorder and remained healthy, and 3) highland natives (HLs) were permanent residents of altitude at and above 3500 m for many generations with Tibeto-Burman ethnicity. HAPE-p and HAPE-f belonged to Indo-Aryan ethnicity and were permanent residents of low altitude (< 200 m) of North India who traveled to Leh, Ladakh, for reasons such as professional assignments, recreation, and adventure. Approximately 32 subjects from each group were recruited through Sonam Norboo Memorial (SNM) hospital, Leh (3500 m), Ladakh, India.
The human ethics committee of CSIR-Institute of Genomics and Integrative Biology, Delhi and SNM Hospital, Leh, Ladakh, India, approved the study for human subjects. All participants of the study whose identities were undisclosed gave written informed consent.
Blood sample collection and clinical assessment
Eight milliliters of blood sample was collected from each subject in acid-citrate-dextrose anticoagulant. Blood samples of HAPE-p were drawn immediately after the diagnosis but before starting medication. Plasma and peripheral blood leukocytes were separated; the latter was processed for DNA extraction. Two ml of whole blood without anticoagulant was collected for RNA extraction. Plasma and RNA were stored at − 80 °C and DNA at − 20 °C. General clinical parameters for each subject were recorded. Diagnosis of HAPE was based on published clinical criteria. The SpO2 level was measured by Finger-Pulse Oximeter 503 (Criticare Systems Inc, USA).
Expression analysis of EGLN1 and HIF1AN
Quantitative real-time PCR
Gene expression of EGLN1 and HIF1AN was determined on 10 samples each of HAPE-p, HAPE-f, and HLs. Total RNA was extracted from a 2 ml whole blood sample aliquot without anticoagulant by TRI reagent RT blood (Molecular Research Centre, Cincinnati, USA). RNA quantity and quality were determined on a NanoDrop ND-1000 spectrophotometer, and integrity was checked on 1.5% agarose gel. Total RNA, 1.0 μg, was used to generate cDNA by EZ-first strand cDNA synthesis kit for reverse transcriptase-PCR (Biological Industries, Beit HaEmek, Israel). Real-time PCR was performed in triplicate with primers (Pearl Primer software; Additional file 1: Table 2) and SYBR Green PCR Master Mix on an ABI Prism 7300 Sequence Detection System (Applied Biosystems, Foster City, USA). The relative transcript quantity was calculated using the ΔΔCt method against the 18S rRNA endogenous reference.
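The ΔΔCt calculation mentioned above can be written out as a short Python sketch. It assumes the standard 2^(−ΔΔCt) form with roughly 100% amplification efficiency for both target and reference; the function name and the Ct values are hypothetical.

```python
def fold_change_ddct(ct_target_case, ct_ref_case, ct_target_ctrl, ct_ref_ctrl):
    """Relative expression by the 2^(-ddCt) method.

    dCt  = Ct(target) - Ct(reference), computed per group
    ddCt = dCt(case) - dCt(control)
    Assumes ~100% PCR efficiency for both amplicons.
    """
    dct_case = ct_target_case - ct_ref_case
    dct_ctrl = ct_target_ctrl - ct_ref_ctrl
    ddct = dct_case - dct_ctrl
    return 2.0 ** (-ddct)


# Hypothetical mean Ct values: the target crosses threshold one cycle
# earlier in cases than in controls, with an unchanged reference gene.
print(fold_change_ddct(24.0, 10.0, 25.0, 10.0))  # 2.0
```

A fold change above 1 indicates higher relative expression in the case group; a one-cycle difference corresponds to a twofold change under the efficiency assumption.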
Estimation of plasma PHD2 and FIH-1 levels
Plasma PHD2 and FIH-1 levels were estimated by immunoassay kits (USCN Life Science, Wuhan, China) on a high-throughput SpectraMax plus384 Spectrophotometer (Molecular Devices, San Jose, USA).
Targeted methylation pattern in EGLN1 and HIF1AN
CpG islands in both the genes were confirmed by the UCSC genome browser (genome.ucsc.edu/) according to the February 2009 Human Genome Browser data. In silico study identified CpG islands, CpG179 spanning the region chr1: 231,556,683–231,558,443 in EGLN1 and CpG47 from the region chr10:102,289,150–102,296,000 in HIF1AN. Total methylation distribution and the actual percentage of methylation were quantified for EGLN1 and HIF1AN concerning CpG sites.
Sodium Bi-sulfite conversion of DNA
Genomic DNA was extracted from peripheral blood leukocytes using a modified salting-out procedure. Quantification and quality check of DNA was carried out on a NanoDrop™ 1000 Spectrophotometer (Thermo Scientific, USA). One μg of blood genomic DNA was sodium bisulfite-converted using the EZ DNA Methylation-Gold™ Kit (Zymo Research, Irvine, USA). Briefly, DNA was bisulfite-converted for 16 h at 50 °C and subsequently desulfonated, washed, and eluted in 10 μl elution buffer. The method selectively converts cytosine (C) to uracil (U) without significant transformation of 5-methylcytosine (5mC) to thymine (T).
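The selectivity of the conversion can be illustrated in silico. The sketch below is a simplified model, not the kit's chemistry: it treats one strand only, assumes complete conversion, and reads uracil as thymine, as it would appear after PCR. The function name, sequence, and methylated positions are hypothetical.

```python
def bisulfite_convert(seq: str, methylated_positions=frozenset()) -> str:
    """Simulate bisulfite conversion followed by PCR on one DNA strand.

    Unmethylated C becomes U and is read as T after amplification;
    5mC at the given 0-based positions is protected and stays C.
    """
    converted = []
    for index, base in enumerate(seq.upper()):
        if base == "C" and index not in methylated_positions:
            converted.append("T")
        else:
            converted.append(base)
    return "".join(converted)


# Hypothetical 8-base fragment with CpGs at positions 1 and 5;
# only the cytosine at position 1 is methylated.
print(bisulfite_convert("ACGTCCGA", methylated_positions={1}))  # ACGTTTGA
```

Comparing the converted read with the original reference at each CpG cytosine is what allows methylation to be called: a retained C indicates methylation, a T indicates an unmethylated cytosine.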
PCR Amplification of Bisulfite converted DNA
The bisulfite-converted DNA of EGLN1 and HIF1AN comprising the CpG island region was PCR amplified using bisulfite-conversion-based methylation PCR primers designed with the MethPrimer software (http://www.urogene.org/cgi-bin/methprimer/methprimer.cgi, Additional file 1: Table 3). PCR amplifications were carried out with Amplitaq gold DNA polymerase using a range of annealing temperatures in a gradient thermocycler (Thermo Scientific, USA). The reaction conditions and amplicon size for each primer pair are given in Additional file 1: Table 3. PCR products were purified by QIAquick PCR columns (Qiagen, USA). The length and concentration of these amplicons were analyzed using an Agilent High Sensitivity DNA chip on an Agilent 2100 bioanalyzer (Agilent Technologies, USA).
NGS library preparation and deep sequencing of Sodium Bisulfite converted amplicons
The CpG islands were profiled on the MiSeq sequencing platform using the Nextera DNA sample preparation kit (Illumina, San Diego, CA, USA) in thirty-one subjects each from the HAPE-p and HAPE-f groups and thirty-two subjects from the HLs group. Dual-indexed libraries were generated according to the manufacturer's protocol. One nanogram of each purified PCR product was used for library generation in a 96-well plate format. Tagmentation, which combines transposome-mediated DNA fragmentation and adapter ligation in a single step, was performed at 55 °C for 5 min. After tagmentation, indexed PCR primers were added for multiplex sequencing. Limited cycle-number PCR was performed to amplify the libraries, which were purified using AMPure XP beads (Beckman-Coulter, Brea, CA, USA). Double-stranded libraries were quality checked for size and molarity on a high sensitivity DNA Agilent chip run on the Agilent 2100 Bioanalyzer (Agilent Technologies). Equimolar libraries were pooled in equal volumes for sequencing on the Illumina MiSeq benchtop sequencer as per the manufacturer’s protocol.
Bi-sulfite sequence analysis
The variant calling algorithms counted the Cs and Ts at CpG sites in the reference sequences for quantitative digital methylation. Sequences were evaluated by mapping against the human reference genome using the built-in Illumina MiSeq Reporter software of the MiSeq system. The bisulfite reads were analyzed with the Bismark tool kit. The paired-end sequence reads were aligned to the in silico bisulfite-converted human reference genome hg19 in a strand-specific manner, not allowing any mismatches or multiple alignments. Deduplication was carried out using the deduplicate_bismark program and the output saved in BAM format. Methylation calls were extracted from the deduplicated BAM files, along with a short report detailing the calls. The extraction mainly generated strand- and context-specific cytosine output files and an overall count report, along with the HTML report. Genetic location is according to the February 2009 Human Genome Browser data. The significance threshold was set at P ≤ 0.05.
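The principle of counting Cs and Ts at CpG sites can be sketched in a few lines of Python. This is a deliberately simplified illustration and not the Bismark algorithm: it assumes ungapped top-strand reads whose alignment offsets are already known, and it ignores base quality, the reverse strand, and non-CpG contexts. All names and sequences are hypothetical.

```python
def cpg_positions(reference: str):
    """0-based positions of the cytosine in every CpG of the reference."""
    ref = reference.upper()
    return [i for i in range(len(ref) - 1) if ref[i:i + 2] == "CG"]


def count_calls(reference: str, reads):
    """Count methylated (C) and unmethylated (T) calls per CpG cytosine.

    reads: iterable of (start_offset, sequence) for ungapped,
    top-strand bisulfite reads already placed on the reference.
    Returns {position: [methylated_count, unmethylated_count]}.
    """
    counts = {pos: [0, 0] for pos in cpg_positions(reference)}
    for start, seq in reads:
        for offset, base in enumerate(seq.upper()):
            pos = start + offset
            if pos in counts:
                if base == "C":
                    counts[pos][0] += 1
                elif base == "T":
                    counts[pos][1] += 1
    return counts


# Hypothetical reference with CpGs at positions 1 and 5, and three reads.
reference = "ACGTTCGA"
reads = [(0, "ACGTTTGA"), (0, "ATGTTTGA"), (4, "TCGA")]
print(count_calls(reference, reads))  # {1: [1, 1], 5: [1, 2]}
```

Dividing the methylated count by the sum of both counts at each position gives the per-site methylation percentage used in the downstream comparisons.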
In silico identification of regulatory loci present within the CpG island
RegulomeDB ver 1.1 performed the functional annotation exercise within the differentially methylated regions. The identified regulatory SNPs associated with the methylated sites were annotated to understand their crucial role in gene regulation.
The role of methylation in HAPE disease and health was evaluated by multivariate logistic regression analysis using SPSS 16.0 software. The study groups depicting phenotype were the dependent variable. While comparing, one group was considered as reference against the other groups. The reference group was labeled as 1, while the test group was labeled as 0, meaning cases/test. Since we wanted to test the significance of each methylated site in all the study groups to identify its association with the disease, we tested individual methylation sites and considered each site as a fixed factor or categorical independent variable. To achieve this, we labeled subjects with no methylation for a site tested as 1 (reference) against the methylated site labeled 0. Finally, to derive the best-fitting and biologically reasonable model to describe the relationship between an outcome and a set of predictors, we adjusted the data with age and gender as covariates. An adjusted P value < 0.05 was considered significant. Odds ratio (OR) and 95% confidence interval (CI) were calculated. The SPSS 16.0 and EPIINFO-6.0 software were used for analyses. Statistical analysis was performed using the standard two-tailed parametric Student’s t test. Multiple correlation analyses using Pearson’s correlation (r) values were performed for levels. The quantitative RT-PCR was analyzed by one-way analysis of variance. Values are represented as means ± standard deviation.
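To make the odds ratio and its confidence interval concrete, the sketch below computes an unadjusted OR from a 2×2 table with a Wald (log-scale) 95% CI. This is an assumption-laden simplification: the analysis described above used logistic regression adjusted for age and gender in SPSS, which this code does not reproduce. The function name and the counts are hypothetical.

```python
import math


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    a: cases with the exposure      b: cases without it
    c: controls with the exposure   d: controls without it
    z = 1.96 gives an approximate 95% interval.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive")
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts: exposure = unmethylated at a given CpG site.
or_value, ci_low, ci_high = odds_ratio_ci(20, 6, 10, 16)
print(f"OR = {or_value:.2f}, 95% CI {ci_low:.2f}-{ci_high:.2f}")
```

With small groups such as those in this study, the interval is wide, and an interval that excludes 1 corresponds to significance at roughly the 0.05 level for the unadjusted comparison.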
Availability of data and materials
The data presented in this study are available in the manuscript and in supplementary materials.
HAPE: High-altitude pulmonary edema
EGLN1: Egl nine homolog 1
PHD2: Prolyl hydroxylase domain protein 2
HIF1AN: Hypoxia inducible factor 1 subunit alpha inhibitor
FIH-1: Factor inhibiting HIF-1α
SpO2: Peripheral blood oxygen saturation
SNPs: Single nucleotide polymorphisms
Fathollahipour S, Patil PS, Leipzig ND. Oxygen regulation in development: lessons from embryogenesis towards tissue engineering. Cells Tissues Organs. 2018;205(5–6):350–71.
Simon MC, Keith B. The role of oxygen availability in embryonic development and stem cell function. Nat Rev Mol Cell Biol. 2008;9(4):285–96.
An J, Rao A, Ko M. TET family dioxygenases and DNA demethylation in stem cells and cancers. Exp Mol Med. 2017;49(4):e323.
Xu R, Sun Y, Chen Z, Yao Y, Ma G. Hypoxic preconditioning inhibits hypoxia-induced apoptosis of cardiac progenitor cells via the PI3K/Akt-DNMT1-p53 pathway. Sci Rep. 2016;6:30922.
Bhatnagar A. Environmental determinants of cardiovascular disease. Circ Res. 2017;121(2):162–80.
Hu CJ, Wang LY, Chodosh LA, Keith B, Simon MC. Differential roles of hypoxia-inducible factor 1α (HIF-1α) and HIF-2α in hypoxic gene regulation. Mol Cell Biol. 2003;23(24):9361–74.
Fong GH, Takeda K. Role and regulation of prolyl hydroxylase domain proteins. Cell Death Differ. 2008;15(4):635–41.
Mishra A, Mohammad G, Norboo T, Newman JH, Pasha MQ. Lungs at high-altitude: genomic insights into hypoxic responses. J Appl Physiol. 2015;119(1):1–15.
Julian CG. Epigenomics and human adaptation to high altitude. J Appl Physiol. 2017;123(5):1362–70.
D’Anna F, Van Dyck L, Xiong J, Zhao H, Berrens RV, Qian J, Bieniasz-Krzywiec P, Chandra V, Schoonjans L, Matthews J, De Smedt J, Minnoye L, Amorim R, Khorasanizadeh S, Yu Q, Zhao L, De Borre M, Savvides SN, Simon MC, Carmeliet P, Reik W, Rastinejad F, Mazzone M, Thienpont B, Lambrechts D. DNA methylation repels binding of hypoxia-inducible transcription factors to maintain tumor immunotolerance. Genome Biol. 2020;21(1):182.
Koslowski M, Luxemburger U, Türeci O, Sahin U. Tumor-associated CpG demethylation augments hypoxia-induced effects by positive autoregulation of HIF-1α. Oncogene. 2011;30(7):876–82.
Kindrick JD, Mole DR. Hypoxic regulation of gene transcription and chromatin: cause and effect. Int J Mol Sci. 2020;21(21):8320.
Moore LD, Le T, Fan G. DNA methylation and its basic function. Neuropsychopharmacology. 2013;38:23–38.
Fong G-H, Takeda K. Role and regulation of prolyl hydroxylase domain proteins. Cell Death Differ. 2008;15:635–41.
Lando D, Peet DJ, Gorman JJ, Whelan DA, Whitelaw ML, Bruick RK. FIH-1 is an asparaginyl hydroxylase enzyme that regulates the transcriptional activity of hypoxia-inducible factor. Genes Dev. 2002;16(12):1466–71.
Sharma K, Mishra A, Singh HN, Prashar D, Alam P, Thinlas T, Mohammad G, Kukreti R, Syed MA, Pasha MAQ. High-altitude pulmonary edema is aggravated by risk-loci and associated transcription factors in HIF-prolyl hydroxylases. Hum Mol Genet. 2021;30(18):1734–49.
Mishra A, Mohammad G, Thinlas T, Pasha MQ. EGLN1 variants influence its expression and SaO2 levels to associate with high-altitude pulmonary edema and adaptation. Clin Sci. 2013;124(7):479–89.
Luks AM, Auerbach PS, Freer L, Grissom CK, Keyes LE, McIntosh SE, Rodway GW, Schoene RB, Zafren K, Hackett PH. Wilderness medical society practice guidelines for the prevention and treatment of acute altitude illness: 2019 update. Wilderness Environ Med. 2019;30:S3-18.
Mishra A, Kohli S, Dua S, Mohammad G, Thinlas T, Pasha MQ. Genetic differences and aberrant methylation in the apelin system predict the risk of high altitude pulmonary edema. Proc Natl Acad Sci U S A. 2015;112(19):6134–9.
Bigham A, Bauchet M, Pinto D, Mao X, Akey JM, Mei R, Scherer SW, Julian CG, Wilson MJ, LópezHerráez D, Brutsaert T, Parra EJ, Moore LG, Shriver MD. Identifying signatures of natural selection in Tibetan and Andean populations using dense genome scan data. PLoS Genet. 2010;6(9):e1001116.
Brutsaert TD, Kiyamu M, Elias Revollendo G, Isherwood JL, Lee FS, Rivera-Ch M, Leon-Velarde F, Ghosh S, Bigham AW. Association of EGLN1 gene with high aerobic capacity of Peruvian Quechua at high altitude. Proc Natl Acad Sci U S A. 2019;116(48):24006–11.
Xiang K, Ouzhuluobu, Peng Y, Yang Z, Zhang X, Cui C, Zhang H, Li M, Zhang Y, Bianba, Gonggalanzi, Basang, Ciwangsangbu, Wu T, Chen H, Shi H, Qi X, Su B. Identification of a Tibetan-specific mutation in the hypoxic gene EGLN1 and its contribution to high-altitude adaptation. Mol Biol Evol. 2013;30(8):1889–98.
Lorenzo FR, Huff C, Myllymäki M, Olenchock B, Swierczek S, Tashi T, Gordeuk V, Wuren T, Ri-Li G, McClain DA, Khan TM, Koul PA, Guchhait P, Salama ME, Xing J, Semenza GL, Liberzon E, Wilson A, Simonson TS, Jorde LB, Kaelin WG Jr, Koivunen P, Prchal JT. A genetic mechanism for Tibetan high-altitude adaptation. Nat Genet. 2014;46(9):951–6.
Brutsaert TD, Kiyamu M, Revollendo GE, Isherwood JL, Lee FS, Rivera-Ch M, Leon-Velarde F, Ghosh S, Bigham AW. Association of EGLN1 gene with high aerobic capacity of Peruvian Quechua at high altitude. Proc Natl Acad Sci U S A. 2019;116(48):24006–11.
Zhi D, Aslibekyan S, Irvin MR, Claas SA, Borecki IB, Ordovas JM, Absher DM, Arnett DK. SNPs located at CpG sites modulate genome-epigenome interaction. Epigenetics. 2013;8(8):802–6.
Zhang X, Moen EL, Liu C, Mu W, Gamazon ER, Delaney SM, Wing C, Godley LA, Dolan ME, Zhang W. Linking the genetic architecture of cytosine modifications with human complex traits. Hum Mol Genet. 2014;23(22):5893–905.
Childebayeva A, Jones TR, Goodrich JM, Velarde FL, Rivera-Chira M, Kiyamu M, Brutsaert TD, Dolinoy DC, Bigham AW. LINE-1 and EPAS1 DNA methylation associations with high-altitude exposure. Epigenetics. 2019;14(1):1–15.
Zhang R, Lai L, He J, Chen C, You D, Duan W, Dong X, Zhu Y, Lin L, Shen S, Guo Y, Su L, Shafer A, Moran S, Fleischer T, MoksnesBjaanæs M, Karlsson A, Planck M, Staaf J, Helland Å, Esteller M, Wei Y, Chen F, Christiani DC. EGLN2 DNA methylation and expression interact with HIF1A to affect survival of early-stage NSCLC. Epigenetics. 2019;14(2):118–29.
The authors thank the study volunteers for their participation, and the staff and faculty at SNM hospital, Leh, and CSIR-IGIB for their cooperation and support. We thank Dr. Naveen Kumar Bhatraju for helping to prepare the figures.
This research was funded by the Cardiovascular Medical Research and Education Fund, Philadelphia, USA (Grant CLP0020, AM, KS) and, in part, by the Indian Council of Medical Research, India (GAP0119/ [ICMR No. 74/6/2015-Pers. EMS, QP and HNS]).
Ethics approval and consent to participate
The study was approved by the Human ethics committee at the Council of Scientific and Industrial Research-Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, and SNM Hospital, Leh, Ladakh, India. Written informed consent was obtained from all participants involved in the study.
Consent for publication
Written informed consent has been obtained from the participants to publish this paper.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Fig. S1 Levels of SpO2 % in HAPE-p, HAPE-f and HLs. Fig. S2 Dot plot of CpG methylation in EGLN1 in the three study groups, i.e., HAPE-p, HLs and HAPE-f. It revealed 97 CpG sites in EGLN1 CpG island 179. Fig. S3 Dot plot of CpG methylation in HIF1AN in the three study groups, i.e., HAPE-f, HAPE-p and HLs. It revealed 46 CpG sites in HIF1AN CpG island 47. Table S1 Methylation distribution of CpG sites of EGLN1 and HIF1AN in each subject of the three study groups, i.e., HAPE-f, HAPE-p and HLs. Table S2 Real-time PCR conditions for EGLN1 and HIF1AN. Table S3 Sodium bisulfite-conversion-based methylation PCR primers and conditions for EGLN1 and HIF1AN.
Cite this article
Sharma, K., Mishra, A., Singh, H. et al. Differential methylation in EGLN1 associates with blood oxygen saturation and plasma protein levels in high-altitude pulmonary edema. Clin Epigenet 14, 123 (2022). https://doi.org/10.1186/s13148-022-01338-z
- Hypobaric hypoxia
- Prolyl hydroxylases
- DNA methylation
- Prolyl hydroxylase domain protein 2
- Factor inhibiting HIF-1α
- High altitude