- Research
- Open access
- Published:
Investigating the effects of maltreatment and acute stress on the concordance of blood and DNA methylation methods of estimating immune cell proportions
Clinical Epigenetics volume 15, Article number: 33 (2023)
Abstract
Background
Immune cell proportions can be used to detect pathophysiological states and are also critical covariates in genomic analyses. The complete blood count (CBC) is the most common method of immune cell proportion estimation, but immune cell proportions can also be estimated using whole-genome DNA methylation (DNAm). Although the concordance of CBC and DNAm estimations has been validated in various adult and clinical populations, less is known about the concordance of existing estimators among stress-exposed individuals. As early life adversity and acute psychosocial stress have both been associated with unique DNAm alterations, the concordance of CBC and DNAm immune cell proportion needs to be validated in various states of stress.
Results
We report the correlation and concordance between CBC and DNAm estimates of immune cell proportions using the Illumina EPIC DNAm array within two unique studies: Study 1, a high-risk pediatric cohort of children oversampled for exposure to maltreatment (N = 365, age 8 to 14 years), and Study 2, a sample of young adults who have participated in an acute laboratory stressor with four pre- and post-stress measurements (N = 28, number of observations = 100). Comparing CBC and DNAm proportions across both studies, estimates of neutrophils (r = 0.948, p < 0.001), lymphocytes (r = 0.916, p < 0.001), and eosinophils (r = 0.933, p < 0.001) were highly correlated, while monocyte estimates were moderately correlated (r = 0.766, p < 0.001) and basophil estimates were weakly correlated (r = 0.189, p < 0.001). In Study 1, we observed significant deviations in raw values between the two approaches for some immune cell subtypes; however, the observed differences were not significantly predicted by exposure to child maltreatment. In Study 2, while significant changes in immune cell proportions were observed in response to acute psychosocial stress for both CBC and DNAm estimates, the observed changes were similar for both approaches.
Conclusions
Although significant differences in immune cell proportion estimates between CBC and DNAm exist, as well as stress-induced changes in immune cell proportions, neither child maltreatment nor acute psychosocial stress alters the concordance of CBC and DNAm estimation methods. These results suggest that the agreement between CBC and DNAm estimators of immune cell proportions is robust to exposure to child maltreatment and acute psychosocial stress.
Background
Blood proportions of immune cells, such as monocytes, lymphocytes, and granulocytes, are tightly regulated in healthy individuals [1]. Deviations from normative immune cell proportions can be used to predict and monitor the progression of many diseases and conditions including chronic inflammation, viral infections, arterial disease, coronary heart disease, gastroenteritis, endocrine disorders, leukemia, and lymphoma [2,3,4,5,6]. Thus, it is critical to accurately measure the proportion of circulating immune cell subtypes for both diagnosis and monitoring of disease progress [7, 8].
Beyond clinical implications, it is critical to accurately measure immune cell proportions for basic research investigating connections between environmental/psychosocial exposures and health. Molecular profiles of whole blood, such as gene expression (9), DNA methylation (DNAm) [10], and telomere length [11], are often derived by averaging the measurement of these molecular features across a heterogeneous mixture of immune cell subtypes within each blood sample. However, each immune cell subtype has different functions and molecular signatures (e.g., methylation patterns, gene expression, telomere length distribution), and thus, the resulting molecular profiles of whole blood are dependent upon the immune cell proportions of the sample [10, 12]. The variability of immune cell proportions—whether due to stress or other factors—across biological samples has implications for the accuracy and reliability of genomic studies. For example, controlling for immune cell proportions in epigenome-wide association studies (EWAS) can lead to improved biological interpretability, whereas not accounting for immune cell proportions in EWAS can lead to false positive results [13,14,15].
The complete blood count (CBC) is the most widely used clinical test for determining immune cell proportions [6, 16] and analyzes major immune cell components via flow cytometry sorting. Despite its ubiquitous application in clinical practice, the CBC method has some disadvantages in population research settings. The application of flow cytometry to determine CBC estimates requires the collection of blood which is often difficult to obtain, especially in pediatric cohorts. Additionally, CBC tests must be performed within 24 h of sample collection and cannot be performed on stored (e.g., dried or frozen) samples. In contrast to CBC estimators, molecular profiles can be reliably generated from dried or frozen blood samples, as well as from noninvasive saliva or buccal swab samples. These alternative molecular methods for detecting immune cell proportions are of considerable interest for a broad range of research fields.
One alternative method for estimating immune cell proportions is the use of whole-genome DNAm data. Because of the unique pattern of methylation inherent in each cell type, methylation signals at specific 5'-cytosine-phosphate-guanine-3' (CpG) sites can be used to estimate immune cell proportions [17,18,19]. Houseman and colleagues developed the first statistical algorithm for estimating immune cell proportions using CpG probes from the Illumina 27 k DNAm array [20, 21], which was later updated by constructing reference sets for the Illumina 450 k DNAm array [14, 22,23,24,25]. Salas and colleagues provided an updated algorithm and a reference set specifically tailored to the Illumina EPIC DNAm array [26,27,28]. Each iteration of immune cell estimators was designed to make estimations with greater accuracy than previous versions. More recently, Salas and colleagues released a revised reference set for the Illumina EPIC DNAm array enabling estimation of a much larger variety of immune cell subtypes than previously possible [29]. Although the accuracy of some of these reference sets and algorithms has been validated in various clinical populations (e.g., patients with head and neck cancer, ovarian cancer, down syndrome, and obesity) [20], in cord blood [30], and in older adults [14], to our knowledge, no work has been done to determine the effects of exposure to child maltreatment (CM), as well as acute psychosocial stress, on the concordance of CBC and DNAm estimates of immune cell proportions.
Exposure to early life adversity (ELA), such as CM, has been shown to alter the DNAm profiles of individual genes such as NR3C1, BDNF, PRF1, SLC6A4, OXTR, COMT, AVP, and CRF [31,32,33,34,35,36,37], as well as alter the DNAm profiles of immune cells [38,39,40,41]. Moreover, the effects of CM and other adverse environmental exposures on DNAm profiles could extend beyond candidate genes and introduce DNAm alterations on a genome-wide scale [42, 43]. These alterations could influence the concordance of CBC and DNAm estimations of immune cell proportions if such alterations overlap with the CpG sites of established DNAm algorithms used to discriminate between immune cell subtypes. Therefore, it is important to validate the concordance of CBC and DNAm methods for estimating immune cell proportions in individuals who have experienced CM.
Beyond CM, changes in DNAm profiles in response to acute psychosocial stress could also alter the concordance of CBC and DNAm methods for estimating immune cell proportions. Evidence suggests that immune cell DNAm levels within promoter regions of stress-related genes respond to acute psychosocial stress [44, 45]. It is unknown whether these alterations persist after the acute stressor has subsided, and whether or not additional areas of the methylome are impacted by acute stress. In addition, DNAm levels in rats and mice are reported to be sensitive to acute stress in various brain regions such as the hippocampus, cortex, and periaqueductal gray area [46, 47]. Given the observed correlations between brain and blood cell methylomes [48, 49], it is plausible that similar changes in DNAm levels of immune cells could be occurring. Changes in DNAm profiles of immune cells in response to acute psychosocial stress can have implications for study designs in which biological samples are collected before, during, and following stressful events.
This study aims to assess the concordance of CBC and DNAm methods for estimating immune cell proportions in blood when implemented 1) cross-sectionally in children who have been investigated for CM, and 2) in four repeated samples of young adults undergoing an acute psychosocial stressor. We hypothesized that CM and acute psychosocial stress would alter the concordance between CBC and DNAm methods for estimating immune cell proportions, with lower concordance under conditions of stress. To test our hypotheses, we compared CBC and DNAm estimates of immune cell proportions in two studies. In Study 1, we included children from the ongoing Child Health Study (CHS), a large multidisciplinary study designed to provide prospective, longitudinal data on the health and development of children with and without a history of investigations for CM (physical and sexual abuse, and neglect). Study 1 was used to determine the effects of CM on the concordance of CBC and DNAm methods for estimating immune cell proportions. In Study 2, we used a sample of healthy young adults exposed to the Trier Social Stress Test (TSST), a laboratory stressor shown to promote robust sympathetic and adrenal stress responses [50] and elicit changes in immune cell proportions within individuals across time [51]. Study 2 was used to determine the effects of acute psychosocial stress on the concordance between CBC and DNAm methods of estimating immune cell proportions. We used the Illumina EPIC DNAm method of estimating immune cell proportions [29] and provide supplementary analyses using the previous Illumina EPIC DNAm iteration [26,27,28].
Results
Study 1: Testing the impact of child maltreatment on the concordance between CBC and DNAm estimates of immune cell proportions
Study 1 was designed such that it included both individuals with a history of CM investigations (for sexual abuse, physical abuse, or neglect) who were oversampled, and non-exposed individuals without a history of CM investigations, resulting in a total N = 365 (N = 307 youth with a maltreatment history and N = 58 comparison youth). Tests for demographic differences by CM status demonstrated that the CM group had, older age (p = 0.035), higher BMI (p = 0.002), more advanced pubertal stage (p < 0.001), and lower household income (p < 0.001) (see Table 1 for complete demographic comparisons).
There were no statistically significant differences in mean values of CBC or DNAm estimates of all immune cell subtypes when comparing CM and comparison groups (Table 1). Across all participants, CBC and DNAm estimates of immune cell proportions were highly correlated with one another for each cell type, with the exception of basophils (Pearson’s correlation: neutrophils: r = 0.948, p < 0.001; lymphocytes: r = 0.974, p < 0.001; monocytes: r = 0.782, p < 0.001; eosinophils: r = 0.933, p < 0.001; basophils: r = 0.189, p < 0.001; Additional file 2: Table S1). Excluding zero observations of basophils for CBC and DNAm estimates increased the correlation for basophils (r = 0.380, p < 0.001). Figure 1 shows CBC estimates plotted as a function of DNAm estimates with a 45-degree identity reference line and stratified by CM status. Distributions of CBC and DNAm estimates for each cell type are shown in Additional file 1: Figure S1.
We next examined the agreement between CBC and DNAm estimates among each cell type using the Bland–Altman method. Neutrophil CBC and DNAm estimates were significantly different from one another (β = -1.52, p < 0.001) with DNAm estimates being, on average, 1.52% lower than CBC estimates (Fig. 2A). Lymphocyte CBC and DNAm estimates were also significantly different from one another (β = -0.28, p = 0.014), with DNAm estimates, on average, being 0.28% lower than CBC estimates (Fig. 2B). In contrast, monocyte and eosinophil CBC and DNAm estimates did not show significant differences from one another (monocytes: β = 0.07, p = 0.28; eosinophils: β = -0.06, p = 0.46; Fig. 2C, D). Basophil CBC and DNAm estimates were significantly different from one another (β = -0.41, p < 0.001), with DNAm estimates being, on average, 0.41% lower than CBC estimates (Fig. 2E). The data dispersion for all immune cell subtypes was varied, with neutrophils having the largest range (20%-80%) in observations and basophils having the smallest range (0%-2.5%). Additionally, all cell types had a similar dispersion of differences between CBC and DNAm estimates other than basophils, which had the greatest number of observations outside the upper and lower limits of agreement.
Immune cell proportion difference scores [DNAm estimate − CBC estimate] were regressed onto CM status and covariates (Table 2). Although lymphocytes showed a significant difference in the Bland–Altman analysis, only neutrophils (β = -1.77, p < 0.001) and basophils (β = -0.37, p < 0.001) showed significant intercepts in our regression analysis, meaning that only these two cell types had significant differences between DNAm and CBC estimates after accounting for covariates. CM status was not a significant predictor of differences in CBC and DNAm estimates for all immune cell subtypes (neutrophils: β = -0.02, p = 0.96; lymphocytes: β = 0.06, p = 0.85; monocytes: β = -0.14, p = 0.43; eosinophils: β = 0.04, p = 0.85; basophils: β = -0.06, p = 0.43; Table 2).
Study 2: Testing the impact of acute psychosocial stress on the concordance between CBC and DNAm estimates of immune cell proportions
Study 2 participants ranged in age from 18–24 years (mean age = 21.3, SD = 1.3), had an average BMI of 24.2 (SD = 3.7), and were primarily female (64.3%). This study was designed to test the impact of ELA, and thus approximately half of the study participants recruited reported exposure to multiple adverse events in early life (N = 13 ELA exposure young adults and N = 15 control young adults). Four repeated blood samples were collected before and after an acute laboratory stressor (see Methods). Since study 2 involved DNAm estimates generated using DNA extracted from peripheral blood mononuclear cells (PBMCs), we conducted comparisons only between estimates of lymphocytes and monocytes. Full demographic statistics for Study 2, including CBC and DNAm estimates for each time-point, are shown in Table 3.
Across all time-points and individuals, both monocytes and lymphocytes had a Pearson correlation coefficient of 0.78 (p < 0.001). When compared to time 1 (i.e., 30 min pre-stress), time 4 (i.e., 240 min post-stress) had significantly lower CBC (β = -3.24, p < 0.001) and DNAm estimates (β = -2.22, p = 0.02) for monocyte proportions. Additionally, compared to time 1, time 4 had significantly higher CBC (β = 3.24, p < 0.001) and DNAm estimates of lymphocyte proportions (β = 2.22, p = 0.02), highlighting significant changes in PBMC immune cell proportions in response to acute stress for both CBC and DNAm estimates. Figure 3 shows CBC estimates plotted as a function of DNAm estimates with a 45-degree identity reference line and stratified by time-point. Distributions of CBC and DNAm estimates for each cell type are shown in Additional file 1: Figure S2.
We next examined the concordance between CBC and DNAm estimates for monocytes and lymphocytes using a repeated-measures Bland–Altman method. Monocyte CBC and DNAm estimates were not significantly different from one another (β = 0.59, p = 0.14) (Fig. 4A). Lymphocyte CBC and DNAm estimates were also not significantly different from one another (β = -0.59, p = 0.14) (Fig. 4B).
When regressing immune cell proportion difference scores [DNAm estimate − CBC estimate] onto time-point and covariates, time-point (i.e., pre- and post-stress) was not a significant predictor of difference scores for lymphocytes or monocytes (see Additional file 2: Table S2 for post hoc pairwise comparisons of time-points). Further, neither of the predicted intercepts for difference scores in monocytes and lymphocytes were significantly different from zero (monocytes: β = 1.16, p = 0.14; lymphocytes: β = -1.16, p = 0.14), indicating that there were no significant differences in DNAm and CBC estimate measurements across time-points. Additionally, ELA status was not a significant predictor of differences between DNAm and CBC estimates of immune cell proportions (monocytes: β = 2.52, p = 0.09; lymphocytes β = -2.52, p = 0.09), indicating that ELA status was not a confounding variable on the concordance between DNAm and CBC measurements over time (see Table 4 for full multilevel regression results).
Discussion
This study assessed the concordance between CBC and DNAm estimates of immune cell proportions using the recent Illumina EPIC DNAm method [29] in the context of CM and acute psychosocial stress. In Study 1, DNAm estimates were significantly correlated with CBC values for all immune cells. The lower correlation coefficients in monocyte and basophil estimates may be due to the limited range of these cell type proportions in comparison with neutrophils, lymphocytes, and eosinophils [52]. Using Bland–Altman analysis, DNAm estimates of both neutrophils and basophils were significantly lower than CBC estimates, though CM did not significantly explain any variation between estimation methods. DNAm estimates of lymphocytes, monocytes, and eosinophils showed no significant deviations from CBC estimates. In Study 2, DNAm estimates were significantly correlated with CBC values for both lymphocytes and monocytes. Additionally, using Bland–Altman analysis, DNAm and CBC estimates were not significantly different from one another for both monocytes and lymphocytes. While the acute stress manipulation caused significant changes in both monocyte and lymphocyte proportions over time across both approaches, it did not explain any variation in difference scores between CBC and DNAm estimates. Furthermore, in an exploratory analysis, ELA status did not impact the concordance of CBC and DNAm estimates for either lymphocytes or monocytes over time.
Previous studies investigating the concordance between DNAm and CBC estimates of immune cell proportions have been conducted in limited populations (e.g., patients with cancer, Down syndrome, obesity) [20] or niche tissues such as cord blood. Houseman and colleagues reported median difference scores between CBC and DNAm estimates to be 6.5% for neutrophils and 1.1% for monocytes when using their initial DNAm estimation library. Lymphocyte subset values ranged from 0.1–2.1% [20]. Importantly, the directionality of these results cannot be interpreted because the errors were reported as absolute values. Using the Houseman et al. DNAm estimation libraries, Koestler and colleagues reported the correlations of CBC and DNAm estimates to be around 0.60 for both lymphocytes and monocytes in PBMC samples [53], which is in line with our findings. Further, work by Accomando and colleagues [21] reported correlations between all immune cell subtypes when using the Houseman et al. DNAm estimation libraries, similar to our findings. However, Bland–Altman plots from this study exhibited negligible mean bias in immune cell subtype proportions.
Although prior work exists validating the original Houseman reference library, little research has been done to validate the concordance of CBC and DNAm estimates of immune cell proportions using the first EPIC array library generated by Salas and colleagues [26]. Salas and colleagues reported all lymphocyte cell subsets having DNAm estimates higher (> 1.25%) than CBC estimates. Monocyte values showed a similar trend with the median difference being less than 1% and neutrophils showed an opposite trend, with DNAm estimates being around 1.5% lower than CBC estimates. Each of these values are comparable to and in the same direction as the differences seen in Study 1 results when using the first EPIC array library for DNAm estimates, with the exception of neutrophils, which had a similar effect size but in the opposite direction (see Additional file 2: Tables S3–S7 for results of our analysis using the first EPIC array library generated by Salas and colleagues [26]).
To the best of the authors’ knowledge, there have been no previous studies assessing the concordance of CBC and DNAm estimates of immune cell proportions in a sample of maltreated children or in adults within the context of acute psychosocial stress. Further, this is the first study, to our knowledge, validating the recently released “FlowSorted.BloodExtended.EPIC” library [29] (see Methods). Study 1 provided a unique sample for testing the impact of CM on the concordance between CBC and DNAm estimates of immune cell proportions in a pediatric cohort. The experimental design employed by Study 2 allowed us to probe stress-induced temporal effects on the concordance of CBC and DNAm estimates of immune cell proportions. Additionally, repeated measurements of immune cell proportions in Study 2 were obtained over a 5 h time-window in response to an acute laboratory stressor, enabling us to detect the presence of temporal effects [54] of acute psychosocial stress on the concordance of immune cell proportion estimates. Contrary to expectations, we did not observe significant effects of CM or acute psychosocial stress on the concordance of CBC and DNAm immune cell estimates.
This study is not without limitations. For both studies, accurately comparing CBC and DNAm estimates of immune cell proportions is contingent on the assumption that whole blood immune cell proportions are equivalent in different blood tubes collected from the same venipuncture of an individual. Although most studies on the reliability of CBC estimates operate under the same premise [55,56,57,58], it is nevertheless still an assumption. In addition, all experiences of CM were grouped into one category which does not acknowledge the heterogeneity of biological effects resulting from differing age of first CM incident, type of CM, and severity of CM [59]. Future studies should examine the impact of detailed CM classification on the concordance of CBC and DNAm estimates. A limitation of Study 2 was the small sample size, larger studies with sufficient power are needed to more fully identify differences in immune cell estimates in response to acute psychosocial stress. Another limitation from Study 2 was the assumption that all individuals experienced a comparable increase in psychosocial stress from baseline to follow-up time-points during the acute stress procedure. Additionally, our exploratory analysis of the impact of ELA on CBC and DNAm estimate difference scores could have been improved by using a larger sample of ELA and control individuals.
The null effects of CM and acute psychosocial stress on the concordance of CBC and DNAm estimates of immune cell proportions may be due in part to minimal overlap between CpG sites altered by exposure to stress and the libraries of CpG sites used in DNAm estimators. Alternatively, alterations in DNAm patterns induced by CM or acute psychosocial stress at CpG sites overlapping with sites used in DNAm estimators of immune cell proportions may not be substantial enough to alter these estimates in a significant way. Future studies should explore whether specific CpG sites used in DNAm estimations of immune cell proportions overlap with other common CpG sites associated with physical or mental disorders. Additionally, future work should also be performed to test potential DNAm changes that take place in response to acute psychosocial stress over an extended period of time. Finally, we also recommend that more clinically practical methods [8, 60] of estimating immune cell proportions be tested for their accuracy in populations of individuals that have experienced CM and in individuals that have experienced acute psychosocial stress.
Conclusions
Although significant differences in raw value estimation between CBC and DNAm exist for some immune cell proportions, CM and acute psychosocial stress did not alter the existing concordance of CBC and DNAm estimations of immune cell proportions. These findings extend previous research and suggest that study designs which include individuals with an exposure to adverse events, as well as study designs employing acute stress paradigms can rely on existing DNAm methods to estimate proportions of immune cell subtypes.
Methods
Study 1: Participants and procedures
Study 1 participants were members of the CHS [61], a large multidisciplinary study designed to provide prospective, longitudinal data on the health and development of children with and without a history of maltreatment. The CHS is currently recruiting a large state-wide cohort of children recently investigated for CM and non-maltreated comparison children. The goals of the CHS are to elucidate the multiple etiological processes, as well as mediators and moderators, believed to play a role in the onset and maintenance of adverse health outcomes among survivors of CM, and to better inform intervention opportunities to reverse the negative consequences of CM.
Recruitment for the CHS is ongoing. Children with a recent (< 12 months) report of CM exposure are identified in collaboration with Pennsylvania’s Statewide Child Welfare Information System (CWIS). Subjects with recent involvement in the CWIS are invited to participate in the study through home mailings and phone contact by study coordinators. Eligibility criteria include: 1) aged 8 to 13 years, 2) subject of a CWIS maltreatment report (i.e., an allegation is made and investigated) and agreement for participation within 12 months of CWIS involvement, and 3) agreement of participation by a non-abusing caregiver. Non-maltreated comparison children are recruited via targeted advertisements in the same Pennsylvania counties as children with a history of CM investigations. Eligibility for participation includes: 1) no previous CWIS reports or contact, and 2) demographic similarity to a maltreatment participant. After recruitment, participating families are invited to visit the Center for Healthy Children at The Pennsylvania State University for a full day of assessments and biospecimen collection. Approval from The Pennsylvania State University Institutional Review Board was granted, and informed assent (child) or consent (caregiver) was obtained from all participants.
Families of participants arrived at the Center for Healthy Children at 7:30 a.m. After an introduction to the study, youth underwent a physical exam followed by fasting whole blood collection in 10 mL and 4 mL EDTA tubes via antecubital venipuncture by a trained phlebotomist. Blood samples collected in 4 mL EDTA tubes were sent to Quest Diagnostics for CBC analyses within 24 h of collection. Genomic DNA was extracted from whole blood using a semi-automated approach (QIASymphony, Qiagen). Whole-genome DNAm levels were analyzed using the Illumina Infinium EPIC array.
Demographic information for each participant was collected by survey. Body mass index (BMI) was obtained via a trained staff member collecting weight and height information from each participant and then calculating their standard weight to height-squared ratio. Pubertal stage was assessed by a trained staff member by taking an average of self-reported Tanner pubertal status measures of breast growth and pubic hair growth [62]. Total household income before taxes was self-reported by caregivers using a 0–11 Likert scale (0 = Under $10,000/year, 1 = $10,000-$19,999/year… 11 = More than $120,000/year). Race was caregiver-reported and collapsed into a binary variable where “White” was coded as zero and all other races were coded as one. This was done to ensure a large enough sample size in each category to detect statistically significant effects of race. Ethnicity was caregiver-reported as either “Hispanic” or “Non-Hispanic”.
Cross-sectional data reported in Study 1 were drawn from the baseline (i.e., Time 1) assessment of currently enrolled CHS participants. Although recruitment for this cohort study is ongoing with a target enrollment of 700 children, an initial subset of 439 participants were available for the purpose of these analyses. Of the 439 participants who had completed Time 1 assessments, 435 consented to anthropometric measurements and 401 consented to and successfully completed blood draws (1 caregiver refusal, 33 participant refusals, 4 attempted but incomplete). The first batch for DNAm analysis constituted 286 total samples and 275 samples survived DNAm QC measures (see DNAm Quality Control and Immune Cell Proportion Estimations subsection). A second batch, including 126 samples, was submitted for DNAm analysis, of which all samples survived DNAm QC measures. Both batches combined amounted to a total of 401 individuals. Of these 401 samples, 36 were excluded from the current analyses due to failed/missing CBC tests or other missing covariates, making our final analytic sample a total of 365 participants. No demographic differences were present between groups included and excluded from the analysis, with the exception of BMI and pubertal status (BMI being higher in the included group and pubertal status being lower; p < 0.05).
Study 2: Participants and procedures
Study 2 was comprised of a sample of 28 healthy individuals aged 18 to 25 years. Participants were recruited by word of mouth and advertisements on campus bulletin boards. During a visit to The Pennsylvania State University’s Clinical Research Center, participants were subjected to the Trier Social Stress Test (TSST) followed by a 4-h post-test sampling and questionnaire period. Testing for each participant began at 11:00am and ended by 4:15 pm. Blood was drawn at four different points during this time period (30 min before the TSST and 30, 90, and 240 min post-TSST). Participants were given specific instructions to refrain from excessive physical activity on the day of the testing, consuming alcohol for 12 h before their arrival, and eating and drinking (besides water) for 2 h prior to the testing session. The TSST was scheduled to begin at 12:00 pm to minimize the effects of circadian changes in cortisol. Detailed information on study procedures, including details on the TSST, has been reported previously [63].
The original motivation for this data collection was to examine differences in gene expression due to ELA status. For the purpose of the current investigation, we combined data from all participants, including those who had experienced ELA and controls. Exploratory analyses in Study 2 further examined differences in DNAm estimates of immune cell proportions by ELA status. ELA status of participants was assessed by a trained clinical interviewer during a phone interview using the Stressful Life Events Screening Questionnaire [64], as described previously [63]. BMI was obtained by collecting weight and height information from each participant and then calculating their standard weight to height-squared ratio. Ethnicity was self-reported as either “Hispanic” or “Non-Hispanic”.
Whole blood samples were repeatedly collected via an IV catheter into the antecubital vein. Blood samples collected in 4 mL EDTA tubes were sent to Quest Diagnostics for CBC analysis within 24 h. Blood samples collected in 10 mL EDTA tubes were immediately centrifuged for 10 min at 1500 g prior to collection of plasma. Peripheral blood mononuclear cells (PBMCs) were then isolated through density-gradient centrifugation using Ficoll. A small fraction of granulocytes in PBMC samples may have been retained (mean remaining granulocyte composition was 1.6% according to DNAm estimates) during processing. DNA was extracted from PBMCs using QIAmp mini kit (Qiagen) and sent to the Genome Sciences Core at The Pennsylvania State University for whole-genome DNAm analysis.
DNAm quality control and immune cell proportion estimations
For both studies, DNA was bisulfite converted and processed by either McGill University (Study 1) or the Genome Sciences Core at The Pennsylvania State University (Study 2). DNAm levels were probed using the Illumina Infinium EPIC array [26]. EPIC array idat imaging files were converted to DNAm M and β value matrices with the minfi [65] package using R statistical software (R v4.1.2). All samples that had an average probe detection p value > 0.05 were excluded from our analyses. Sample normalization was performed according to recommendations by Salas and colleagues using the noob normalization method (which has been shown to decrease technical variation between batches [66]) in the minfi package [29] and DNAm estimates of blood cell proportions were computed using the ProjectCellType_CP function in the FlowSorted.Blood.EPIC package, which is equivalent to the ProjectCellType function in minfi. The “FlowSorted.BloodExtended.EPIC” library [29] was used as reference data for blood cell proportion estimates in both studies.
Modeling CBC and DNAm estimates
The results from CBC tests and the DNAm estimates of immune cell proportions do not report the same categories of immune cell subtypes. CBC tests report neutrophils, lymphocytes, monocytes, eosinophils, and basophils whereas the recent Salas et al. DNAm estimates [29] provide a much larger subset of lymphocytes such as memory and naïve CD4T, CD8T, and B-cells, as well as natural killer (NK) cells and regulatory T-cells. All granulocytes (neutrophils, eosinophils, and basophils) were modeled in Study 1, whereas only PBMCs (composed only of monocytes and lymphocytes) were used in Study 2. The following formulae were used to compare the differing cell type reports from CBC and DNAm estimates:
Statistical analysis
Study 1
In Study 1, Pearson correlation coefficients were calculated to test the association between CBC and DNAm estimates of immune cell proportions. Bland–Altman analyses [67, 68] were then performed to test for the concordance between each cell type’s CBC and DNAm estimates. Briefly, Bland–Altman analysis is a widely used statistical method to test the agreement between two types of measurement constructs, usually one of which is considered the “gold standard”. The mean of the two measurements is plotted on the x-axis and their difference is plotted on the y-axis. Usually, these analyses also include information on the mean difference between measurements and what are called the Limits of Agreement (95% confidence intervals on the mean difference between methods). (See [67, 68] For a more in-depth treatment of Bland–Altman analysis).
Multivariate linear regression was used to test whether the presence of CM significantly altered the concordance of CBC and DNAm estimates of immune cell proportions. The difference between CBC and DNAm estimates [DNAm estimate − CBC estimate] was treated as the outcome and CM was treated as a predictor. Additional covariates included age, sex, BMI, household income, average Tanner measurements for pubertal stage, race, and ethnicity. Although income was assessed as a categorical variable, it was treated as continuous. “White” and “non-Hispanic” racial and ethnic categorizations were treated as reference groups due to the fact that they were the largest groups in the sample. Continuous covariates were sample-mean centered and dichotomous covariates (0 or 1) were coded as − 0.5 or 0.5 to allow for easier interpretation of regression estimates for the sample mean.
Study 2
In Study 2, correlation coefficients were calculated to test the association between CBC and DNAm estimates of immune cell proportions. A repeated-measures Bland–Altman analysis [69] for each cell type’s CBC and DNAm estimates was used to test for the concordance of these values, as described in Study 1.
Due to the repeated-measures study design, a multilevel modeling framework was used to test whether acute psychosocial stress significantly altered the concordance of CBC and DNAm estimates of immune cell proportions. The difference between CBC and DNAm estimates of cell proportions [DNAm estimate − CBC estimate] was treated as the outcome and the time-point was treated as the predictor to determine if there was significant variation in difference scores across time-points. Individual random effects were included for the model intercept and additional covariates included sex, age, BMI, ethnicity, and ELA status. All variables, other than time-point, were mean centered as described for Study 1.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable requests.
Abbreviations
- DNAm:
-
DNA methylation
- EWAS:
-
Epigenome-wide association studies
- CBC:
-
Complete blood count
- CpG:
-
5ʹ-cytosine-phosphate-guanine-3ʹ
- CM:
-
Child maltreatment
- ELA:
-
Early-life adversity
- CHS:
-
Child Health Study
- TSST:
-
Trier Social Stress Test
- CWIS:
-
Pennsylvania’s Statewide Child Welfare Information System
- BMI:
-
Body mass index
- PBMCs:
-
Peripheral blood mononuclear cells
- NK:
-
Natural killer cells
References
Scheiermann C, Frenette PS, Hidalgo A. Regulation of leucocyte homeostasis in the circulation. Cardiovasc Res. 2015;107(3):340–51.
Jung YJ, Kim JH, Park YJ, Kahng J, Lee H, Lee KY, et al. Evaluation of cell population data on the UniCel DxH 800 Coulter cellular analysis system as a screening for viral infection in children. Int J Lab Hematol. 2012;34(3):283–9.
Velioglu Y, Yuksel A. Complete blood count parameters in peripheral arterial disease. Aging Male. 2019;22(3):187–91.
Madjid M, Fatemi O. Components of the complete blood count as risk predictors for coronary heart disease. Tex Heart Inst J. 2013;40(1):17–29.
Wielińska-Wiśniewska H, Nowak JK, Da̧browski M, Szydłowska P, Szczepanik M, Cichocka K, et al. Reliability of capillary complete blood count in children with acute gastroenteritis. Front Pediatr. 2021 Aug 1;9.
Dixon L. The complete blood count: physiologic basis and clinical usage. J Perinat Neonat Nurs. 1997;11(3):1–18.
Blumenreich MS. The white blood cell and differential count. In: Walker HK, Hall WD, Hurst JW, editors. Clinical methods: the history, physical, and laboratory examinations. 3rd ed. Boston: Butterworths;1990.
Frobel J, Božić T, Lenz M, Uciechowski P, Han Y, Herwartz R, et al. Leukocyte counts based on DNA methylation at individual cytosines. Clin Chem. 2018;64(3):566–75.
Cole SW. Elevating the perspective on human stress genomics. Psychoneuroendocrinology. 2010;35(7):955–62.
Adalsteinsson BT, Gudnason H, Aspelund T, Harris TB, Launer LJ, Eiriksdottir G, et al. Heterogeneity in white blood cells has potential to confound DNA methylation measurements. PLoS ONE. 2012;7(10): e46705.
Lin J, Cheon J, Brown R, Coccia M, Puterman E, Aschbacher K, et al. Systematic and cell type-specific telomere length changes in subsets of lymphocytes. J Immunol Res. 2016;2016:5371050.
Satija R, Shalek AK. Heterogeneity in immune responses: from populations to single cells. Trends Immunol. 2014;35(5):219–29.
Qi L, Teschendorff AE. Cell-type heterogeneity: why we should adjust for it in epigenome and biomarker studies. Clin Epigenet. 2022;14(1):1–8.
Jaffe AE, Irizarry RA. Accounting for cellular heterogeneity is critical in epigenome-wide association studies. Genome Biol. 2014;15(2):1–9.
Campbell KA, Colacino JA, Park SK, Bakulski KM. Cell types in environmental epigenetic studies: biological and epidemiological frameworks. Curr Environ Health Rep. 2020;7(3):185–97.
Walters MC, Abelson HT. Interpretation of the complete blood count. Pediatr Clin N Am. 1996;43(3):599–622.
Álvarez-errico D, Vento-tormo R, Sieweke M, Ballestar E. Epigenetic control of myeloid cell differentiation, identity and function. Nat Rev Immunol. 2015;15(1):7–17.
Kim M, Costello J. DNA methylation: an epigenetic mark of cellular memory. Exp Mol Med. 2017;49(4):e322–e322.
Schmidt M, Maié T, Dahl E, Costa IG, Wagner W. Deconvolution of cellular subsets in human tissue based on targeted DNA methylation analysis at individual CpG sites. BMC Biol. 2020;18(1):1–13.
Houseman EA, Accomando WP, Koestler DC, Christensen BC, Marsit CJ, Nelson HH, et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinf. 2012;13–86.
Accomando WP, Wiencke JK, Houseman EA, Nelson HH, Kelsey KT. Quantitative reconstruction of leukocyte subsets using DNA methylation. Genome Biol. 2014;15(3):R50.
Bakulski KM, Feinberg JI, Andrews SV, Yang J, Brown S, L. McKenney S, et al. DNA methylation of cord blood cell types: applications for mixed cell birth studies. Epigenetics. 2016;11(5):354–62.
Yousefi P, Huen K, Quach H, Motwani G, Hubbard A, Eskenazi B, et al. Estimation of blood cellular heterogeneity in newborns and children for epigenome-wide association studies: blood cellular heterogeneity in newborns and children. Environ Mol Mutagen. 2015;56(9):751–8.
Guintivano J, Aryee MJ, Kaminsky ZA. A cell epigenotype specific model for the correction of brain cellular heterogeneity bias and its application to age, brain region and major depression. Epigenetics. 2013;8(3):290–302.
Reinius LE, Acevedo N, Joerink M, Pershagen G, Dahlén SE, Greco D, et al. Differential DNA methylation in purified human blood cells: implications for cell lineage and studies on disease susceptibility. PLoS ONE. 2012;7(7): e41361.
Salas LA, Koestler DC, Butler RA, Hansen HM, Wiencke JK, Kelsey KT, et al. An optimized library for reference-based deconvolution of whole-blood biospecimens assayed using the Illumina HumanMethylationEPIC BeadArray. Genome Biol. 2018;19(1):64.
Salas LA, Koestler DC. FlowSorted.Blood.EPIC. Immunomethylomics Working Group; 2022. Available from: https://github.com/immunomethylomics/FlowSorted.Blood.EPIC
Koestler DC, Jones MJ, Usset J, Christensen BC, Butler RA, Kobor MS, et al. Improving cell mixture deconvolution by identifying optimal DNA methylation libraries (IDOL). BMC Bioinf. 2016;17(1):120.
Salas LA, Zhang Z, Koestler DC, Butler RA, Hansen HM, Molinaro AM, et al. Enhanced cell deconvolution of peripheral blood using DNA methylation for high-resolution immune profiling. Nat Commun. 2022;13(1):761.
Gervin K, Salas LA, Bakulski KM, van Zelm MC, Koestler DC, Wiencke JK, et al. Systematic evaluation and validation of reference and library selection methods for deconvolution of cord blood DNA methylation data. Clin Epigenet. 2019;11(1):1–15.
Murgatroyd C, Patchev AV, Wu Y, Micale V, Bockmühl Y, Fischer D, et al. Dynamic DNA methylation programs persistent adverse effects of early-life stress. Nat Neurosci. 2009;12(12):1559–66.
Elliott E, Ezra-Nevo G, Regev L, Neufeld-Cohen A, Chen A. Resilience to social stress coincides with functional DNA methylation of the Crf gene in adult mice. Nat Neurosci. 2010;13(11):1351–3.
Szyf M. The early life social environment and DNA methylation: DNA methylation mediating the long-term impact of social environments early in life. Epigenetics. 2011;6(8):971–8.
Szyf M. The early-life social environment and DNA methylation. Clin Genet. 2012;81(4):341–9.
Hing B, Braun P, Cordner ZA, Ewald ER, Moody L, McKane M, et al. Chronic social stress induces DNA methylation changes at an evolutionary conserved intergenic region in chromosome X. Epigenetics. 2018;13(6):627–41.
Lapp HE, Hunter RG. Early life exposures, neurodevelopmental disorders, and transposable elements. Neurobiol Stress. 2019;21(11): 100174.
Spengler D, Binder E, editors. Epigenetics and neuroendocrinology: clinical focus on psychiatry, Volume 2. Cham: Springer International Publishing; 2016. (Epigenetics and Human Health). https://doi.org/10.1007/978-3-319-29901-3
Parade SH, Huffhines L, Daniels TE, Stroud LR, Nugent NR, Tyrka AR. A systematic review of childhood maltreatment and DNA methylation: candidate gene and epigenome-wide approaches. Transl Psychiatry. 2021;11(1):1–33.
Cecil CAM, Zhang Y, Nolte T. Childhood maltreatment and DNA methylation: a systematic review. Neurosci Biobehav Rev. 2020;1(112):392–409.
Seif EA, Diab IH, Ibrahim SA, Hussein HA, Ghitani SA. Association between different parameters of child maltreatment and global DNA methylation. Alex J Med. 2019;55(1):82–8.
Houtepen LC, Hardy R, Maddock J, Kuh D, Anderson EL, Relton CL, et al. Childhood adversity and DNA methylation in two population-based cohorts. Transl Psychiatry. 2018;8(1):1–12.
Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, Ballestar ML, et al. Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci. 2005;102(30):10604–9.
Yang BZ, Zhang H, Ge W, Weder N, Douglas-Palumberi H, Perepletchikova F, et al. Child abuse and epigenetic mechanisms of disease risk. Am J Prev Med. 2013;44(2):101–7.
de Rooij SR, Costello PM, Veenendaal MVE, Lillycrop KA, Gluckman PD, Hanson MA, et al. Associations between DNA methylation of a glucocorticoid receptor promoter and acute stress responses in a large healthy adult population are largely explained by lifestyle and educational differences. Psychoneuroendocrinology. 2012;37(6):782–8.
Unternaehrer E, Luers P, Mill J, Dempster E, Meyer AH, Staehli S, et al. Dynamic changes in DNA methylation of stress-associated genes (OXTR, BDNF) after acute psychosocial stress. Transl Psychiatry. 2012;2(8):e150–e150.
Rodrigues GM, Toffoli LV, Manfredo MH, Francis-Oliveira J, Silva AS, Raquel HA, et al. Acute stress affects the global DNA methylation profile in rat brain: Modulation by physical exercise. Behav Brain Res. 2015;15(279):123–8.
Floriou-Servou A, von Ziegler L, Waag R, Schläppi C, Germain PL, Bohacek J. The acute stress response in the multiomic era. Biol Psychiatry. 2021;89(12):1116–26.
Tylee DS, Kawaguchi DM, Glatt SJ. On the outside, looking in: a review and evaluation of the comparability of blood and brain “-omes.” Am J Med Genet B Neuropsychiatr Genet. 2013;162(7):595–603.
Sullivan PF, Fan C, Perou CM. Evaluating the comparability of gene expression in blood and brain. Am J Med Genet B Neuropsychiatr Genet. 2006;141B(3):261–8.
Allen AP, Kennedy PJ, Dockray S, Cryan JF, Dinan TG, Clarke G. The trier social stress test: principles and practice. Neurobiol Stress. 2016;12(6):113–26.
Etzel L, Apsley AT, Mattern BC, Hastings WJ, Heller T, Ram N, et al. Immune cell dynamics in response to an acute laboratory stressor: a within person–between group analysis of the biological impact of early life adversity. Under Review.
Bland JM, Altman DG. Correlation in restricted ranges of data. BMJ Br Med J Online. 2011;11:342.
Koestler DC, Christensen BC, Karagas MR, Marsit CJ, Langevin SM, Kelsey KT, et al. Blood-based profiles of DNA methylation predict the underlying distribution of cell types. Epigenetics. 2013;8(8):816–26.
Sapolsky RM, Romero LM, Munck AU. How do glucocorticoids influence stress responses? Integrating permissive, suppressive, stimulatory, and preparative actions*. Endocr Rev. 2000;21(1):55–89.
Gunawardena D, Jayaweera S, Madhubhashini G, Lokumarakkala DD, Senanayake SJ. Reliability of parameters of complete blood count with different storage conditions. J Clin Lab Anal. 2016;31(2): e22042.
Hedberg P, Lehto T. Aging stability of complete blood count and white blood cell differential parameters analyzed by Abbott CELL-DYN Sapphire hematology analyzer. Int J Lab Hematol. 2009;31(1):87–96.
Nakul-Aquaronne D, Sudaka-Sammarcelli I, Ferrero-Vacher C, Starck B, Bayle J. Evaluation of the Sysmex Xe-2100® hematology analyzer in hospital use. J Clin Lab Anal. 2003;17(4):113–23.
Buttarello M. Quality specification in haematology: the automated blood cell count. Clin Chim Acta. 2004;346(1):45–54.
Layfield SD, Duffy LA, Phillips KA, Lardenoije R, Klengel T, Ressler KJ. Multiomic biological approaches to the study of child abuse and neglect. Pharmacol Biochem Behav. 2021;210: 173271.
Pittella-Silva F. Targeting DNA methylation as an epigenetic leukocyte counting tool. Clin Chem. 2022 Mar 12;hvac035.
Schreier HMC, Heim CM, Rose EJ, Shalev I, Shenk CE, Noll JG. Assembling a cohort for in-depth, longitudinal assessments of the biological embedding of child maltreatment: Methods, complexities, and lessons learned. Dev Psychopathol. 2021;33(2):394–408.
Emmanuel M, Bokor BR. Tanner Stages. In: StatPearls. Treasure Island (FL): StatPearls Publishing; 2022. Available from: http://www.ncbi.nlm.nih.gov/books/NBK470280/
Shalev I, Hastings WJ, Etzel L, Israel S, Russell MA, Hendrick KA, et al. Investigating the impact of early-life adversity on physiological, immune, and gene expression responses to acute stress: A pilot feasibility study. Seedat S, editor. PLoS ONE. 2020;15(4):e0221310.
Goodman LA, Corcoran C, Turner K, Yuan N, Green BL. Assessing traumatic event exposure: General issues and preliminary findings for the Stressful Life Events Screening Questionnaire. J Trauma Stress. 1998;11(3):521–42.
Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta C, Feinberg AP, Hansen KD, et al. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014;30(10):1363–9.
Fortin JP, Triche TJ, Hansen KD. Preprocessing, normalization and integration of the Illumina HumanMethylationEPIC array with minfi. Bioinformatics. 2017;33(4):558–60.
Ludbrook J. Confidence in Altman–Bland plots: a critical review of the method of differences. Clin Exp Pharmacol Physiol. 2010;37(2):143–9.
Doğan NÖ. Bland-Altman analysis: a paradigm to understand correlation and agreement. Turk J Emerg Med. 2018;18(4):139–41.
Parker RA, Scott C, Inácio V, Stevens NT. Using multiple agreement methods for continuous repeated measures data: a tutorial for practitioners. BMC Med Res Methodol. 2020;20(1):154.
Acknowledgements
For Study 1, we thank the children and caregivers for their participation in the study, and Child Health Study staff for their dedication, hard work, and insights. For Study 2, we thank all nurses at the CRC and the participants in this study. We would also like to acknowledge the important formative work that Salas et al. [29] and Houseman et al. [20] have done.
Funding
Research reported in this manuscript was supported by grants from the National Institutes of Health, National Institute of Child Health and Human Development P50HD089922 (J.G.N), R01HD072468 (J.G.N), National Institutes of Aging through R01AG04879 (J.G.N), National Institutes of Aging through R21AG055621 (I.S.), and by the National Center for Advancing Translational Sciences through UL1 TR002014 grant. A.T.A and L.E. were supported by National Institute on Aging Grant T32 AG049676 to The Pennsylvania State University. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Author information
Authors and Affiliations
Contributions
A.T.A, L.E, and I.S contributed to the analysis, conception, design, and writing of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Study 1: Approval for this study was provided by the institutional review board of The Pennsylvania State University (protocol STUDY00006550). Informed assent (child) and consent (caregiver) was obtained from all participants in the study. All methods were carried out in accordance with relevant guidelines and regulations (Declaration of Helsinki). Study 2: The study was approved by the institutional review board of The Pennsylvania State University (protocol STUDY00008751), registered at ClinicalTrials.gov (Identifier: NCT03637751), and all participants provided written informed consent.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Additional file 1.
Supplementary Figures.
Additional file 2.
Supplementary Tables.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Apsley, A.T., Etzel, L., Hastings, W.J. et al. Investigating the effects of maltreatment and acute stress on the concordance of blood and DNA methylation methods of estimating immune cell proportions. Clin Epigenet 15, 33 (2023). https://doi.org/10.1186/s13148-023-01437-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13148-023-01437-5