The dynamic broad epigenetic (H3K4me3, H3K27ac) domain as a mark of essential genes
Clinical Epigenetics volume 13, Article number: 138 (2021)
Transcriptionally active chromatin is marked by tri-methylation of histone H3 at lysine 4 (H3K4me3) located after first exons and around transcription start sites. This epigenetic mark is typically restricted to narrow regions at the 5`end of the gene body, though a small subset of genes have a broad H3K4me3 domain which extensively covers the coding region. Although most studies focus on the H3K4me3 mark, the broad H3K4me3 domain is associated with a plethora of histone modifications (e.g., H3 acetylated at K27) and is therein termed broad epigenetic domain. Genes marked with the broad epigenetic domain are involved in cell identity and essential cell functions and have clinical potential as biomarkers for patient stratification. Reducing expression of genes with the broad epigenetic domain may increase the metastatic potential of cancer cells. Enhancers and super-enhancers interact with the broad epigenetic domain marked genes forming a hub of interactions involving nucleosome-depleted regions. Together, the regulatory elements coalesce with transcription factors, chromatin modifying/remodeling enzymes, coactivators, and the Mediator and/or Integrator complex into a transcription factory which may be analogous to a liquid–liquid phase-separated condensate. The broad epigenetic domain has a dynamic chromatin structure which supports frequent transcription bursts. In this review, we present the current knowledge of broad epigenetic domains.
Aside from being the structural unit in chromatin, the nucleosome is a signaling module responding to changes in metabolism and environmental conditions . The nucleosomal histones, consisting of two each of the histones H2A, H2B, H3, H4, are susceptible to numerous post-translational modifications (PTMs) throughout their entire length (Fig. 1). The modifications either enhance or retard the binding of proteins which alter the structure and function of the chromatin region. Histone PTMs are key components of the critical epigenetic machinery that provides protein-based regulatory information beyond the regulatory cues embedded within the DNA sequence. Among the players regulating the epigenetic machinery are writers, readers, and erasers (http://www.epigeneticmachinery.org/) . Writers are chromatin modifying enzymes that add the modification to histones, non-histone proteins, DNA, and RNA. Readers are proteins that read the cues written by the writer, while erasers represent enzymes that remove the chemical information embodied in the PTM. In the same way, metabolism and epigenetic processes are closely linked, for example, S-adenosyl-L-methionine (SAM) is required for lysine and arginine methyltransferases, acetyl-CoA for lysine acetyltransferases, iron and α-ketoglutarate for lysine demethylases, and ATP for histone kinases (for review see [3, 4]) (Fig. 1).
In the context of nucleosome signaling, histone H3 trimethylated at lysine 4 (H3K4me3) integrates a variety of signaling pathways, including transcription initiation, elongation, and RNA splicing . H3K4me3, a modification that is associated with transcriptionally active/poised chromatin (referred to as an active histone mark), exhibits two distinct distribution patterns, both conserved in yeast, plants, worms, flies, and mammals [6,7,8]. In chromatin immunoprecipitation sequencing (ChIP-seq) experiments, most H3K4me3-enriched nucleosomes are detected as sharp, narrow peaks (< 1 kb) positioned near the transcription start sites (TSSs). Alternatively, a small subset of genes is associated with broad H3K4me3 domains (> 4 kb) extending downstream into their body and exhibiting a lower signal intensity than sharp H3K4me3 peaks. It was suggested that different mechanisms regulate the two H3K4me3 distribution patterns as sharp and broad H3K4me3 peaks do not overlap , for example, the binding of different transcription factors, of different chromatin remodelers, or a differential role for super-enhancers . The nucleosomes of the broad H3K4me3 domain have many different histone PTMs . Henceforth, this chromatin feature of expressed genes will be referred to as the broad epigenetic domain.
In this review, we will focus on the broad epigenetic domains in comparison with the known features of the prevalent H3K4me3 mark enriched at the 5’ end of active genes. We will first briefly summarize the known features of the prevalent H3K4me3 mark enriched at the 5’ end of active genes as well as the enzymes that catalyze deposition/removal (writers/erasers) of H3K4me3 and proteins (readers) that bind to H3K4me3. While these interactions have been mostly studied in the context of the sharp, narrow H3K4me3 peaks near the TSSs, this current understanding of sharp H3K4me3 domains provides a solid foundation for modeling mechanistic concepts related to regulatory proteins for the broad epigenetic domains.
Function of H3K4me3 at the 5’end of active genes
H3K4me3-marked nucleosomes are detected as sharp, narrow peaks flanking the TSSs with their intensity correlating with transcriptional activity . However, the predominant peak maps to the 5′ end of the body of active genes, in association with unmethylated CpG islands [5, 12, 13]. In mammalian cells, this predominant H3K4me3 peak is located at the end of the first exon at the site of the 5’ splice site . The correlation between H3K4me3 peak intensities and transcription levels has been repeatedly demonstrated across species. Yet, there is a long-standing and fundamentally unresolved question about the functional significance of H3K4me3 and several mechanistic possibilities are plausible. H3K4me3 may be causative and instruct RNA polymerase II-mediated transcription or it could be the direct consequence of transcription. Alternatively, H3K4me3 deposition could be an independent concurrent event. Adding to this molecular intricacy, the role that H3K4me3 has on gene transcription may be context dependent . The elegant studies of Cano-Rodriguez et al.  using epigenomic editing showed local induction of H3K4me3 resulted in expression of silenced genes. The authors used the SET domain of the Zebrafish histone methyltransferase PRDM9 fused to catalytically dead Cas9 or zinc finger proteins, which was directed to specific silenced promoters. As long as the targeted promoter was not hypermethylated, directing the methyltransferase and H3K4me3 was sufficient to activate gene expression. This work supports the view that H3K4me3 facilitates transcription initiation through the binding of the H3K4me3 reader TAF3, which is a subunit of TFIID, and the recruitment of the preinitiation complex.
In addition to RNA polymerase II-mediated transcription, H3K4me3 has been linked to pre-mRNA splicing, DNA recombination, and DNA repair [6, 7]. Hence, H3K4me3 may represent a conserved versatile epigenetic mark that could support multiple DNA or RNA related cellular processes.
H3K4me3 writers, erasers, and readers
Although H3K4me3 is relatively stable and sustained potentially for hours and days, it is also a dynamic mark with its levels reflecting the balance between deposition by lysine methyltransferases (KMTs) and removal by lysine demethylases (KDMs) (Fig. 1). In mammals, there are three pairs of H3K4 methyltransferases homologous to yeast Set1: KMT2A/KMT2B, KMT2C/KMT2D, and KMT2F/G (a.k.a SETD1A and SETD1B, respectively). Each KMT2 isoform is present in a complex of auxiliary proteins required for activity and triggering the state (mono-, di-, or tri-) of methylation deposition. KMT2A-D are also known as MLL1-4. The current gene symbols for the MLL-related K4 methyl transferases are, respectively, KMT2D (MLL2), KMT2B (MLL4), KMT2C (MLL3), and KMT2A (MLL1). However, there is a confusion with the nomenclature between MLL2 and MLL4 [7, 17]. The current literature indicates that KMT2F and KMT2G complexes catalyze all three states of methylation and are responsible for most of H3K4 trimethylation including that at the broad H3K4me3 domains, while KMT2A-D produces mono- and/or di-methyl H3K4 [17,18,19,20,21,22] (Table 1). In one of these studies, H3K4 methyl forms were determined by mass spectrometry , circumventing the issue of frequent cross-reactivity of many H3K4me3 antibodies with H3K4me1 or H3K4me2 . Hence, sophisticated biochemical findings support the notion that there are functional distinctions in the enzymatic activities of the KMT2A/B/C/D class versus the KMT2F/G class that may reflect potential distinctions in functions that may not yet be fully appreciated.
The KMT2s are in large multiprotein complexes . Core subunits of the KMT2 complexes are WDR5, ASH2L, RBBP5, and DPY30. The KMT2A/B complexes also have menin and HCFC1 or HCFC2; KMT2C/D complexes have PTIP, PA1, NCOA6 and UTX. The KMT2F/G complexes also consist of WDR82, HCFC1/2, and CXXC1 (a.k.a. CFP1) (Fig. 2A). BOD1L is present in the KMT2G, but not in the KMT2F complex . Except for HCFC1/2, which do not appear in yeast, the proteins in these complexes are highly conserved between yeast, worms, plants, flies, and humans . This extensive phylogenetic conservation indicates that the KMT2s have essential roles in the basic transcriptional machinery of any eukaryotic species, and KMT2 isoform diversification during evolution may parallel increased specialization in different cell types and tissues.
The transcription factors, basal transcription factors, modified histones, and long non-coding RNA interacting with the KMT2 subunits recruit these complexes to chromatin . The CXXC1 subunit of the KMT2F/G complexes directs the enzyme to CpG islands and plays a key role in regulating gene expression through the formation of the H3K4me3 mark  (Fig. 2A). However, CXXC1 is also associated with expressed TSSs that lack CpG islands . CXXC1 is a reader of the H3K4me3 mark, binding to this modified site via its PHD domain . The absence of CXXC1 results in reduced transcription of CXXC1-occupied genes  and in disruption of cell differentiation (e.g., CD4+ T and TH17 cell differentiation) [25, 27]. In contrast to the maintenance of the H3K4me3 mark, establishment of this mark at highly transcribed genes may be through KMT2F/G complexes binding to the transcriptional machinery [15, 21].
Ubiquitinated H2B (uH2B) stimulates the activity of most KMT2s  (Fig. 1). Only KMT2G does not require uH2B to form H3K4me3 in an in vitro reconstituted chromatin system  (Table 1). Our group has previously shown that the ubiquitination of H2B is dependent on transcription . Thus, transcription elongation, which supports the formation of uH2B, may be a prerequisite for the H3K4 trimethylation uH2B-dependent activity of KMT2F. Recently, it was reported that proteasome inhibition would result in the spreading of H3K4me3 into transcribed gene bodies . Although the authors did not identify the specific epigenetic and transcriptional regulator(s) targeted by the proteasome, we offer the following for consideration. As proteasome inhibitors (e.g., MG132) result in the rapid deubiquitination of nucleosomal histones , KMT2G complexes may be responsible for the spreading of H3K4me3 as this H3K4 methyltransferase does not require uH2B for activity.
The molecular readers of the H3K4me3 mark bring a range of activities to modified chromatin regions . The readers include protein/complexes that are involved in: (1) transcription regulation [TAF3 (TFIID complex), transcription factor (BPTF) associated protein of 18 kDa (BAP18), Spindlin1, ING proteins], (2) chromatin modification (PHF23 which recruits SIN3-HDAC, KDM5B; and ING proteins which are associated with KATs and HDAC complexes); (3) chromatin remodeling [chromodomain helicase DNA binding protein 1 (CHD1)]; (4) elongation [Polymerase-Associated Factor (PAF), Facilitates Chromatin Transcription (FACT)]; and (5) RNA splicing (U2 snRNP, U2AF65) [5, 32,33,34,35,36,37] (Fig. 2A) (Table 1). Several lysine acetyltransferases (KATs) (e.g., KAT2B, KAT7, KAT2A in SAGA complex) bind to H3K4me3 . The recruitment of KATs to H3K4me3 nucleosomes results in heightened acetylation which is rapidly deacetylated by histone deacetylases (HDACs) (Fig. 2A). Inhibition of HDACs results in rapid hyperacetylation of the H3K4me3-modified nucleosome . The KATs that may be involved in the hyperacetylation of H3K4me3-nucleosomes include those that bind to the H3K4me3 (KAT2B) and those that are associated with the KMT2s (CBP/p300) [38,39,40,41].
In humans, there are six H3K4 demethylases. KDM1A and B remove methyl groups from H3K4me1 and H3K4me2, but not from H3K4me3, while KDM5A-D (a.k.a. JARID1A-D) is able to remove methyl groups from each of the 3 states of H3K4 methylation [5, 7].
Breadth of the broad epigenetic domain as a key chromatin signature
In 2010, 2014 and 2015, the functional role of broad epigenetic domains with a focus on H3K4me3 was evidenced by statistical analyses of pooled high-throughput genomics data [9, 42,43,44,45]. A meta-analysis of H3K4me3 ChIP-seq data across > 20 different cell and tissue types in mice and humans, revealed a subset of genes marked by a broad H3K4me3 domain spanning the TSS. In identifying genes with the broad H3K4me3 domain, typically ChIP-seq peaks are called using the broad-peak function in MACS2 [9, 42,43,44]. The genes in the top 5% broadest H3K4me3 domains are then selected for further analyses as these broadest H3K4me3 domains discriminated cells or tissues according to their lineage in mammalian cells [42, 43] (Fig. 3A). These very broad H3K4me3 peaks were low signal intensity compared to narrow peaks . The top 5% broadest H3K4me3 domains were enriched in genes involved in cell type-specific functions and cell identity, that is genes coding for factors essential to the establishment of cell lineage (see Glossary for definition of cell identity genes). This chromatin signature is conserved across species and taxa, from yeast to mammals including plants, flies, worms, and chicken [42, 43, 45, 46]. For examples, in human estrogen receptor positive (ER +) breast cancer MCF7 cells, the GATA3 gene has a broad epigenetic domain (Fig. 2B). In chicken erythroid cells, genes [e.g., IRF7, FTH1, HBBA (β-globin), HBA1 (α-globin)] have the broad epigenetic domain  (Fig. 3). Of note, the angiotensin-converting enzyme 2 (ACE2) gene, which codes for the receptor for the three coronaviruses HCoV-NL63, SARS-CoV and SARS-CoV-2, has a broad epigenetic domain in heart and small intestine, which are tissues with the highest ACE2 expression .
Genes with the broad epigenetic domain displayed both enhanced transcriptional consistency and output. Single-cell RNA-seq showed that the variance in expression of genes with the top 5% broadest H3K4me3 domains was reduced compared to genes without the broad H3K4me3 domain . Transcription is not a continuous process but occurs in bursts. Genes that are transcribed at higher rates have a greater number of transcriptional bursts. Among the burst parameters are burst fraction (the number of alleles in a cell that are being transcribed) and burst size (the number of transcripts generated per burst) . The burst fraction relates to the burst frequency, i.e., how frequent bursts occur, since burst size and frequency are often the measures characterizing transcription bursting. Super-enhancers, which are required to maintain broad epigenetic domains, would increase both the burst size and fraction of the target gene [10, 49].
Chromatin dynamics of broad epigenetic domains
The broad epigenetic domain has numerous histone PTMs, including H3K4me3, H3K23ac, H3K27ac, H3K79me1/2/3, H4K12ac, H4K20me1, H2BK5me, and H4R3me2a [9, 47, 50,51,52]. In addition to these histone PTMs, some broad epigenetic domains (e.g., GATA3 and FOXA1 in MCF7 cells) are associated with mitogen- and stress-activated protein kinase-generated H3S10ph and H3S28ph [53, 54]. The mitogen- and stress-activated protein kinase cooperates with CBP/p300 (gene symbol EP300) to produce H3K27ac/S28ph .
The dynamic histone modification of the broad epigenetic domain depends on the net activities of KATs/HDACs for acetylated histones, and KMTs/KDMs for methylated histones. The breadth of the broad H3K4me3 domain is dependent upon the net activity of KMT2F/G and KDM5A-D enzymes (Fig. 2A). Changes in the balance of these enzymes alter the breadth of the broad H3K4me3 domain and gene expression. For example, KDM5 inhibition results in increasing the broadness of promoter H3K4me3 peaks and increasing number of ER + breast cancer cells expressing the gene with the broad domain .
Considering the plethora of histone PTMs and the presence of chromatin remodelers (CHD1) , the broad epigenetic domain has an exceptionally dynamic chromatin structure (Fig. 4A), which would enable high transcription outputs and rapid response of inducible genes. The H3K4me3 modified nucleosomes are dynamically acetylated . In the highly acetylated state, the H3K4me3 nucleosomes located downstream of the transcription start site would maintain an atypical nucleosome structure (called the U-shaped nucleosome) following the passage of RNA polymerase II [56, 57]. Such atypical nucleosome structures could support frequent rounds of transcription bursts.
CBP/p300 mediates the acetylation of the H3K4me3 nucleosome . CBP/p300 acetylates multiple sites on the nucleosomal histones and regulates the acetylation of H2B (Ks on N-terminal tail) and H3 (K18, K27, K36)  (Fig. 1). Among the four core histones, H2B is the most dynamically acetylated . In addition to the core histones, CBP/p300 acetylates numerous other proteins (referred to as acetyl-spray), several of which are involved in chromatin modification and transcriptional regulation . Functional consequences of these acetylation events on the recipient proteins almost certainly occur and will require a major experimental effort to be fully deciphered. CBP/p300 co-condenses with transcription factors and could have a role in the transcriptional bursting of genes with the broad epigenetic domain .
The broad epigenetic domains have a nucleosome-free character as evidenced by FAIRE-seq and OCEAN-C [61, 62]. This feature of a destabilized chromatin structure for genes with the broad domain, which can be detected by FAIRE-seq or DNase-seq, is observed in vertebrate cells and is likely due to the action of CHD1 . Figure 2B shows the association of CHD1 with the MCF7 GATA3 broad epigenetic domain and the chromatin accessibility along the gene body as assayed through DNase-seq. CHD1, an ATP-dependent chromatin remodeler, is a reader of H3K4me3 and binds to the broad epigenetic domain [46, 50]. CHD1 is involved in the nucleosome turnover downstream of the transcription start site, regulates chromatin accessibility, and transcription elongation . The remodeling activity of CHD1 promotes passage of RNA polymerase II through nucleosomes, alleviates RNA polymerase II pausing, facilitates high transcriptional outputs, and incorporation of the replacement histone H3.3 [64,65,66,67] (Fig. 4A).
CHD1 interacts with the H3K4me3 readers PAF, Mediator, FACT, and the U2 snRNP splicing complex. CHD1 remodeling activity is required to maintain uH2B levels and is required for the expression of differentiation-activated genes . In maintaining the levels of uH2B, CHD1 activity would enhance the activities of the KMT2s (e.g., SETD1A) and perhaps plays a key role in maintaining the broad epigenetic domain .
The activity of CHD1 would result in nucleosome dissolution and reassembly, which could introduce the replacement histones such as H3.3 into the reassembled nucleosome as well as remove modified histones (Fig. 4A). H3.3 is named a replacement histone because it is expressed throughout most of the cell cycle and is available in the histone pools to form nucleosomes outside of S phase of the cell cycle. In contrast, most human histone genes are replication-dependent and are housed in large gene arrays on chromosomes 1 (13 genes) and 6 (55 genes). H3.3 is encoded by H3F3A and H3F3B genes that are not in the histone gene cluster. The H3.3 containing nucleosome would gain a richness of PTMs, consistent with previous reports .
Removal of the nucleosome via CHD1 also results in the removal of histone PTMs associated with the nucleosome, some of which are not readily de-modified (e.g., H4R3me2a) (Fig. 4A). The reassembly of the nucleosomes allows the chromatin modifying enzymes to lay down fresh histone PTMs to the newly formed histone octamer, which will contain the replacement histone variants, and the recruitment of readers of the histone PTMs.
With only two copies of the gene coding for H3.3, any mutation in these gene that impacts the modifications of this histone variant can have deleterious impacts on the genome, leading to cancer. Thus, these mutated histones are referred to as oncohistones . The incorporation of mutant H3.3 (an oncohistone) in which the altered amino acids change PTMs could impact chromatin properties of the broad epigenetic domain and alter expression of genes with the broad epigenetic domain, contributing to diseases associated with H3.3 mutants [69, 70].
Broad epigenetic domains and anticancer drugs
The destabilized chromatin structure of the broad epigenetic domain would make the broad epigenetic domain particularly vulnerable to the action of anticancer DNA intercalating drugs (Fig. 4B). Our studies reported that in vitro, the atypical nucleosomes associated with transcribed avian erythroid chromatin were sensitive to ethidium bromide dissociation . In situ, H3K4me3-modified nucleosomes near promoters of transcribed genes were sensitive to dissociation by the anthracycline doxorubicin, a commonly used anticancer drug [72, 73]. The nucleosome dissociation and loss of H3K4me3 from human melanoma MelJuSo cells was observed after treatment with 9 uM doxorubicin for four hours. Further this selective dissociation of H3K4me3-modified nucleosomes and the increase in nucleosome-free regions were detected by an increase in FAIRE-seq peaks near promoter regions of active genes in MelJuSo cells . The doxorubicin-mediated dissociation was independent of ATP and transcription . In the administration of doxorubicin to breast cancer patients, cardiotoxicity is a major side effect . The anticancer and cardiotoxicity of doxorubicin may be due to the dissociation of destabilized nucleosomes from the broad epigenetic domains in cancer cells and normal cells of the heart, resulting in altered gene expression. In this context, anthracyclines like doxorubicin could be considered as epigenetic drugs that could severely disrupt the structure and expression of genes with the broad epigenetic domain.
Broad epigenetic domains and cancer
Through integrative analysis of 1,134 genome-wide epigenetic profiles, mutations from > 8200 tumor-normal pairs and experimental data from clinical samples, Chen and colleagues discovered that tumor suppressor genes in normal cells were characterized by broad epigenetic domains. Broad epigenetic domains correlated with increased transcription elongation and enhancer activity, thus with extremely high gene expression. Moreover, a comparison of ENCODE H3K4me3 ChIP-seq data sets from 105 normal and 63 cancer samples showed that many tumor suppressor genes in cancer cell lines were associated with a shortening of their H3K4me3 domain .
More recently, Gopi and colleagues reported an integrative analysis of the epigenomes of 60 human cancer cell lines in the NCI-60 panel . The oncogene MYC had a broad epigenetic domain in all 60 cancer cell types. GO annotation of genes with the broad epigenetic domain revealed cancer type-specific GO terms. The authors concluded that the identification of genes with the broad epigenetic domain in the various cancer cell types provided insights into the cancer phenotype. For example, in melanoma cells, the broad epigenetic domains had enrichment in the pigmentation GO terms. Further the broad epigenetic cancer-type specific patterns were distinct from those in normal cells.
In another recent study from the research groups of Vahid Asnafi and Salvatore Spicuglia, genes with the broad H3K4me3 domain in normal thymocytes and T acute lymphoblastic leukemia (T-ALL) were identified . The authors applied an approach that differed from previous approaches [9, 42] to identify broad H3K4me3 domains; two cut-offs (high and low inflection points of ranked H3K4me3 peaks) identified broad H3K4me3 domains but also intermediates and sharp peaks . As was the case with other cell types, the thymocyte genes with broad H3K4me3 domains were involved in tissue-specific functions. Further, the genes with the broad H3K4me3 domain were (1) highly dynamic throughout T cell differentiation, and (2) coded for transcriptional regulators involved in T-ALL leukemogenesis. The broad H3K4me3 domains were associated with H3K79me2, less so with H3K36me3, and displayed reduced RNAPII pausing. In analyses of nine primary T-ALLs and seven T-ALL-derived cell lines, the authors found that T-ALL oncogenes, including oncogenic long non-coding RNAs, had the broad H3K4me3 domain. Thus, T-ALL deregulated driver oncogenes had gained the broad H3K4me3 domain signature. The authors noted that there was a switch in the genes with the broad H3K4me3 domain when comparing normal T-cell precursors with leukemic cells; the T-ALL changed T-cell identity genes with cancer-related genes.
In inhibitor studies, the authors reported that the genes with the broad H3K4me3 domain were sensitive to elongation inhibitors (THZ1 (cyclin kinase CDK7 inhibitor), EPZ-5676 (DOTL1 inhibitor)) . Of potential clinical significance, PBIT [2-(4-methylphenyl)-1,2-benzisothiazol-360 3(2H)-one], an inhibitor of the KDM5 family, strongly inhibited the proliferation of cell lines (e.g., Loucy) that had large numbers of broad H3K4me3 domains.
A comparison of the genes with the broad epigenetic (H3K4me3) domain in MCF7 [estrogen receptor (ER) + , luminal A], MDA-MB-231 (triple negative breast cancer cells), and MCF10A (basal epithelial cells representing a normal-like subtype) revealed the genes with this signature that were unique to each cell line  (manuscript in preparation). Analyses of these unique genes using Gene ontology and functional and pathway enrichment analysis using Genecodis and Ingenuity Pathway Analysis demonstrated that many genes containing the broad H3K4me3 domain code for sequence-specific DNA binding proteins, chromatin binding proteins, and proteins involved in the regulation of transcription within molecular functions. Hence, the broad epigenetic domain may mark key transcriptional regulators that drive a specific form of cancer.
In MDA-MB-231 breast cancer cells, which represent a more aggressive form of breast cancer, the PPP1R15A and SPRY4 (Sprouty4) genes are marked by a broad epigenetic (H3K4me3) domain. The broad H3K4me3 domain is absent or considerably shorter in MCF7 and in “normal” MCF10A1 cells. The PPP1R15A gene encodes protein phosphatase 1 regulatory subunit 15A (a.k.a. GADD34, growth arrest and DNA damage-inducible gene 34). The PPP1R15A protein is a pleiotropic regulator that has key functions in multiple cellular processes including DNA damage pathways, endoplasmic stress responses, and chromatin remodeling. The latter is inferred from the observation that PPP1R15A binds to hSNF5/INI1, a component of the SWI/SNF chromatin remodeling complex . Consistent with the general clinical relevance of perturbations in broad epigenetic domains, the PPP1R15A gene constitutes one of the genes in the hypoxia-related prognostic signature for breast cancer . The biological importance of these findings is not restricted to breast cancer cells, because the PPP1R15A gene also has a broad H3K4me3 domain signature in multiple cell types, including erythroleukemic K562, and lung epithelial A549 cells . The SPRY4 is a tumor suppressor gene that inhibits the receptor-transduced mitogen-activated protein kinase signaling pathway and is a negative regulator of interferon signaling . Silencing SPRY4 expression in MDA-MB-231 cells increases cell proliferation, migration, and metastatic properties [82, 83].
In MCF7 cells, but not in MDA-MB-231 and MCF10A1 cells, the FOXA1 gene has the broad epigenetic (H3K4me3) domain. Further, the GATA3 gene has a broad epigenetic (H3K4me3) domain in MCF7 cells (Fig. 2B) but not in MDA-MB-231 cells, while in MCF10A1, the broad H3K4me3 domain of the GATA3 gene is considerably shorter. FOXA1 and GATA3 have critical roles in ER + breast cancer in that these transcription factors form a regulatory network with ERα. These two transcription factors act as pioneer transcription factors which open condensed chromatin and are critical to maintain epithelial cell identity . Reduced expression of FOXA1 or GATA3 results in increased metastatic progression and poor disease-free survival . Highlighting these genes differentially marked with the broad H3K4me3 domain in MDA-MB-231 cells (SPRY4) and in MCF7 cells (FOXA1 or GATA3) demonstrates the importance of these genes in maintaining the cellular properties of these two different types of breast cancer cells. Likewise, reducing expression of these genes may increase the metastatic potential of tumor cells and potentially lead to a poor prognosis. Reduction of SPRY4 expression, for example, increases cancer stem cell properties in MDA-MB-231 cells, while reduced expression of FOXA1 and GATA3 by inhibiting the mitogen-and stress-activated protein kinase (MSK1) enhances the metastatic progression of ER + breast cancer. Silencing these genes with the broad H3K4me3 domain results in a more aggressive form of cancer.
Broad epigenetic domains and brain function
The H3K4me3 chromatin distribution of sorted neuronal and non-neuronal nuclei in human post-mortem, non-human primate, and mouse prefrontal cortex were analyzed through extensive bioinformatics approaches of epigenomic and transcriptomic data, in the context of cell-type-specific regulation . In mouse prefrontal cortex neurons, the broadest epigenetic (H3K4me3) domains (≥ 10 kb) were related to synaptic function and GABAergic signaling. Moreover, about 120 of the broadest epigenetic domains, annotated to human genes largely involved in dopaminergic and glutamatergic signaling, were conserved in chimpanzee, macaque, and mouse cortical neurons. In agreement with previous results, position of cell type-specific broad epigenetic domains correlated with the expression of genes that control cell identity. From a clinical perspective, this study supports the role of genes with the broad epigenetic domain in psychiatric disorders such as autism and schizophrenia .
Behavioral studies in mice (i.e., experiments on contextual fear conditioning) established that broad epigenetic domains are specifically present in genes activated during memory formation in cornu ammonis (CA1) neurons of the hippocampus (i.e., Calm1, Fos, and Npas4). Because Fos and Npas4 are immediate early genes, it appears that genes marked by a broad epigenetic domain are poised for activation and become highly expressed following stimulus exposure .
Breadth of the broad epigenetic domain as a key element in three-dimensional chromatin interactions
It has been shown that compared to H3K4me3 narrow enrichment tracts, broad epigenetic (H3K4me3) domains were preferentially involved in 3D chromatin interactions that were required for transcription of the relevant genes [10, 61, 86, 87] (Fig. 5). These studies used RNA polymerase II-associated ChIA-PET (chromatin interaction analysis by paired-end tag sequencing) data in K562 chronic myelogenous leukemia cells and MCF-7 breast cancer cells , ChIA-PET, Hi-C (a method based on chromosome conformation capture), and Hi-ChIP (HiC chromatin immunoprecipitation) data in K562, MCF7 and GM12878 lymphoblastoid cells , OCEAN-C (Open Chromatin Enrichment And Network Hi-C) in U266 myeloma cells, RPMI-8226 multiple myeloma cells and GM12878  and 3C (chromosome conformation capture) in tissue from the cerebellum of mice . They demonstrate the relevance of broad epigenetic domains, particularly in the context of cancer. OCEAN-C, combining Hi-C and FAIRE-seq, detects hubs of open chromatin interactions (HOCIs) . It involves a phenol–chloroform extraction and isolation of the DNA to be sequenced from the aqueous phase, thus resulting in the detection of nucleosome-depleted-region or open-chromatin interactions. This method would be more specific than the other methods above as it relies on enhancers being nucleosome-depleted regions. It was evidenced that OCEAN-C is a method able to identify interactions between open chromatin regions which are essential for gene transcription. Moreover, pathway enrichment analysis of the genes with a broad epigenetic domain overlapping with HOCIs, revealed that the majority of top enriched pathways (transcriptional misregulation in cancer, thyroid hormone synthesis, p53 signaling pathway, cell cycle, microRNAs in cancer) were related to cancer in U266 cells . Alterations in the p53 signaling pathway, which is involved in regulating cell cycle, cell death (apoptosis), and chromatin repair, are often altered in cancer cells. Further alterations in microRNAs can change the mRNAs that are translated in cancer cells.
Super-enhancers have been identified by the presence of the Mediator complex (and/or Integrator complex), H3K27ac, and CBP/p300 [88,89,90,91,92]. Note that Mediator and Integrator are major multiprotein complexes that associate with RNA polymerase II. Although Mediator and Integrator play important roles in enhancer/super-enhancer/promoter interactions, these complexes have enhancer/super-enhancer specific functions (e.g., Integrator requirement for immediate-early gene induction and expression) [91, 92]. Transcription factor circuits, such as the TCF4 circuit in mouse neural stem cells, appear to be important in the formation of the 3D interactions between super-enhancers and genes with the broad epigenetic domain. Key in forming these interactions is the Mediator complex which binds to transcription factors, chromatin remodelers (SWI/SNF), chromatin modifying complexes (KMT2, CBP/p300, NuRD histone deacetylase complex), coactivators, cohesin and other proteins. One study reported the detection of genes, enriched in transcriptional regulators, with superposed super-enhancer and broad epigenetic domain on the basis of the broad H3K4me3 domain tracking with Mediator, transcription factors (TCF4, NF1), and chromatin remodelers (CHD7) . However, whether the transcription factors are binding to the broad epigenetic domain directly or indirectly could be questioned. The transcription factor could be binding to an enhancer or super-enhancer and cross-linked to the broad epigenetic domain when interaction between the regulatory region and broad epigenetic domain occurs.
The interactions between the enhancers/super-enhancers and broad epigenetic domain marked genes are dynamic. Contributing to this dynamic interaction may be polyADP-ribosylation of the histones [93, 94]. During transcription initiation and elongation, topoisomerases I and II are involved in the release of torsional stress generated during transcription [95, 96]. The action of these enzymes in generating single-stranded and double-stranded DNA breaks could result in the recruitment of polyADP polymerase. Of the histone modifications, polyADP ribosylation is the modification most rapidly added and removed.
A model for the establishment of the broad epigenetic domain
It remains to be determined how broad epigenetic domains are established for specific genes. However, transcription factor circuits that include pioneer and cell type specific transcription factors binding to enhancers, super-enhancers and upstream promoter elements would likely establish the interactome between these regulatory elements (e.g., KLF5 in human epithelial cancers; TCF4 in neural stem cells) [89, 97, 98]. The transcription factors binding to and mediating these 3D interactions may be assembled into transcription factories [99,100,101], which may have the properties of a liquid–liquid phase-separated condensate  (Fig. 5). Transcription factors also recruit chromatin modifying enzymes [protein arginine methyltransferase 1 (PRMT1), CBP/300] to enhancers/super-enhancers and promoters [51, 60, 103,104,105]. PRMT1 is required to establish the active chromatin state and is needed for hematopoietic differentiation in avian and mammalian cells [103, 105, 106]. Interaction between the locus control region (a super-enhancer) and the β-globin promoter in the β-globin gene (broad epigenetic domain) is prevented when PRMT1 was knocked down. The interaction of the locus control region with the globin promoter increases transcriptional bursting (burst fraction and size) .
PRMT1 binds to CBP/p300 as well as many other proteins. The PRMT1 product, H4R3me2a, stimulates that activity of CBP/p300 to acetylate H3K27. Further, CBP/p300 co-condenses with transcription factors, and the co-condensation modulates transcriptional bursting. Thus, PRMT1 may have a role in activation of enhancers via the enzyme’s recruitment of CBP/p300 and H3K27 acetylation and in interaction of the enhancer/super-enhancer/locus control region with the target promoter, a step that can take place before enhancer activation . Recently, it was reported that PRMT1 regulates the gene expression program in mouse mature β-cell and is required for maintaining mature β-cell identity . These observations suggest that PRMT1 plays a pivotal role in the establishment of the broad epigenetic domain interactome.
The recruitment of the general transcription machinery would lead to the first round of transcription, which is often needed to recruit chromatin modifiers (KATs) and chromatin remodelers (CHD1 ,), and would establish histone PTMs (e.g., acetylated histones, ubiquitinated H2B) [29, 107]. The assembly of chromatin modifiers, chromatin remodelers, coactivators, Mediator, long non-coding RNA, and other regulatory molecules would contribute to the highly modified, chromatin dynamic state of genes with the broad epigenetic domain. Also for consideration is the interaction of the enhancer/super-enhancers with the target gene in establishing the broad epigenetic domain. KMT2B (MLL4) at super-enhancers is involved in the genesis of the broad epigenetic domain . However, the enzyme’s role in creating chromatin accessibility at enhancers/super-enhancers does not require the enzyme’s catalytic activity, which produces H3K4me1 [108, 109]. A model is emerging in which interactions (multivalent/hydrophobic) between the intrinsic disordered domains of Mediator (e.g., MED1 subunit), CBP/p300, and transcription factors form phase-separated condensates [60, 102, 110] (Fig. 5). Integrator is also likely involved in the formation of condensates, a role that future studies will need to address. The condensates would support long residence times of the transcription factors and chromatin modifying/remodeling factors on the enhancers/super-enhancers/locus control regions/promoters resulting in increased transcriptional burst frequencies and transcriptional consistency . As discussed by Zamudio et al. , the phase-separated three-dimensional condensate model, and within this model the chromatin organization and expression of genes with broad epigenetic domain would help in our understanding of pathological events leading to disease.
Limitations with analyses of broad epigenetic domains
The bioinformatic pipeline for peak detection from ChIP-seq data often includes a step where the issue of duplicate reads poses the challenge whether these reads should be filtered out or kept in. The duplicates are often artifacts from PCR bias but sometimes can be due to genomic regions that inherently get overamplified resulting in artifacts and noise . These regions, termed blacklisted regions, often have underlying sequences enriched with repeat sequences and elements. Removal of these regions often improve sequence quality leading to better signal-to-noise ratio . Unfortunately, they are often found in open chromatin regions and removing them may result in miscalling of pileups, underestimation of signal level and adversely affect downstream integrative and differential analysis leading to wrong biological interpretation [113, 114]. Each peak calling tool addresses this filtering issue differently by setting threshold cut offs and parameters that only “true peaks” can satisfy. For MACS2, a smaller value for lower cut off allows detection of larger number of regions, increasing the chance of confounding factors to affect the outcome of the model . For tools utilizing the Hidden Markov Model, the strictness of peak identification is often based on the cut off posed on the posterior probability . The latter method is considered better for peak determination as it provides a measure of confidence compared to simple binary outputs . Unlike narrow peaks, broad domains are typically of low intensity which results in poor model training and technical variabilities .
Of the various broad peak calling tools, MACS2 (broad setting) is the most used. The other tools include: Broadpeak, Zinba, RSeg, SICER, and histoneHMM [115, 119,120,121,122,123]. Most of these tools work by filtering duplicates and aggregating nearby peaks to determine broad peaks. The algorithm specific region bias, probability threshold and parameters vary from tool to tool. The variation in stringency levels among the tools lead to variation in identified peaks raising the issue of defining optimum broad domain parameters. This issue of default operating resolution among the tools is not limited to bivariant setting but is also seen with tools for multivariant peak calling such as ChromHMM . It is recommended not to feed broad peak data into the ChromHMM tool as it conflicts with the resolution at which it determines the combinatory states. Hence, characterization of broad domains requires careful consideration of peak calling applications and thresholding parameters. These factors are still being optimized.
A note of caution is warranted regarding the interpretation of these data when it comes to enhancers and super-enhancers. In a previous study, H3K27ac was deemed the mark of choice to identify a large fraction of super-enhancers, while minimizing erroneous DNA regions. Thus, in many of the bioinformatic studies, super-enhancer prediction entailed stitching H3K27ac peaks together within 12.5 kb of one another in the linear genome . However, we argue that this labeling of H3K27ac broad domains as enhancers or super-enhancers is prone to errors as it does not consider other marks such as H3K4me1 and does not meet the classical or functional definition of an enhancer or super-enhancer (see Glossary for definition of terms). In brief, calling a H3K27ac broad domain an enhancer or super-enhancer is a gross oversimplification that ignores the intricate interactions governing gene regulation. In fact, many histone PTMs mirrored the H3K4me3 mark either as sharp peaks near the TSSs or as broad peaks in the gene bodies, including H3K27ac [9, 86] (Fig. 2B), leading to terminology confusion and inconsistencies.
We envisage that each gene allele with a broad epigenetic domain is in a state of flux with CHD1-mediated nucleosome dissolution and reformation along the gene body (Fig. 4A). The nucleosomes would be highly modified. A limitation of the current picture we have at present is that the ChIP-seq and accessibility measurements are the average of these events in millions of the cells in the analyses. These assays can become even more complicated when using clinical specimens in which the intensity of the histone PTM levels (e.g., H3K27ac) may reflect phenotypic heterogeneity . To further explore the dynamics of the broad epigenetic domain, single cell ChIP-seq and accessibility assays are required . MAPit, which provides single allele accessibility information, would be an excellent method to view the heterogeneity of nucleosome and protein (transcription factor) occupancy of specific alleles with the broad epigenetic domain in the study population of cells. The MAPit method applies an exogenous DNA methyltransferase, Chlorella virus protein M.CviPI, which methylates GC when not occupied by a nucleosome or DNA-binding protein [128,129,130]. In addition to visualizing chromatin accessibility for each allele in the population, the method also shows the DNA methylation (five methyl CG) for each allele. A similar strategy but using a different methyltransferase (DNA N6-adenine methyltransferase) is called Fiber-seq . For both methods (MAPit and Fiber-seq) long read sequencing is required (single-molecule circular consensus sequencing with a Pacific Biosciences instrument).
Genes with the broad epigenetic signature have critical roles in cell identity and essential cell functions. Thus, identification of marked genes provides information about essential genes in normal and diseased states of cells as highlighted in our example of genes with the broad epigenetic domain in human breast cancer cells and in chicken polychromatic erythrocytes. Of clinical relevance, epigenetic therapy approaches [76, 132, 133] that alter the expression of genes with the broad epigenetic domain must be cognizant that silencing these genes may make the cancer more aggressive as shown with SPRY4 in triple negative breast cancer. Mutations in the DNA sequence transcription factor binding sites in enhancers and super-enhancers, altered expression of transcription factors, and expression of oncohistones (replacement histones such as H3.3) will impact the formation and function of the broad epigenetic domain marked genes . The chromatin structure of the broad epigenetic domain is very dynamic, with histone PTMs, nucleosome dissolution/ reassembly, and histone variant exchange. The enzymes catalyzing these events alter the function of each other by interacting and mutually modifying each other. Layered on these events is metabolism which provides the factors fueling the enzymes  (Fig. 1). The transcription factories housing the broad epigenetic domain and its interactome would have a high demand for energetic metabolites such as ATP, S-adenosyl methionine, acetyl-CoA, NAD, and α-ketoglutarate. Disturbance in the availability of these metabolites in the diet or as a consequence of disease may impact the expression of genes with the broad epigenetic domain [134, 135]. Exploring the orchestration of the metabolites and epigenetic players in the formation and maintenance of transcription factories/hubs with genes with a broad epigenetic domain and their interactome of enhancers and super-enhancers is a prerequisite to understanding cell differentiation and disease.
Availability of data and materials
Unprocessed raw files are available from the corresponding author on reasonable request.
Chromodomain helicase DNA binding protein 1
Chromatin immunoprecipitation sequencing
Facilitates Chromatin Transcription
Formaldehyde-Assisted Isolation of Regulatory Elements
Histone H3 trimethylated at lysine 4
Protein arginine methyltransferase
T acute lymphoblastic leukemia
Transcription start sites
Turner BM. The adjustable nucleosome: an epigenetic signaling module. Trends Genet. 2012;28:436–44.
Boukas L, Havrilla JM, Hickey PF, Quinlan AR, Bjornsson HT, Hansen KD. Coexpression patterns define epigenetic regulators associated with neurological dysfunction. Genome Res. 2019;29:532–42.
Boon R, Silveira GG, Mostoslavsky R. Nuclear metabolism and the regulation of the epigenome. Nat Metab. 2020;2:1190–203.
Reid MA, Dai Z, Locasale JW. The impact of cellular metabolism on chromatin dynamics and epigenetics. Nat Cell Biol. 2017;19:1298–306.
Davie JR, Xu W, Delcuve GP. Histone H3K4 trimethylation: dynamic interplay with pre-mRNA splicing. Biochem Cell Biol. 2015;94:1–11.
Howe FS, Fischl H, Murray SC, Mellor J. Is H3K4me3 instructive for transcription activation? BioEssays. 2017;39:1–12.
Collins BE, Greer CB, Coleman BC, Sweatt JD. Histone H3 lysine K4 methylation and its role in learning and memory 06 Biological Sciences 0604 Genetics 11 Medical and Health Sciences 1109 Neurosciences. Epigenet Chromatin. 2019;12:1–16.
Park S, Kim GW, Kwon SH, Lee JS. Broad domains of histone H3 lysine 4 trimethylation in transcriptional regulation and disease. FEBS J. 2020;287:2891–902.
Chen K, Chen Z, Wu D, Zhang L, Lin X, Su J, et al. Broad H3K4me3 is associated with increased transcription elongation and enhancer activity at tumor-suppressor genes. Nat Genet. 2015;47:1149–57.
Dhar SS, Zhao D, Lin T, Gu B, Pal K, Wu SJ, et al. MLL4 is required to maintain broad H3K4me3 peaks and super-enhancers at tumor suppressor genes. Mol Cell. 2018;70:825-841.e6.
Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, et al. High-Resolution profiling of histone methylations in the human genome. Cell. 2007;129:823–37.
Maunakea AK, Nagarajan RP, Bilenky M, Ballinger TJ, Dsouza C, Fouse SD, et al. Conserved role of intragenic DNA methylation in regulating alternative promoters. Nature. 2010;466:253–7.
Chen RAJ, Stempor P, Down TA, Zeiser E, Feuer SK, Ahringer J. Extreme HOT regions are CpG-dense promoters in C. elegans and humans. Genome Res. 2014;24:1138–46.
Bieberstein NI, Oesterreich FC, Straube K, Neugebauer KM. First exon length controls active chromatin signatures and transcription. Cell Rep. 2012;2:62–8.
Brown DA, Di Cerbo V, Feldmann A, Ahn J, Ito S, Blackledge NP, et al. The SET1 complex selects actively transcribed target genes via multivalent interaction with CpG island chromatin. Cell Rep. 2017;20:2313–27.
Cano-Rodriguez D, Gjaltema RAF, Jilderda LJ, Jellema P, Dokter-Fokkens J, Ruiters MHJ, et al. Writing of H3K4Me3 overcomes epigenetic silencing in a sustained but context-dependent manner. Nat Commun. 2016;7:1–11.
Yang W, Ernst P. SET/MLL family proteins in hematopoiesis and leukemia. Int J Hematol. 2017;105:7–16.
Lee JH, Tate CM, You JS, Skalnik DG. Identification and characterization of the human Set1B histone H3-Lys 4 methyltransferase complex. J Biol Chem. 2007;282:13419–28.
Shinsky SA, Monteith KE, Viggiano S, Cosgrove MS. Biochemical reconstitution and phylogenetic comparison of human SET1 family core complexes involved in histone methylation. J Biol Chem. 2015;290:6361–75.
Wu M, Wang PF, Lee JS, Martin-Brown S, Florens L, Washburn M, et al. Molecular regulation of H3K4 trimethylation by Wdr82, a component of human Set1/COMPASS. Mol Cell Biol. 2008;28:7337–44.
Poreba E, Lesniewicz K, Durzynska J. Aberrant activity of histone–lysine n-methyltransferase 2 (Kmt2) complexes in oncogenesis. Int J Mol Sci. 2020;21:1–37.
Sze CC, Ozark PA, Cao K, Ugarenko M, Das S, Wang L, et al. Coordinated regulation of cellular identity—associated H3K4me3 breadth by the COMPASS family. Sci Adv. 2020;6:eaaz4764.
Shah RN, Grzybowski AT, Cornett EM, Johnstone AL, Dickson BM, Boone BA, et al. Examining the roles of H3K4 methylation states with systematically characterized antibodies. Mol Cell. 2018;72:162-177.e7.
van Nuland R, Smits AH, Pallaki P, Jansen PWTC, Vermeulen M, Timmers HTM. Quantitative dissection and stoichiometry determination of the human SET1/MLL histone methyltransferase complexes. Mol Cell Biol. 2013;33:2067–77.
Lin F, Meng X, Guo Y, Cao W, Liu W, Xia Q, et al. Epigenetic initiation of the TH17 differentiation program is promoted by Cxxc finger protein 1. Sci Adv. 2019;5:eaax1608.
Van De Lagemaat LN, Flenley M, Lynch MD, Garrick D, Tomlinson SR, Kranc KR, et al. CpG binding protein (CFP1) occupies open chromatin regions of active genes, including enhancers and non-CpG islands. Epigenet Chromatin. 2018;11:59.
Kiuchi M, Onodera A, Kokubo K, Ichikawa T, Morimoto Y, Kawakami E, et al. The Cxxc1 subunit of the Trithorax complex directs epigenetic licensing of CD4+ T cell differentiation. J Exp Med. 2021;218:e20201690.
Kwon M, Park K, Hyun K, Lee JH, Zhou L, Cho YW, et al. H2B ubiquitylation enhances H3K4 methylation activities of human KMT2 family complexes. Nucleic Acids Res. 2020;48:5442–56.
Davie JR, Murphy LC. Level of ubiquitinated histone H2B in chromatin is coupled to ongoing transcription. Biochemistry. 1990;29:4752–7.
Kinyamu HK, Bennett BD, Bushel PR, Archer TK. Proteasome inhibition creates a chromatin landscape favorable to RNA Pol II processivity. J Biol Chem. 2020;295:1271–87.
Mimnaugh EG, Chen HY, Davie JR, Cells JE, Neckers L. Rapid deubiquitination of nucleosomal histones in human tumor cells caused by proteasome inhibitors and stress response inducers: effects on replication, transcription, translation, and the cellular stress response. Biochemistry. 1997;36:14418–29.
Sun G, Wang C, Wang S, Sun H, Zeng K, Zou R, et al. An H3K4me3 reader, BAP18 as an adaptor of COMPASS-like core subunits co-activates ERα action and associates with the sensitivity of antiestrogen in breast cancer. Nucleic Acids Res. 2020;48:10768–84.
Zhao F, Liu Y, Su X, Lee JEJE, Song Y, Wang D, et al. Molecular basis for histone H3 “k4me3-K9me3/2” methylation pattern readout by Spindlin1. J Biol Chem. 2020;295:16877–87.
Chen M, Chen X, Li S, Pan X, Gong Y, Zheng J, et al. An epigenetic mechanism underlying chromosome 17p deletion-driven tumorigenesis. Cancer Discov. 2020;11:194–207.
Villaseñor R, Pfaendler R, Ambrosi C, Butz S, Giuliani S, Bryan E, et al. ChromID identifies the protein interactome at chromatin marks. Nat Biotechnol. 2020;38:757.
Guérillon C, Larrieu D, Pedeux R. ING1 and ING2: multifaceted tumor suppressor genes. Cell Mol Life Sci. 2013;70:3753–72.
Dantas A, Al Shueili B, Yang Y, Nabbi A, Fink D, Riabowol K. Biological functions of the ING proteins. Cancers (Basel). 2019;11:1817.
Crump NT, Hazzalin CA, Bowers EM, Alani RM, Cole PA, Mahadevan LC. Dynamic acetylation of all lysine-4 trimethylated histone H3 is evolutionarily conserved and mediated by p300/CBP. Proc Natl Acad Sci USA. 2011;108:7814–9.
Khan DH, Gonzalez C, Cooper C, Sun JM, Chen HY, Healy S, et al. RNA-dependent dynamic histone acetylation regulates MCL1 alternative splicing. Nucleic Acids Res. 2014;42:1656–70.
Sha L, Ayoub A, Cho US, Dou Y. Insights on the regulation of the MLL/SET1 family histone methyltransferases. Biochim Biophys Acta Gene Regul Mech. 2020;1863:194561.
Tang Z, Chen WY, Shimada M, Nguyen UTT, Kim J, Sun XJ, et al. SET1 and p300 act synergistically, through coupled histone modifications, in transcriptional activation by p53. Cell. 2013;154:297.
Benayoun BA, Pollina EA, Ucar D, Mahmoudi S, Karra K, Wong ED, et al. H3K4me3 breadth is linked to cell identity and transcriptional consistency. Cell. 2014;158:673–88.
Benayoun BA, Pollina EA, Ucar D, Mahmoudi S, Karra K, Wong ED, et al. Erratum: H3K4me3 breadth is linked to cell identity and transcriptional consistency (Cell (2014) 158:673-688). Cell. 2015; 1281–6.
Dincer A, Gavin DP, Xu K, Zhang B, Dudley JT, Schadt EE, et al. Deciphering H3K4me3 broad domains associated with gene-regulatory networks and conserved epigenomic landscapes in the human brain. Transl Psychiatry. 2015;5:e679–e679.
Pekowska A, Benoukraf T, Ferrier P, Spicuglia S. A unique H3K4me2 profile marks tissue-specific gene regulation. Genome Res. 2010;20:1493–502.
Jahan S, Beacon TH, Xu W, Davie JR. Atypical chromatin structure of immune-related genes expressed in chicken erythrocytes. Biochem Cell Biol. 2020;98:171–7.
Beacon TH, Xu W, Davie JR. Genomic landscape of transcriptionally active histone arginine methylation marks, H3R2me2s and H4R3me2a, relative to nucleosome depleted regions. Gene. 2020;742:144593.
Beacon TH, Delcuve GP, Davie JR. Epigenetic regulation of ACE2, the receptor of the SARS-CoV-2 virus 1. Genome. 2020;64:386–99.
Bartman CR, Hsu SC, Hsiung CCS, Raj A, Blobel GA. Enhancer regulation of transcriptional bursting parameters revealed by forced chromatin looping. Mol Cell. 2016;62:237–47.
Zubek J, Stitzel ML, Ucar D, Plewczynski DM. Computational inference of H3K4me3 and H3K27ac domain length. PeerJ. 2016;2016:e1750.
Kim H, Yoon B-H, Oh CM, Lee JJ, Lee K, Song H, et al. PRMT1 is required for the maintenance of mature β-cell identity. Diabetes. 2020;69:355–68.
Hinohara K, Wu HJ, Vigneau S, McDonald TO, Igarashi KJ, Yamamoto KN, et al. KDM5 histone demethylase activity links cellular transcriptomic heterogeneity to therapeutic resistance. Cancer Cell. 2018;34:939-953.e9.
Josefowicz SZ, Shimada M, Armache A, Li CH, Miller RM, Lin S, et al. Chromatin kinases act on transcription factors and histone tails in regulation of inducible transcription. Mol Cell. 2016;64:347–61.
Gawrzak S, Rinaldi L, Gregorio S, Arenas EJ, Salvador F, Urosevic J, et al. MSK1 regulates luminal cell differentiation and metastatic dormancy in ER + breast cancer. Nat Cell Biol. 2018;20:211–21.
Khan DH, Healy S, He S, Lichtensztejn D, Klewes L, Sharma KL, et al. Mitogen-induced distinct epialleles are phosphorylated at either H3S10 or H3S28, depending on H3K27 acetylation. Mol Biol Cell. 2017;28:817–24.
Walia H, Chen HY, Sun JM, Holth LT, Davie JR. Histone acetylation is required to maintain the unfolded nucleosome structure associated with transcribing DNA. J Biol Chem. 1998;273:14516–22.
Locklear L, Ridsdale AJ, Bazett-Jones DP, Davie JR. Ultrastructure of transcriptionally competent chromatin. Nucleic Acids Res. 1990;18:7015–24.
Weinert BT, Narita T, Satpathy S, Srinivasan B, Hansen BK, Schölz C, et al. Time-resolved analysis reveals rapid dynamics and broad scope of the CBP/p300 acetylome. Cell. 2018;174:231-244.e12.
Zhang DE, Nelson DA. Histone acetylation in chicken erythrocytes. Rates of deacetylation in immature and mature red blood cells. Biochem J. 1988;250:241–5.
Ma L, Gao Z, Wu J, Zhong B, Xie Y, Huang W, et al. Co-condensation between transcription factor and coactivator p300 modulates transcriptional bursting kinetics. Mol Cell. 2021;81:1682-1697.e7.
Li T, Jia L, Cao Y, Chen Q, Li C. OCEAN-C: Mapping hubs of open chromatin interactions across the genome reveals gene regulatory networks. Genome Biol. 2018;19:54.
Jahan S, Beacon TH, Xu W, Davie JR. Atypical chromatin structure of immune-related genes expressed in chicken erythrocytes. Biochem Cell Biol. 2019;98:171–7.
Siggens L, Cordeddu L, Rönnerblad M, Lennartsson A, Ekwall K, Ronnerblad M, et al. Transcription-coupled recruitment of human CHD1 and CHD2 influences chromatin accessibility and histone H3 and H3.3 occupancy at active chromatin regions. Epigenet Chromatin. 2015;8:4.
Park D, Shivram H, Iyer VR. Chd1 co-localizes with early transcription elongation factors independently of H3K36 methylation and releases stalled RNA polymerase II at introns. Epigenet Chromatin. 2014;7:32.
Guzman-Ayala M, Sachs M, Koh FM, Onodera C, Bulut-Karslioglu A, Lin CJ, et al. Chd1 is essential for the high transcriptional output and rapid growth of the mouse epiblast. Development. 2015;142:118–27.
Baumgart SJ, Najafova Z, Hossan T, Xie W, Nagarajan S, Kari V, et al. CHD1 regulates cell fate determination by activation of differentiation-induced genes. Nucleic Acids Res. 2017;45:7722–35.
Konev AY, Tribus M, Sung YP, Podhraski V, Chin YL, Emelyanov AV, et al. CHD1 motor protein is required for deposition of histone variant H3.3 into chromatin in vivo. Science (80-). 2007;317:1087–90.
Mckittrick E, Gafken PR, Ahmad K, Henikoff S. Histone H3.3 is enriched in covalent modifications associated with active chromatin. Proc Natl Acad Sci USA. 2004;101:1525–30.
Mohammad F, Helin K. Oncohistones: drivers of pediatric cancers. Genes Dev. 2017;31:2313–24.
Trovato M, Patil V, Gehre M, Noh KM. Histone variant H3.3 mutations in defining the chromatin function in mammals. Cells. 2020;9:2716.
Ridsdale JA, Rattner JB, Davie JR. Erythroid-specific gene chromatin has an altered association with linker histones. Nucleic Acids Res. 1988;16:5915–26.
Pang B, Qiao X, Janssen L, Velds A, Groothuis T, Kerkhoven R, et al. Drug-induced histone eviction from open chromatin contributes to the chemotherapeutic effects of doxorubicin. Nat Commun. 2013. https://doi.org/10.1038/ncomms2921.
Yang F, Kemp CJ, Henikoff S. Doxorubicin enhances nucleosome turnover around promoters. Curr Biol. 2013;23:782–7.
van der Zanden SY, Qiao X, Neefjes J. New insights into the activities and toxicities of the old anticancer drug doxorubicin. FEBS J. 2020. https://doi.org/10.1111/febs.15583.
Gopi LK, Kidder BL. Integrative pan cancer analysis reveals epigenomic variation in cancer type and cell specific chromatin domains. Nat Commun. 2021. https://doi.org/10.1038/s41467-021-21707-1.
Belhocine M, Simonin M, Abad Flores JD, Cieslak A, Manosalva I, Pradel L, et al. Dynamic of broad H3K4me3 domains uncover an epigenetic switch between cell identity and cancer-related genes. Genome Res. 2021. https://doi.org/10.1101/gr.266924.120.
Beacon TH. Characterization of nucleosome-depleted regions of transcriptionally active chromatin in chicken and human genomes. 2020.
Adler HT, Chinery R, Wu DY, Kussick SJ, Payne JM, Fornace AJ, et al. Leukemic HRX fusion proteins inhibit GADD34-induced apoptosis and associate with the GADD34 and hSNF5/INI1 proteins. Mol Cell Biol. 1999;19:7050–60.
Wang J, Wang Y, Xing P, Liu Q, Zhang C, Sui Y, et al. Development and validation of a hypoxia-related prognostic signature for breast cancer. Oncol Lett. 2020;20:1906–14.
Davis CA, Hitz BC, Sloan CA, Chan ET, Davidson JM, Gabdank I, et al. The Encyclopedia of DNA elements (ENCODE): data portal update. Nucleic Acids Res. 2018;46:D794–801.
Sharma B, Joshi S, Sassano A, Majchrzak B, Kaur S, Aggarwal P, et al. Sprouty proteins are negative regulators of interferon (IFN) signaling and IFN-inducible biological responses. J Biol Chem. 2012;287:42352–60.
Jing H, Liaw L, Friesel R, Vary C, Hua S, Yang X. Suppression of Spry4 enhances cancer stem cell properties of human MDA-MB-231 breast carcinoma cells. Cancer Cell Int. 2016. https://doi.org/10.1186/s12935-016-0292-7.
Al Emran A, Marzese DM, Menon DR, Hammerlindl H, Ahmed F, Richtig E, et al. Commonly integrated epigenetic modifications of differentially expressed genes lead to adaptive resistance in cancer. Epigenomics. 2019;11:723–37.
Takaku M, Grimm SA, De Kumar B, Bennett BD, Wade PA. Cancer-specific mutation of GATA3 disrupts the transcriptional regulatory network governed by Estrogen Receptor alpha, FOXA1 and GATA3. Nucleic Acids Res. 2020;48:4756–68.
Collins BE, Sweatt JD, Greer CB. Broad domains of histone 3 lysine 4 trimethylation are associated with transcriptional activation in CA1 neurons of the hippocampus during memory formation. Neurobiol Learn Mem. 2019;161:149–57.
Cao F, Fang Y, Tan HK, Goh Y, Choy JYH, Koh BTH, et al. Super-enhancers and broad h3k4me3 domains form complex gene regulatory circuits involving chromatin interactions. Sci Rep. 2017;7:2186.
Thibodeau A, Márquez EJ, Shin DG, Vera-Licona P, Ucar D. Chromatin interaction networks revealed unique connectivity patterns of broad H3K4me3 domains and super enhancers in 3D chromatin. Sci Rep. 2017. https://doi.org/10.1038/s41598-017-14389-7.
Whyte WA, Orlando DA, Hnisz D, Abraham BJ, Lin CY, Kagey MH, et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell. 2013;153:307–19.
Quevedo M, Meert L, Dekker MR, Dekkers DHWW, Brandsma JH, van den Berg DLCC, et al. Mediator complex interaction partners organize the transcriptional network that defines neural stem cells. Nat Commun. 2019;10:2669.
Gurumurthy A, Shen Y, Gunn EM, Bungert J. Phase separation and transcription regulation: are super-enhancers and locus control regions primary sites of transcription complex assembly? BioEssays. 2019;41:e1800164.
Yue J, Lai F, Beckedorff F, Zhang A, Pastori C, Shiekhattar R. Integrator orchestrates RAS/ERK1/2 signaling transcriptional programs. Genes Dev. 2017;31:1809–20.
Kirstein N, Gomes Dos Santos H, Blumenthal E, Shiekhattar R. The Integrator complex at the crossroad of coding and noncoding RNA. Curr Opin Cell Biol. 2021;70:37–43.
Boulikas T, Bastin B, Boulikas P, Dupuis G. Increase in histone poly(ADP-ribosylation) in mitogen-activated lymphoid cells. Exp Cell Res. 1990;187:77–84.
Benabdallah NS, Williamson I, Illingworth RS, Kane L, Boyle S, Sengupta D, et al. Decreased enhancer-promoter proximity accompanying enhancer activation. Mol Cell. 2019;76:473-484.e7.
Baranello L, Wojtowicz D, Cui K, Devaiah BN, Chung HJ, Chan-Salis KY, et al. RNA Polymerase II regulates topoisomerase 1 activity to favor efficient transcription. Cell. 2016;165:357–71.
Ju BG, Lunyak VV, Perissi V, Garcia-Bassets I, Rose DW, Glass CK, et al. A topoisomerase IIβ-mediated dsDNA break required for regulated transcription. Science (80-). 2006;312:1798–802.
Liu Y, Guo B, Aguilera-Jimenez E, Chu VS, Zhou J, Wu Z, et al. Chromatin looping shapes KLF5-dependent transcriptional programs in human epithelial cancers. Cancer Res. 2020;80:5464–77.
Quevedo M, Meert L, Dekker MR, Dekkers DHW, Brandsma JH, van den Berg DLC, et al. Publisher Correction: Mediator complex interaction partners organize the transcriptional network that defines neural stem cells. Nat Commun. 2019;10(1):2669. https://doi.org/10.1038/s41467-019-10502-8.
Edelman LB, Fraser P. Transcription factories: genetic programming in three dimensions. Curr Opin Genet Dev. 2012;22:110–4.
Eskiw CH, Fraser P. Ultrastructural study of transcription factories in mouse erythroblasts. J Cell Sci. 2011;124:3676–83.
Maass PG, Barutcu AR, Rinn JL. Interchromosomal interactions: a genomic love story of kissing chromosomes. J Cell Biol. 2019;218:27–38.
Zamudio AV, Dall’Agnese A, Henninger JE, Manteiga JC, Afeyan LK, Hannett NM, et al. Mediator condensates localize signaling factors to key cell identity genes. Mol Cell. 2019;76:753-766.e6.
Huang S, Litt M, Felsenfeld G. Methylation of histone H4 by arginine methyltransferase PRMT1 is essential in vivo for many subsequent histone modifications. Genes Dev. 2005;19:1885–93.
Huang S, Li X, Yusufzai TM, Qiu Y, Felsenfeld G. USF1 recruits histone modification complexes and is critical for maintenance of a chromatin barrier. Mol Cell Biol. 2007;27:7991–8002.
Li X, Hu X, Patel B, Zhou Z, Liang S, Ybarra R, et al. H4R3 methylation facilitates β-globin transcription by regulating histone acetyltransferase binding and H3 acetylation. Blood. 2010;115:2028–37.
Zhu L, He X, Dong H, Sun J, Wang H, Zhu Y, et al. Protein arginine methyltransferase 1 is required for maintenance of normal adult hematopoiesis. Int J Biol Sci. 2019;15:2763–73.
Martin BJE, Brind’Amour J, Kuzmin A, Jensen KN, Liu ZC, Lorincz M, et al. Transcription shapes genome-wide histone acetylation patterns. Nat Commun. 2021;12:1–9.
Dorighi KM, Swigut T, Henriques T, Bhanu NV, Scruggs BS, Nady N, et al. Mll3 and Mll4 facilitate enhancer RNA synthesis and transcription from promoters independently of H3K4 monomethylation. Mol Cell. 2017;66:568-576.e4.
Rada-Iglesias A. Is H3K4me1 at enhancers correlative or causative? Nat Genet. 2018;50:4–5. https://doi.org/10.1038/s41588-017-0018-3.
Sabari BR, Dall’Agnese A, Boija A, Klein IA, Coffey EL, Shrinivas K, et al. Coactivator condensation at super-enhancers links phase separation and gene control. Science. 2018;361:379.
Carroll TS, Liang Z, Salama R, Stark R, de Santiago I. Impact of artifact removal on ChIP quality metrics in ChIP-seq and ChIP-exo data. Front Genet. 2014;5:75.
Wimberley CE, Heber S. PeakPass: automating ChIP-Seq blacklist creation. J Comput Biol. 2020;27:259–68.
Tian S, Peng S, Kalmbach M, Gaonkar KS, Bhagwate A, Ding W, et al. Identification of factors associated with duplicate rate in ChIP-seq data. PLoS ONE. 2019;14:e0214723.
Parekh S, Ziegenhain C, Vieth B, Enard W, Hellmann I. The impact of amplification on differential expression analyses by RNA-seq. Sci Rep. 2016. https://doi.org/10.1038/srep25533.
Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 2008;9:R137.
Yoon B-J. Hidden Markov models and their applications in biological sequence analysis. Curr Genomics. 2009;10:402–15.
Starmer J, Magnuson T. Detecting broad domains and narrow peaks in ChIP-seq data with hiddenDomains. BMC Bioinform. 2016;17:1–10.
Wilbanks EG, Facciotti MT. Evaluation of algorithm performance in ChIP-seq peak detection. PLoS ONE. 2010;5:e11471.
Wang J, Lunyak VV, King JI. BroadPeak: A novel algorithm for identifying broad peaks in diffuse ChIP-seq datasets. Bioinformatics. 2013;29:492–3.
Rashid NU, Giresi PG, Ibrahim JG, Sun W, Lieb JD. ZINBA integrates local covariates with DNA-seq data to identify broad and narrow regions of enrichment, even within amplified genomic regions. Genome Biol. 2011;12:1–20.
Song Q, Smith AD. Identifying dispersed epigenomic domains from ChIP-Seq data. Bioinformatics. 2011;27:870–1.
Heinig M, Colomé-Tatché M, Taudt A, Rintisch C, Schafer S, Pravenec M, et al. histoneHMM: differential analysis of histone modifications with broad genomic footprints. BMC Bioinform. 2015;16:60.
Zang C, Schones DE, Zeng C, Cui K, Zhao K, Peng W. Data and text mining A clustering approach for identification of enriched domains from histone modification ChIP-Seq data. Bioinformatics. 2009;25:1952–8.
Ernst J, Kellis M. Chromatin-state discovery and genome annotation with ChromHMM. Nat Protoc. 2017;12:2478–92.
Hnisz D, Abraham BJ, Lee TI, Lau A, Saint-Andre V, Sigova AA, et al. Super-enhancers in the control of cell identity and disease. Cell. 2013;155:934–47.
Patten DK, Corleone G, Győrffy B, Perone Y, Slaven N, Barozzi I, et al. Enhancer mapping uncovers phenotypic heterogeneity and evolution in patients with luminal breast cancer. Nat Med. 2018;24:1469–80.
Ku WL, Pan L, Cao Y, Gao W, Zhao K. Profiling single-cell histone modifications using indexing chromatin immunocleavage sequencing. Genome Res. 2021. https://doi.org/10.1101/gr.260893.120.
Nabilsi NH, Deleyrolle LP, Darst RP, Riva A, Reynolds BA, Kladde MP. Multiplex mapping of chromatin accessibility and DNA methylation within targeted single molecules identifies epigenetic heterogeneity in neural stem cells and glioblastoma. Genome Res. 2013;24(2):329–39. https://doi.org/10.1101/gr.161737.113.
Darst RP, Pardo CE, Pondugula S, Gangaraju VK, Nabilsi NH, Bartholomew B, et al. Simultaneous single-molecule detection of endogenous C-5 DNA methylation and chromatin accessibility using MAPit. Methods Mol Biol. 2012;833:125–41.
Stees JR, Hossain MA, Sunose T, Kudo Y, Pardo CE, Nabilsi NH, et al. High fractional occupancy of a tandem MARE and its role in long-range β-globin gene regulation. Mol Cell Biol. 2015;36:238–50.
Stergachis AB, Debo BM, Haugen E, Churchman LS, Stamatoyannopoulos JA. Single-molecule regulatory architectures captured by chromatin fiber sequencing. Science (80-). 2020;368:1449–54.
Gerrard DL, Boyd JR, Stein GS, Jin VX, Frietze S. Disruption of broad epigenetic domains in PDAC cells by HAT inhibitors. Epigenomes. 2019;3:11.
Kelly AD, Issa JPJ. The promise of epigenetic therapy: reprogramming the cancer epigenome. Curr Opin Genet Dev. 2017;42:68–77.
Murano C, Binda A, Palestini P, Baruscotti M, DiFrancesco JC, Rivolta I. Minireview: effect of the ketogenic diet in excitable tissues. Am J Physiol Physiol. 2021. https://doi.org/10.1152/ajpcell.00458.2020.
Roberti A, Fernández AF, Fraga MF. Nicotinamide N-methyltransferase: at the crossroads between cellular metabolism and epigenetic regulation. Mol Metab. 2021;45:101165.
Aprelikova O, Chen K, El Touny LH, Brignatz-Guittard C, Han J, Qiu T, et al. The epigenetic modifier JMJD6 is amplified in mammary tumors and cooperates with c-Myc to enhance cellular transformation, tumor progression, and metastasis. Clin Epigenet. 2016;8:1–16.
Pennacchio LA, Bickmore W, Dean A, Nobrega MA, Bejerano G. Enhancers: five essential questions. Nat Rev Genet. 2013;14:288–95.
Sengupta S, George RE. Super-enhancer-driven transcriptional dependencies in cancer. Trends Cancer. 2017;3:269–81.
Moorthy SD, Davidson S, Shchuka VM, Singh G, Malek-Gilani N, Langroudi L, et al. Enhancers and super-enhancers have an equivalent regulatory role in embryonic stem cells through regulation of single or multiple genes. Genome Res. 2017;27:246–58.
We acknowledge that the CancerCare Manitoba Research Institute and the University of Manitoba campuses are located on original lands of Anishinaabeg, Cree, Oji-Cree, Dakota and Dene peoples, and on the homeland of the Métis Nation.
Research was supported by grant from Natural Sciences and Engineering Research Council of Canada (RGPIN-2017-05927) and CancerCare Manitoba Foundation (761020318) to JRD. Additional funding was provided by the National Institutes of Health, USA (AR049069), to AJvW, and ANID/CONICYT - FONDECYT de Iniciación 11190998 to GN.
Ethics approval and consent to participate
Consent for publication
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Broad H3K4me3 (or H3K27ac) domain: H3K4me3 (or H3K27ac) ChIP-seq peaks which, when ranked by size, constitute the top 5% broadest H3K4me3 (or H3K27ac) domains. Each peak is attributed to the gene with the closest transcription start site . It should be noted that broad domains identified in the literature as broad H3K4me3 domains are also characterized by the widely spread presence of many other marks [9, 47]. However, this broad distribution of other marks has been overlooked in the literature.
Cell identity genes: Development regulator genes that are specific to each cell type. Their expression often serves as crucial biomarker for diseased and healthy cells.
Enhancer: “Enhancers are classically defined as cis-acting DNA sequences that can increase the transcription of genes. They generally function independently of orientation and at various distances from their target promoter(s)” . An active enhancer binds multiple transcription factors, which recruit the Mediator complex, facilitating the interaction of the loaded enhancer to the cognate promoter via the RNA polymerase II and the transcription machinery. This process is called looping . As enhancers can be located tens to hundreds of kb away from the gene(s) they regulate, the identification of enhancers and their target gene(s) can be challenging. Features generally exploited to identify putative enhancers are DNase I hypersensitivity, H3K4me1 and H3K27ac marks as well as association of CBP/p300, which acetylates H3K27 among other targets . It must be noted that few of the predicted super-enhancers have been functionally investigated by deletion .
Super-enhancer: Cluster of enhancers [88, 138]. It is generally believed that “typical” enhancers are involved in the transcription of housekeeping genes, while super-enhancers only regulate cell-type-specific genes that confer cell identity . However, it has been argued that an isolated enhancer can achieve a robust transcriptional output, while a super-enhancer acts in a partially redundant manner to fine tune the transcriptional output of its cognate gene(s) .
About this article
Cite this article
Beacon, T.H., Delcuve, G.P., López, C. et al. The dynamic broad epigenetic (H3K4me3, H3K27ac) domain as a mark of essential genes. Clin Epigenet 13, 138 (2021). https://doi.org/10.1186/s13148-021-01126-1